Procyon® AI Text Generation Benchmark

Simplifying local LLM AI performance testing

Testing LLM AI performance can be complicated and time-consuming: full AI models require large amounts of storage space and bandwidth to download, and variables such as quantization, model conversion, and variation in input tokens can reduce a test's reliability if not configured correctly.

The Procyon AI Text Generation Benchmark provides a more compact and easier way to repeatedly and consistently test AI performance with multiple LLM AI models. We worked closely with many AI software and hardware leaders to ensure our benchmark tests take full advantage of the local AI accelerator hardware in your systems.


Example prompt from the benchmark. Prompt 7 (RAG query): "How can benchmarking save time and money for my organization? How to choose a reference benchmark score for RFPs? Summarize how to efficiently test the performance of PCs for Enterprise IT. Answer based on the context provided."

Results and insights


Built with input from industry leaders

  • Built with input from leading AI vendors to take full advantage of next-generation local AI accelerator hardware.
  • Seven prompts simulating multiple real-world use cases, with both RAG (Retrieval-Augmented Generation) and non-RAG queries (illustrated in the sketch after this list).
  • Designed to run consistent, repeatable workloads, minimizing common AI LLM workload variables.
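To make the RAG/non-RAG distinction concrete, here is a minimal, illustrative sketch in Python. The generate and retrieve helpers are hypothetical stand-ins, not the benchmark's internal pipeline; only the overall pattern is the point.

    # Illustrative sketch: non-RAG vs. RAG queries. The helpers below are
    # hypothetical stand-ins, not the benchmark's internal pipeline.

    def generate(prompt: str) -> str:
        # Placeholder for a call into a local LLM inference engine.
        return f"<model answer to: {prompt[:40]}...>"

    def retrieve(query: str, documents: list[str], top_k: int = 3) -> list[str]:
        # Toy keyword retriever: rank documents by word overlap with the query.
        words = set(query.lower().split())
        ranked = sorted(documents,
                        key=lambda d: len(words & set(d.lower().split())),
                        reverse=True)
        return ranked[:top_k]

    def non_rag_query(question: str) -> str:
        # The model answers from its trained weights alone.
        return generate(question)

    def rag_query(question: str, documents: list[str]) -> str:
        # RAG: fetch relevant context first, then ask the model to answer
        # "based on the context provided", as in Prompt 7 above.
        context = "\n".join(retrieve(question, documents))
        return generate(f"Context:\n{context}\n\nQuestion: {question}")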

Detailed Results

  • Get in-depth reporting on how system resources are used during AI workloads.
  • Reduced install size compared with testing full AI models.
  • Easily compare results between devices to help identify the best systems for your use cases.

AI Testing Simplified

  • Easily and quickly test using four industry-standard AI models of varying parameter sizes.
  • Get a real-time view of responses being generated during the benchmark.
  • Test with all supported inference engines in one click, or configure runs based on your preference.

Developed with industry expertise


Procyon benchmarks are designed for industry, enterprise, and press use, with tests and features created specifically for professional users. The Procyon AI Text Generation Benchmark was designed and developed with industry partners through the UL Benchmark Development Program (BDP). The BDP is an initiative from UL Solutions that aims to create relevant and impartial benchmarks by working in close cooperation with program members.

Inference Engine Performance

With the Procyon AI Text Generation Benchmark, you can measure the performance of dedicated AI processing hardware and verify inference engine implementation quality with tests based on a demanding AI text generation workload.
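The benchmark's exact scoring formula is not described here, but text generation performance is commonly characterized by time to first token (responsiveness) and output tokens per second (sustained throughput). Below is a minimal sketch of how such measurements can be taken, assuming a hypothetical streaming generate_stream callable that yields tokens one at a time.

    import time

    def measure_generation(generate_stream, prompt: str) -> dict:
        # Time a streaming generation call. generate_stream is assumed to be
        # any callable that yields output tokens as they are produced.
        start = time.perf_counter()
        first_token_at = None
        n_tokens = 0
        for _token in generate_stream(prompt):
            if first_token_at is None:
                first_token_at = time.perf_counter()
            n_tokens += 1
        end = time.perf_counter()
        if n_tokens == 0:
            return {"time_to_first_token_s": None, "output_tokens_per_s": 0.0}
        # Throughput over the generation phase, excluding the first token.
        rate = (n_tokens - 1) / (end - first_token_at) if n_tokens > 1 else 0.0
        return {"time_to_first_token_s": first_token_at - start,
                "output_tokens_per_s": rate}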

Designed for Professionals

We created the Procyon AI Text Generation Benchmark for engineering teams who need an independent, standardized tool for assessing the AI performance of inference engine implementations and dedicated hardware.

Fast and easy to use

The benchmark is easy to install and run; no complicated configuration is required. Run the benchmark from the Procyon application or from the command line. View benchmark scores and charts, or export detailed result files for further analysis.
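For scripted runs, the command-line interface can be driven from automation. The executable path, flags, and file names below are illustrative assumptions, not documented Procyon syntax; consult the Procyon user guide for the actual options.

    import subprocess

    # HYPOTHETICAL invocation for automated testing. The install path, flags,
    # and file names are assumptions for illustration only; see the Procyon
    # user guide for the real command-line syntax.
    result = subprocess.run(
        [r"C:\Program Files\UL\Procyon\ProcyonCmd.exe",  # assumed path
         "--definition", "ai_textgeneration.def",        # assumed flag/file
         "--export-detailed", "results.xml"],            # assumed flag/file
        capture_output=True, text=True, check=False,
    )
    print(result.stdout)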

Procyon AI Text Generation Benchmark

Free trial

Request trial

Site license

Get quote
  • Annual site license for Procyon AI Text Generation Benchmark.
  • Unlimited number of users.
  • Unlimited number of devices.
  • Priority support via email and telephone.

Benchmark Development Program

Contact us


The Benchmark Development Program™ is an initiative from UL Solutions for building partnerships with technology companies.

OEMs, ODMs, component manufacturers and their suppliers are invited to join us in developing new AI processing benchmarks. Please contact us for details.

System requirements

All ONNX models

Storage: 18.25 GB

All OpenVINO models

Storage: 15.45 GB

Phi-3.5-mini

ONNX with DirectML
  • 6 GB VRAM (discrete GPU)
  • 16 GB system RAM (iGPU)
  • Storage: 2.15 GB
Intel OpenVINO
  • 4 GB VRAM (discrete GPU)
  • 16 GB system RAM (iGPU)
  • Storage: 1.84 GB

Llama-3.1-8B

ONNX with DirectML
  • 8 GB VRAM (discrete GPU)
  • 32 GB system RAM (iGPU)
  • Storage: 5.37 GB
Intel OpenVINO
  • 8 GB VRAM (discrete GPU)
  • 32 GB system RAM (iGPU)
  • Storage: 3.88 GB

Mistral-7B

ONNX with DirectML
  • 8 GB VRAM (discrete GPU)
  • 32 GB system RAM (iGPU)
  • Storage: 3.69 GB
Intel OpenVINO
  • 8 GB VRAM (discrete GPU)
  • 32 GB system RAM (iGPU)
  • Storage: 3.48 GB

Llama-2-13B

ONNX with DirectML
  • 12 GB VRAM (discrete GPU)
  • 32 GB system RAM (iGPU)
  • Storage: 7.04 GB
Intel OpenVINO
  • 10 GB VRAM (discrete GPU)
  • 32 GB system RAM (iGPU)
  • Storage: 6.25 GB
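The minimums above can be encoded for quick pre-flight checks when planning a test fleet. The figures are taken directly from the lists above; the helper itself is an illustrative sketch, not part of the benchmark.

    # Minimum requirements per model and engine, as listed above.
    # Tuples: (VRAM GB for discrete GPU, system RAM GB for iGPU, storage GB).
    REQUIREMENTS = {
        "Phi-3.5-mini": {"ONNX+DirectML": (6, 16, 2.15),  "OpenVINO": (4, 16, 1.84)},
        "Llama-3.1-8B": {"ONNX+DirectML": (8, 32, 5.37),  "OpenVINO": (8, 32, 3.88)},
        "Mistral-7B":   {"ONNX+DirectML": (8, 32, 3.69),  "OpenVINO": (8, 32, 3.48)},
        "Llama-2-13B":  {"ONNX+DirectML": (12, 32, 7.04), "OpenVINO": (10, 32, 6.25)},
    }

    def can_run(model: str, engine: str, vram_gb: float, free_storage_gb: float) -> bool:
        # Check a discrete-GPU system against the minimums listed above.
        vram_needed, _igpu_ram, storage_needed = REQUIREMENTS[model][engine]
        return vram_gb >= vram_needed and free_storage_gb >= storage_needed

    print(can_run("Llama-3.1-8B", "OpenVINO", vram_gb=8, free_storage_gb=50))  # True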

Support

Latest version: 1.0.73.0 | December 9, 2024

Languages

  • English
  • German
  • Japanese
  • Portuguese (Brazil)
  • Simplified Chinese
  • Spanish