Developers Learn Company

Chat with and directly compare LLM endpoints

Compare LLM endpoints with live performance benchmarks

Learn how to use the Unify API

Read about LLM deployment infrastructure

Stay up to date with the latest in AI

Join our discussions around cuttin-edge AI research

Dive deep with us into the AI landscape

Join our team and let’s Unify AI!

Reach out to our team

Privacy & Cookies

How we treat your navigation data

Terms Of Service

General requirements for using our Service

Follow us through our social accounts:

Chat with and directly compare LLM endpoints

Compare LLM endpoints with live performance benchmarks

Learn how to use the Unify API

Read about LLM deployment infrastructure

Stay up to date with the latest in AI

Join our discussions around cuttin-edge AI research

Dive deep with us into the AI landscape

Join our team and let’s Unify AI!

Reach out to our team

Privacy & Cookies

How we treat your navigation data

Terms Of Service

General requirements for using our Service

Follow us through our social accounts:

Back to Benchmarks

llama-2-13b-chat

text-generation

Uploaded: 09.01.2024

⏱️ Benchmarks

✨ Query this model

Developers

Chat Benchmarks Documentation

Learn

Blog Newsletter Paper Readings Talks

Socials

Discord LinkedIn Medium Twitter YouTube

Company

Careers Contact Privacy Terms Of Service

Region:

Hong Kong Belgium Iowa

Seq Length:

Providers

aws-bedrock

octoai

together-ai

anyscale

deepinfra

lepton-ai

replicate

fireworks-ai

Learn more about how we are collecting this data here

Output Tks / Sec

_{P90}

_{P90}

_{P90}

_{P90}

0.75 $/1M tks

1 $/1M tks

35.81 tks/sec

774.78 ms

27.93 ms

13341.27 ms

0 sec

0.2 $/1M tks

0.5 $/1M tks

52.53 tks/sec

818.62 ms

19.04 ms

3978.72 ms

0 sec

0.23 $/1M tks

0.23 $/1M tks

59.86 tks/sec

424.22 ms

16.71 ms

2796.42 ms

0 sec

0.25 $/1M tks

0.25 $/1M tks

65.15 tks/sec

893.87 ms

15.35 ms

4086.43 ms

0 sec

0.22 $/1M tks

0.22 $/1M tks

68.51 tks/sec

696.51 ms

14.6 ms

2550.27 ms

0 sec

0.3 $/1M tks

0.3 $/1M tks

102.2 tks/sec

1412.24 ms

9.78 ms

3437.72 ms

0 sec

0.1 $/1M tks

0.5 $/1M tks

103.45 tks/sec

870.22 ms

9.67 ms

2039.85 ms

0 sec

0.2 $/1M tks

0.2 $/1M tks

163.94 tks/sec

523.89 ms

6.1 ms

1359.55 ms

0 sec