NOTICE We rebranded from!
NEW Updated to Awanllm-Llama-3-8B-Dolfin-v0.6!

Awan LLM

High-Throughput LLM API Platform for Power Users and Developers

Stop paying per token for LLMs.

Unlimited tokens. Parallel requests. Pay per month.


Ask for help as much as you want with an AI assistant powered by the Awan LLM API.

AI Agents

Let your agents run wild working on something big without worrying about token usage.


Go on grand adventures with your AI companions without being censored or counting your tokens.

Data Processing

Process immense amounts of data quickly, without limits.

Code Completion

Write more code faster and better with limitless code completions.


Make your AI-powered applications profitable by eliminating your token costs.

Frequently asked questions

How can you provide unlimited token generation?

Unlike other API providers, we own our own datacenters and GPUs.

How do I use this?

Check out our Docs page to see how easy it is to use our API endpoints.

How do I contact Awan LLM support?

You can contact us by clicking the contact button at the top of the page.

Do you keep logs of prompts and generations?

No. We do not log any prompts or generations, as explained on our Privacy Policy page.

Why is Awan LLM better than other LLM API providers?

We provide unlimited token generation with unlimited parallel requests for batching, and we have generous request limits even on the free tier.
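
As a rough illustration of what batching with parallel requests can look like on the client side, the Python sketch below fans a list of prompts out over a thread pool. The `send_prompt` function is a hypothetical stand-in, not the Awan LLM API; in real use its body would be replaced with an HTTP call that follows the Docs page. The names `run_batch` and `max_workers=8` are likewise assumptions made for this example.

```python
from concurrent.futures import ThreadPoolExecutor


def send_prompt(prompt: str) -> str:
    """Hypothetical placeholder for a single API request.

    Assumption: a real implementation would make an HTTP call here,
    following the Docs page. This stub only echoes the prompt.
    """
    return f"response to: {prompt}"


def run_batch(prompts, max_workers=8, send=send_prompt):
    """Send all prompts concurrently and return results in input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map preserves the order of the input iterable.
        return list(pool.map(send, prompts))


if __name__ == "__main__":
    for line in run_batch(["Summarize A", "Summarize B", "Summarize C"]):
        print(line)
```

Threads are a reasonable fit here because each request spends most of its time waiting on the network. The worker count is an arbitrary choice and should stay within the request limits listed on the Pricing page.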

Is there a hidden limit imposed?

All limits are clearly explained on our Pricing page.

Why use Awan LLM API instead of self-hosting LLMs?

It will cost you significantly less to use our API than to rent GPUs in the cloud or pay for electricity to run your own GPUs.

What if I want to use a model that's not here?

If a model you want to use is not on our Models page, you can contact us to request that we add it.