I can call it and get some embeddings, but after around 3000 or so embeddings I get hit with a limit and receive a 402 “Payment Required” exception. This surprised me since we do have a billing method.
Then I looked into it a bit more and saw that “Inference Usage” has a max limit of $0 per month unless you have a team/entreprise account. So that means that you can’t pay per usage at all as a company until you set that up? Am I understading this correctly?
Hi @lrizzellotaskbase Thanks for posting! Upgrading your org to Team or Enterprise for Inference Providers usage comes with many perks:
Your organization has a pool of $2 of included usage per seat, shared among org members
Usage past those included credits is billed on top of the subscription (pay-as-you-go)
Organization admins can enable/disable usage of Inference Providers and set a spending limit (on top of included credits)
Team & Enterprise orgs have a dedicated Inference Providers dashboard, offering full visibility into team usage across our serverless inference partners
Thanks for the reply, but that still leaves my main question open: Is it possible to use huggingface’s pay-per-use inference (more specifically for Qwen Embedding 8B) as a company without having to upgrade to team or entreprise?