Inference Catalog
Search
Inference Task
Price $ 0 - 50 / hour
- 0
- 0.1
- 0.5
- 1
- 5
- 50
Hardware Accelerator
Inference Server
License
Hub Models
Browse All Models70 items
Sentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 1x Intel Sapphire Rapids
$ 0.033
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 2x Intel Sapphire Rapids
$ 0.067
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 4x Intel Sapphire Rapids
$ 0.134
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8
/ hourSentence Ranking
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8
/ hourZero Shot Classification
GPU 1x Nvidia T4
$ 0.5
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 4x Intel Sapphire Rapids
$ 0.134
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L4
$ 3.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 2x Intel Sapphire Rapids
$ 0.067
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
CPU 2x Intel Sapphire Rapids
$ 0.067
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourImage-Text-to-Text
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia T4
$ 0.5
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia T4
$ 0.5
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia T4
$ 0.5
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia T4
$ 0.5
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 2x Nvidia A100
$ 8
/ hourSentence Ranking
CPU 1x Intel Sapphire Rapids
$ 0.033
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia T4
$ 0.5
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L40S
$ 1.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourSentence Embeddings
TEI
Accelerated Text Embeddings Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 4x Nvidia L40S
$ 8.3
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hourText Generation
TGI
Accelerated Text Generation Inference
GPU 1x Nvidia L4
$ 0.8
/ hour