Inference Catalog

70 items
Engine: TEI = Accelerated Text Embeddings Inference; TGI = Accelerated Text Generation Inference; - = none listed

Task | Engine | Hardware | Price
Sentence Embeddings | TEI | CPU 1x Intel Sapphire Rapids | $0.033 / hour
Sentence Embeddings | TEI | CPU 2x Intel Sapphire Rapids | $0.067 / hour
Text-to-Image | - | GPU 1x Nvidia L4 | $0.8 / hour
Text-to-Image | - | GPU 1x Nvidia L4 | $0.8 / hour
Zero Shot Classification | - | GPU 1x Nvidia T4 | $0.5 / hour
Sentence Embeddings | TEI | CPU 4x Intel Sapphire Rapids | $0.134 / hour
Sentence Embeddings | TEI | GPU 1x Nvidia L4 | $0.8 / hour
Zero Shot Classification | - | GPU 1x Nvidia T4 | $0.5 / hour
Sentence Ranking | TEI | GPU 1x Nvidia L4 | $0.8 / hour
Zero Shot Classification | - | GPU 1x Nvidia T4 | $0.5 / hour
Automatic Speech Recognition | - | GPU 1x Nvidia L4 | $0.8 / hour
Automatic Speech Recognition | - | GPU 1x Nvidia L4 | $0.8 / hour
Sentence Embeddings | TEI | CPU 4x Intel Sapphire Rapids | $0.134 / hour
Text Generation | TGI | GPU 2x Nvidia A100 | $8 / hour
Text-to-Image | - | GPU 1x Nvidia L40S | $1.8 / hour
Text Generation | TGI | GPU 4x Nvidia L4 | $3.8 / hour
Text Generation | TGI | GPU 4x Nvidia L4 | $3.8 / hour
Text Generation | - | CPU 4x Intel Sapphire Rapids | $0.134 / hour
Text Generation | - | CPU 4x Intel Sapphire Rapids | $0.134 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Sentence Embeddings | TEI | CPU 2x Intel Sapphire Rapids | $0.067 / hour
Sentence Embeddings | TEI | CPU 2x Intel Sapphire Rapids | $0.067 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 2x Nvidia A100 | $8 / hour
Text Generation | - | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | - | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | - | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 4x Nvidia L40S | $8.3 / hour
Text Generation | TGI | GPU 4x Nvidia L40S | $8.3 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 4x Nvidia L40S | $8.3 / hour
Text Generation | TGI | GPU 4x Nvidia L40S | $8.3 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Image-Text-to-Text | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia T4 | $0.5 / hour
Text Generation | TGI | GPU 1x Nvidia T4 | $0.5 / hour
Text Generation | TGI | GPU 1x Nvidia T4 | $0.5 / hour
Text Generation | TGI | GPU 1x Nvidia T4 | $0.5 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L40S | $1.8 / hour
Text Generation | TGI | GPU 2x Nvidia A100 | $8 / hour
Sentence Ranking | - | CPU 1x Intel Sapphire Rapids | $0.033 / hour
Sentence Embeddings | TEI | GPU 1x Nvidia T4 | $0.5 / hour
Sentence Embeddings | TEI | GPU 1x Nvidia T4 | $0.5 / hour
Sentence Embeddings | TEI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L40S | $1.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text-to-Image | - | GPU 1x Nvidia L4 | $0.8 / hour
Sentence Embeddings | TEI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 4x Nvidia L40S | $8.3 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Text-to-Image | - | GPU 1x Nvidia T4 | $0.5 / hour
Text-to-Image | - | GPU 1x Nvidia L4 | $0.8 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour
Automatic Speech Recognition | - | GPU 1x Nvidia T4 | $0.5 / hour
Automatic Speech Recognition | - | GPU 1x Nvidia T4 | $0.5 / hour
Text Generation | TGI | GPU 1x Nvidia L4 | $0.8 / hour