Hugging Face
The central hub for open-source ML models, datasets, and Spaces (hosted demo apps). Offers the Inference API, Inference Endpoints, and the Transformers library for running models.
Fal.ai
Developer API platform for running image, video, and audio models (such as Flux, SDXL, and Whisper) at low latency. Popular as a serverless GPU layer for multimodal AI apps, with Python and JavaScript SDKs and pay-per-use pricing.