Fal.ai
Developer API platform for running image, video, and audio generation models (Flux, SDXL, Whisper, and more) at low latency. Popular as a serverless GPU layer for multimodal AI apps, with a clean Python/JS SDK and pay-per-use pricing.
HuggingFace
The central hub for open-source ML models, datasets, and spaces. Offers Inference API, Inference Endpoints, and the Transformers library for running models.