SambaNova Systems
SambaNova Systems
sambanova.ai

Locations

Palo Alto, CA, USA

industry

AI Infrastructure · Analytics · Artificial Intelligence (AI) · Machine Learning · Semiconductor · Software

Size

201 - 1000 employees

Stage

Other

founded in

2017

SambaNova provides AI infrastructure products and services focused on AI inference and model deployment. The company offers a cloud platform and APIs designed to run large open-source models with high throughput and energy-efficient performance using its custom dataflow hardware architecture and reconfigurable dataflow units (RDUs). Its stack includes integrated software for load balancing, model management, and orchestration across data center hardware. SambaNova supports developers and enterprises in deploying agentic AI applications at scale and provides solutions for sovereign AI data centers. Its technology emphasizes scalable inference on frontier models with flexible infrastructure support. Samba Cloud delivers the fastest inferences on the largest open-source models like Meta Llama Models, Qwen, DeepSeek, and OpenAI. Developers can get started building in minutes with our OpenAI compatible APIs. All customers start on the developer tier, and when they need more capacity, they can scale into our enterprise tier. SambaStack is our on-premise offering which includes the system, the platform, and foundation models. These components combine into a powerful technology stack that delivers unparalleled performance, ease of use, accuracy, data privacy, and the ability to power every use case across the world's largest organizations. SambaManaged is a modular and ready-to-deploy AI cloud designed to deliver unmatched efficiency for data centers and cloud service providers. This solution allows organizations to quickly deploy advanced AI inference services—without the need for costly infrastructure upgrades or specialized expertise—in as little as 90 days. At the heart of SambaNova innovation is the Reconfigurable Dataflow Unit (RDU). Purpose-built for AI workloads, the RDU takes advantage of a dataflow architecture and a three-tiered memory design. The three tiers of memory enable the platform to run hundreds of models on a single node and to switch between them in microseconds. In 2023, SambaNova released its 4th generation RDU chip, the SN40L.

Something looks off?
Open jobs at SambaNova Systems

On-site & Remote