GPT-OSS from OpenAI is a major win for open source AI and allowing companies to have control over their own data. We at vast.ai are proud to support GPT-OSS from Day 1 on our platform! Run GPT-OSS on your own infra with full control over reasoning depth (low → high). We made it easy to launch GPT-OSS on Vast.ai with vLLM. https://lnkd.in/eJjRQGYz
Vast.ai
Software Development
Los Angeles, California 10,129 followers
The AI Infrastructure Platform: Scalable, efficient, on Demand GPUs
About us
Vast.ai is the market leader for low cost GPU rentals. The service connects data centers and professionals running the Vast hosting software with users who can quickly find the best deals for compute according to their specific requirements. Vast.ai GPU rentals are ~3-5X cheaper than current alternatives. Consumer computers and consumer GPUs in particular are considerably more cost effective than equivalent enterprise hardware. We are helping the millions of underutilized consumer GPUs around the world enter the cloud computing market for the first time.
- Website
-
https://vast.ai
External link for Vast.ai
- Industry
- Software Development
- Company size
- 11-50 employees
- Headquarters
- Los Angeles, California
- Type
- Privately Held
- Founded
- 2018
Locations
-
Primary
1100 Glendon Ave
Suite #1840
Los Angeles, California 90024, US
Employees at Vast.ai
Updates
-
In our blog, we break down the top recommended templates on Vast.ai and how you can use them to get your LLM project off the ground quickly and easily.
-
Introducing the Vast.ai Vulnerability Bounty Program. This is a new initiative we’ve created to help us improve, expand, and innovate our platform while making it more secure. Vast.ai’s mission is to provide the best GPUs for AI compute at accessible and affordable prices for everyone who needs it. With this bounty program, we’re inviting AI developers, researchers, and enthusiasts to collaborate with us. The launch of this program is in direct response to our community’s feedback, and we appreciate all the collaboration happening already.
-
The DeepSeek R1-0528 release introduces a significant improvement to the reasoning capabilities of language models by making it easier to access "thinking" mode without requiring complex prompt engineering or pre-pending thinking tokens. This enhanced model provides transparent, step-by-step reasoning that's particularly valuable for educational applications, complex problem solving, and scenarios where transparency in AI decision-making is crucial. Let's explore how to deploy the DeepSeek-R1-0528-Qwen3-8B model using vLLM on Vast.ai's cloud GPU platform, leveraging the new qwen3 reasoning parser that simplifies access to the model's internal thinking process.
-
Nearly every startup faces the same two pressures: to move fast and make every dollar count. Traditional cloud providers often aren't built with startups in mind. Vast.ai offers another path. Our market-based cloud GPU rental platform gives startups the flexibility to run powerful workloads without the usual friction.