deepsilicon recently launched!

Launch YC: deepsilicon: Running neural nets with 5x less RAM and up to 20x as fast
"Hardware and Software to Run AI Fast"
Deepsilicon GIF

Founded by Abhi Reddy and Alex Nanda

📉 The Problem

Transformer-based models have become increasingly crucial in various industries, from natural language processing to Vision Language Action models for robotics. However, the deployment and operation of these models, particularly those exceeding a few billion parameters, present significant challenges regarding hardware capabilities, energy consumption, and operational costs.
Traditional approaches to this problem typically fall into two categories:

  1. Utilize massive, power-hungry GPU clusters to distribute the computational load.
  2. Compromise on model size and capabilities to fit within existing hardware constraints on the edge.

Both of these approaches have significant drawbacks. GPU clusters are expensive to acquire and operate, with substantial energy costs and complex cooling requirements. They also introduce latency issues due to inter-device communication and can’t be deployed on the edge. On the other hand, compromising on model size can limit the AI's capabilities and potential applications, putting organizations at a competitive disadvantage.

📈 The Solution

deepsilicon helps eliminate the need for inefficient distributed computing and compromised model capabilities by providing a full-stack system where they run transformer-based models on a single chip, including existing hardware. Their solution can run on a custom ASIC, dramatically reducing power consumption and operational costs.

https://www.youtube.com/watch?v=MctVUhuXgeA


Here's why this is a game-changer:

  • Immediate Deployment: Your large-scale AI model can be operational on a single chip, eliminating the need for complex distributed setups. This means you can leverage the full power of multi-billion parameter models right from the start.
  • Customization: The chiplet architecture allows for near-infinite customization, enabling hardware/software co-design tailored to specific client needs.
  • Efficiency: Even on existing Nvidia hardware, their software provides a 5x reduction in memory usage
  • Ease of Use: Developers can switch their linear layer with their optimized version, dramatically simplifying the integration process and democratizing access to large-scale AI capabilities.

If you’re a YC company, they offer a 50% discount to help you train your model and deploy it on the device or the cloud!

🙏 How You Can Help

They are looking for connections and collaborations to drive their mission forward. If you or anyone in your network is interested in learning more or helping out, they want to hear from you! They’re specifically looking to connect with:

  • Robotics, Drone, and Autonomous Vehicle Companies
  • People looking to train their own foundation model
  • Experts in edge AI deployment

If that’s you or someone you know, please email them here.

Learn More

🌐 Visit www.deepsilicon.net to learn more.

💥 If you’re a YC company, they offer a 50% discount to help you train your model and deploy it on the device or the cloud!

🛠️ Interested? Start building now!


🤝 They are looking for intros to
Robotics, Drone, and Autonomous Vehicle Companies / People looking to train their own foundation model / Experts in edge AI deployment - email them here
🌟 Find them on Github

👣 Follow deepsilicon on
LinkedIn.

Posted 
September 17, 2024
 in 
Launch
 category
← Back to all posts  

Join Our Newsletter and Get the Latest
Posts to Your Inbox

No spam ever. Read our Privacy Policy
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.