OctoAI

OctoAI

Software Development

Seattle, Washington 12,814 followers

Run, tune, and scale the models that power AI applications.

About us

OctoAI delivers infrastructure to run, tune, and scale generative AI applications. OctoAI makes models work for you, not the other way around. Developers get easy access to efficient AI infrastructure so they can run the models they choose, tune them for their specific use case, and scale from dev to production seamlessly. With the fastest foundation models on the market (including Llama-2, Stable Diffusion, and SDXL), integrated customization solutions, and world-class ML systems under the hood, developers can focus on building apps that wow their customers without becoming AI infrastructure experts. Backed by leading venture capital firms, the company is headquartered in Seattle, WA. OctoAI is founded and led by the creators Apache TVM, an open-source ML stack for model performance and portability. 

Website
https://octo.ai
Industry
Software Development
Company size
51-200 employees
Headquarters
Seattle, Washington
Type
Privately Held
Founded
2019
Specialties
machine learning, artificial intelligence, Stable Diffusion, SDXL, LLMs, and Generative AI

Products

Locations

Employees at OctoAI

Updates

  • OctoAI reposted this

    View profile for Luis Ceze, graphic

    CEO at OctoAI & Lazowska Endowed Professor at University of Washington

    Great to see an independent evaluation by Alex Volkov featuring OctoAI's fresh offering of Llama3.1 70B. It shows OctoAI's performance leadership using mainstream cloud GPUs. It is all about striking the right balance of speed, quality and cost to deliver value to our customers. Behind the scenes is our custom LLM serving infrastructure that builds on years of work in AI systems from ML compilers up to cloud engineering. Customers can run Llama3.1 in our managed SaaS as well as our OctoStack private offering. WanB dashboard: https://lnkd.in/gf9g2b7x

    • No alternative text description for this image
  • View organization page for OctoAI, graphic

    12,814 followers

    Pass 16x MORE information into the new Llama 3.1 models! Now with context windows up to 128k, unlocking new possibilities for: Retrieval Augmented Generation (RAG) Document summarization Business intelligence reporting tools And more! Take your enterprise applications to the next level 👆 Try the Llama 3.1 herd on OctoAI: https://octoai.cloud/text Read more here: https://lnkd.in/gmWjKJHR #OctoAI #RAG #DocumentSummarization #BusinessIntelligence

    • No alternative text description for this image
  • View organization page for OctoAI, graphic

    12,814 followers

    Catch Jared Roesch next week for Statsig's Founders by Founders! For #SeattleTechWeek hosted by Madrona 🙌 Register here: https://lu.ma/pvczoic7

    View organization page for Statsig, graphic

    13,786 followers

    T-1 week today: We are hosting Founders by Founders for #SeattleTechWeek 👏🏻 Hear start-up stories from Jared Roesch (CTO and Founder of OctoAI), Linda Lian (CEO and Founder of Common Room), and Justin Uberti (CTO and Co-Founder of Fixie.ai). Moderated by CEO and Founder of Statsig, Vijaye R.. Join us for founder stories, enjoy food and drinks, and network! Save your spot: https://lu.ma/pvczoic7 Seattle Tech Week is hosted by Madrona #statsig #abtesting #featureflags #experimentation #developercommunity

    • No alternative text description for this image
  • View organization page for OctoAI, graphic

    12,814 followers

    🎉 A year older, a year wiser, and a whole lot more AI-savvy! We're celebrating our birthday and reflecting on the amazing journey that's brought us to where we are today. 🎁 🎉 Just in the past year, we've achieved some incredible milestones: ✨ Launched the OctoAI platform and held a launch party at AWS ✨ Introduced our Text Gen and Media Gen solution and welcomed amazing customers like HyperWrite (OthersideAI) ✨ Changed our name to OctoAI! ✨ Launched OctoStack and onboarded a second cloud (GCP) in April ✨ Connected with some of the top AI engineers and developers through hackathons, in-office events, and builder's roundtables ✨ Grown our team, celebrated milestones, and helped customers build products that bring joy to people's lives And this week got even better when, with the integration of Llama 3.1 models from Meta! Now available on octo.ai (make sure to give it a try!) Our mission to make AI more sustainable and accessible is possible because of the incredible people who've joined us on this journey. Thank you to our team, customers, and partners! ❤️

  • OctoAI reposted this

    View profile for Luis Ceze, graphic

    CEO at OctoAI & Lazowska Endowed Professor at University of Washington

    Today is OctoAI’s birthday! Its amazing how much we accomplished in a short few years. We pioneered ML compliers and now 100s of commercial and R&D efforts are built on those technologies. We built Octomizer, the first SaaS tool for model optimization and deployment in edge and cloud. And now we have a world-class customizable, efficient and reliable GenAI serving stack. But what makes me most proud is our team culture, which is as much about celebrating wins as it is working through the challenges. I love this company and this team ❤️. Happy Birthday, OctoAI! With a huge thank you to our investors and customers for supporting us and trusting us with their business! 🙏 🐙 🚀

    • No alternative text description for this image
  • OctoAI reposted this

    View profile for Luis Ceze, graphic

    CEO at OctoAI & Lazowska Endowed Professor at University of Washington

    This time last year there were 16,000 open source LLMs available on Hugging Face — and that seemed like a lot at the time! Yet people there were questioning when OSS models would be “really ready for primetime.” Since then the state-of-the-art has evolved at warp speed, thanks in large part to AI at Meta and their Llama family of models. Today OctoAI has hundreds of happy customers running Llama 3 in production and saving up to 60X vs. closed models. Perhaps most exciting are the encouraging quality benchmarks that reveal Llama 3.1 405B to be the most capable open-source LLM ever — competing with best-in-class closed models for math, function calling, and reasoning. Our customers are already testing it out, and you can too https://lnkd.in/dDaWVfJw

    View profile for Yann LeCun, graphic
    Yann LeCun Yann LeCun is an Influencer

    A hugely important commitment to the openness of Meta's AI ecosystem by Mark Zuckerberg: "Open Source AI Is the Path Forward " Llama 3.1 is free, open, and on par with the best proprietary systems. To maximize performance, safety, customizability, and efficiency, AI platforms must be open, just like the software infrastructure of the Internet became open. - Open Source AI is good for developers: fine-tuning, distillation, safety, efficiency, privacy, flexibility, portability, affordability, and a large ecosystem of contributors. - Open source AI is good for Meta: the larger the community, the faster the progress. - Open source AI is good for the world: enables more diversity in languages, cultures, value systems, and centers of interest in AI assistants. Enables a wider access with less concentrated control. https://lnkd.in/gFYVuV_s

    Open Source AI Is the Path Forward | Meta

    Open Source AI Is the Path Forward | Meta

    https://about.fb.com

  • OctoAI reposted this

    View profile for Luis Ceze, graphic

    CEO at OctoAI & Lazowska Endowed Professor at University of Washington

    I am thrilled to share that the entire Llama 3.1 herd — 8B, 70B and 405B — is available on OctoAI today. Thank you AI at Meta for your incredible partnership and to all of our Octonauts for working fast to make these model available to our customers immediately. Llama 3.1 is full of high-value features we know they’re going to love — a massive 128K context window, native tool calling, and support for 8 global languages, including my native tongue Portuguese! For enterprise users, Llama 3.1 405B unlocks new possibilities: an open source model that delivers transparency and model control that doesn’t sacrifice on quality. We’re also really excited to offer the entire 3.1 herd, including 405B, on OctoStack private deployment for privacy and compliance sensitive use cases. Get started today for free at https://octoai.cloud.

    View organization page for OctoAI, graphic

    12,814 followers

    OctoAI is proud to introduce the full Llama 3.1 herd to our customers, featuring 8B, 70B, and 405B models! These cutting-edge models unlock new possibilities for AI natives and enterprises, with: ✍ Massive 128k context window 🌍 Support for 8 global languages 📞 Native tool calling The Llama 3.1 herd boasts the largest, most capable open-source LLM available, Llama 3.1 405B, which sets a new standard for quality benchmarks. Say goodbye to the tradeoff between privacy and quality! With OctoStack, enterprises can self-host a powerful LLM that meets or exceeds GPT-4 quality, without compromising security and compliance. Ready to experience the power of Llama 3.1? Get started with the entire llama 3.1 herd for free at octoai.cloud and contact us to request a no-cost POC for Llama 405B on OctoStack! #OctoAI #meta AI at Meta https://lnkd.in/gmWjKJHR

    Introducing the Llama 3.1 Herd on OctoAI & OctoStack | OctoAI

    Introducing the Llama 3.1 Herd on OctoAI & OctoStack | OctoAI

    octo.ai

  • OctoAI reposted this

    View profile for Luis Ceze, graphic

    CEO at OctoAI & Lazowska Endowed Professor at University of Washington

    Huge achievement by the AI at Meta team on launching the Llama 3.1 models!  The quality benchmarks look incredible, our customers are going to be really excited for the whole Llama 3.1 herd. Learn more and try them on OctoAI here: https://lnkd.in/giU-mP8S. Thank you AI at Meta for the models and the OctoAI team for the incredible work!

    View organization page for AI at Meta, graphic

    829,197 followers

    Starting today, open source is leading the way. Introducing Llama 3.1: Our most capable models yet. Today we’re releasing a collection of new models including our long awaited 405B. Llama 3.1 delivers stronger reasoning, a larger 128K context window & improved support for 8 languages including English — among other improvements. Details in the full announcement ➡️ https://go.fb.me/hvuqhb Download the models ➡️ https://go.fb.me/11ffl7 We evaluated performance across 150 benchmark datasets across a range of languages — in addition to extensive human evaluations in real-world scenarios. Trained on >16K NVIDIA H100 GPUs, Llama 3.1 405B is the industry leading open source foundation model and delivers state-of-the-art capabilities that rival the best closed source models in general knowledge, steerability, math, tool use and multilingual translation. We’ve also updated our license to allow developers to use the outputs from Llama models — including the 405B — to improve other models for the first time. We’re excited about how synthetic data generation and model distillation workflows with Llama will help to advance the state of AI. As Mark Zuckerberg shared this morning, we have a strong belief that open source will ensure that more people around the world have access to the benefits and opportunities of AI and that’s why we continue to take steps on the path for open source AI to become the industry standard. With these releases we’re setting the stage for unprecedented new opportunities and we can’t wait to see the innovation our newest Llama models will unlock across all levels of the AI community.

  • View organization page for OctoAI, graphic

    12,814 followers

    OctoAI is proud to introduce the full Llama 3.1 herd to our customers, featuring 8B, 70B, and 405B models! These cutting-edge models unlock new possibilities for AI natives and enterprises, with: ✍ Massive 128k context window 🌍 Support for 8 global languages 📞 Native tool calling The Llama 3.1 herd boasts the largest, most capable open-source LLM available, Llama 3.1 405B, which sets a new standard for quality benchmarks. Say goodbye to the tradeoff between privacy and quality! With OctoStack, enterprises can self-host a powerful LLM that meets or exceeds GPT-4 quality, without compromising security and compliance. Ready to experience the power of Llama 3.1? Get started with the entire llama 3.1 herd for free at octoai.cloud and contact us to request a no-cost POC for Llama 405B on OctoStack! #OctoAI #meta AI at Meta https://lnkd.in/gmWjKJHR

    Introducing the Llama 3.1 Herd on OctoAI & OctoStack | OctoAI

    Introducing the Llama 3.1 Herd on OctoAI & OctoStack | OctoAI

    octo.ai

Similar pages

Browse jobs

Funding