modular

星期三 05 下午 七月 2o 2025

How Inworld built the world’s most advanced speech pipeline in less than 8 weeks with Modular

How Inworld built the world’s most advanced speech pipeline in less than 8 weeks with Modular

How Inworld built the world’s most advanced speech pipeline in less than 8 weeks with Modular @media only screen and (max-width:639px){img.stretch-on-mobile,.hs_rss_email_entries_table img,.hs-stretch-cta .hs-cta-img{height:auto !important;width:100% !important} .display_block_on_small_screens{display:block}.hs_padded{padding-left:20px !important;padding-right:20px !important} .hs-hm,table.hs-hm{display:none}.hs-hd{display:block !important}table.hs-hd{display:table !important} }@media only screen and (max-width:639px){.hse-border-m{border-left:1px solid #cbd6e2 !important;border-right:1px solid #cbd6e2 !important;box-sizing:border-box} .hse-border-bottom-m{border-bottom:1px solid #cbd6e2 !important}.hse-border-top-m{border-top:1px solid #cbd6e2 !important} .hse-border-top-hm{border-top:none !important}.hse-border-bottom-hm{border-bottom:none !important} }.moz-text-html .hse-column-container{max-width:600px !important;width:600px !important} .moz-text-html .hse-column{display:table-cell;vertical-align:top}.moz-text-html .hse-section .hse-size-12{max-width:600px !important;width:600px !important} @media only screen and (min-width:640px){.hse-column-container{max-width:600px !important;width:600px !important} .hse-column{display:table-cell;vertical-align:top}.hse-section .hse-size-12{max-width:600px !important;width:600px !important} }@media only screen and (max-width:639px){.hse-body-wrapper-td{padding-top:20px !important} #section-0 .hse-column-container{padding-top:10px !important;padding-bottom:10px !important;background-color:transparent !important} #section-0 .hse-column-container{background-color:transparent !important} }@media only screen and (max-width:639px){ #section-1 .hse-column-container{padding-top:0px !important;padding-bottom:0px !important} #section-1 .hse-column-container{background-color:#fff !important} }@media only screen and (max-width:639px){ #section-2 .hse-column-container{padding-top:10px !important;padding-bottom:0px !important} #section-2 .hse-column-container{background-color:#fff !important} }@media only screen and (max-width:639px){ #section-3 .hse-column-container{padding-top:0px !important;padding-bottom:0px !important} #section-3 .hse-column-container{background-color:#fff !important} }@media only screen and (max-width:639px){ #section-4 .hse-column-container{padding-top:0px !important;padding-bottom:0px !important} #section-4 .hse-column-container{background-color:#fff !important} }@media only screen and (max-width:639px){ #section-5 .hse-column-container{padding-top:0px !important;padding-bottom:0px !important} #section-5 .hse-column-container{background-color:#fff !important} }@media screen and (max-width:639px){.social-network-cell{display:inline-block} }@media only screen and (max-width:639px){ #section-6 .hse-column-container{padding-top:0px !important;padding-bottom:0px !important} #section-6 .hse-column-container{background-color:#fff !important} }@media only screen and (max-width:639px){ #section-7 .hse-column-container{padding-top:0px !important;padding-bottom:0px !important} #section-7 .hse-column-container{background-color:#fff !important} }@media only screen and (max-width:639px){.hse-body-wrapper-td{padding-bottom:20px !important} #section-8 .hse-column-container{padding-top:30px !important;padding-bottom:0px !important;background-color:transparent !important} #section-8 .hse-column-container{background-color:transparent !important} }#hs_body #hs_cos_wrapper_main a[x-apple-data-detectors]{color:inherit !important;text-decoration:none !important;font-size:inherit !important;font-family:inherit !important;font-weight:inherit !important;line-height:inherit !important} a{text-decoration:underline}p{margin:0}body{-ms-text-size-adjust:100%;-webkit-text-size-adjust:100%;-webkit-font-smoothing:antialiased;moz-osx-font-smoothing:grayscale} table{border-spacing:0;mso-table-lspace:0;mso-table-rspace:0}table,td{border-collapse:collapse} img{-ms-interpolation-mode:bicubic}p,a,li,td,blockquote{mso-line-height-rule:exactly}

Powered by Modular and NVIDIA Blackwell, Inworld’s TTS stack is fast, affordable, and production-ready. Dive into the full story and explore more testimonials on our new case studies page.

Modular

Inworld + Modular: scalable, SoTA speech synthesis 🔥

Inworld

Building high-performance AI infrastructure doesn’t have to take months. Inworld proved that by launching a state-of-the-art speech pipeline into production in under 8 weeks with Modular. Their blog post explains how they used MAX and Mojo to run on NVIDIA Blackwell GPUs, meet real-time latency targets that were 70% faster than using the latest vLLM. This cut their serving costs by 60%, and enabled them to offer one of the lowest priced TTS API’s available. If you’re aiming for fast, affordable, real-time AI, their approach provides a useful playbook.

Read the full breakdown on Inworld’s blog

New on our site: Modular Case Studies

Customers Page

We just launched a new case studies page featuring companies that are solving tough AI infra problems with Modular. 

• Qwerky is using Mojo and MAX to run their custom Mamba models on NVIDIA, AMD, and Apple Silicon with a single codebase, making personality-rich AI accessible on everyday hardware.
• Inworld scaled real-time speech synthesis with fast, affordable serving built on Modular’s stack.

Whether you’re optimizing for price, performance, or portability, check out what’s possible–with many more coming soon!

Discover how teams are building with Modular

发布者