Shush
Shush is a demo application that deploys a WhisperV3 model with Flash Attention v2 on Modal and exposes it through a Next.js frontend. It is aimed at developers who want to run high-performance models behind reliable, on-demand APIs that scale automatically.

About Shush
Shush demonstrates a WhisperV3 deployment with Flash Attention v2 on Modal, delivered through a Next.js UI. This end-to-end demo highlights on-demand, auto-scaling AI hosting and provides a reusable boilerplate for high-performance model workloads.
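As a rough illustration of how a client could talk to a Modal-hosted transcription backend over HTTP, the sketch below builds a JSON POST request using only the Python standard library. The endpoint URL, the `audio_url` payload field, and the function names are all hypothetical and are not taken from the Shush repository; the real app makes its calls from the Next.js frontend.

```python
import json
import urllib.request

# Hypothetical URL: Modal assigns the real one when the backend is deployed.
ENDPOINT_URL = "https://example--shush-transcribe.modal.run"


def build_transcription_request(
    audio_url: str, endpoint: str = ENDPOINT_URL
) -> urllib.request.Request:
    """Build a JSON POST request asking the backend to transcribe `audio_url`.

    The payload shape ({"audio_url": ...}) is an assumption for illustration.
    """
    payload = json.dumps({"audio_url": audio_url}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def transcribe(audio_url: str, timeout: float = 300.0) -> dict:
    """Send the request and return the decoded JSON response.

    The generous timeout allows for a cold start while a container spins up.
    """
    request = build_transcription_request(audio_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Building the request separately from sending it keeps the payload easy to inspect without a live deployment; `transcribe` only succeeds once `ENDPOINT_URL` points at a real endpoint.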
Features
- WhisperV3 Model Deployment
- Next.js Integration
- Auto-Scaling Capabilities
Getting Started
- Visit modal.com and create a free account.
- Install the Modal Python package and authenticate in your CLI by following the instructions on the website.
- Navigate to the root of the repository and run the Next.js app.
Use Cases
Target Audience