Tarteel Migrates Cloud Infrastructure to CoreWeave with Help from Zeet

Copied

Bridget Schrier

Published on

March 9, 2023

Tarteel AI leveraged Zeet to smoothly move its deployment from AWS to CoreWeave, translating to a 22% improvement in latency and ~56% cost reduction.

‍

For Tarteel, an AI-powered Quran app, switching from AWS to CoreWeave was a no-brainer.

Lured by the flexibility to choose from a broad portfolio of the latest NVIDIA GPUs, affordable pricing, and integration with deployment partner Zeet, Tarteel migrated its entire cloud infrastructure over to CoreWeave.

Groundbreaking AI

Tarteel is an AI-powered Quran study app that helps 2B+ Muslims around the world practice and improve their relationship with their faith. The app is interactive and voice-guided, meaning users converse with it, much like they would with a teacher.

Tarteel follows along as users recite verses, highlighting the words as they’re said and flagging mistakes in real time. Users can also search the Quran and related texts in Arabic using their voice.

Tarteel corrects recitation (the eponymous use case of the app), sentence structure, detects incorrect word choices, and points out the distinctions between similar verses to help users understand context and aid in memorizing the Quran.

With 3M+ downloads, a growing user base, and 250k+ audio hours transcribed per month, Zeet and CoreWeave are helping Tarteel hit its stride.

Big Opportunity

Unsatisfied with legacy cloud offerings, including a lack of observability for workloads, an underwhelming GPU selection and an unoptimized solution for inference and production workloads, Tarteel saw multiple growth opportunities with CoreWeave.

CoreWeave offers clients tangible advantages over large legacy cloud providers, including out-of-the-box observability, the industry's broadest range of NVIDIA GPUs, responsive autoscaling, best-in-class support, excellent documentation, and seamless integration with Zeet.

CoreWeave gives Tarteel out-of-the-box observability with access to Grafana and Prometheus and offers a native S3 solution, which minimizes Tarteel’s cloud and data transfer costs.

NVIDIA Riva & NeMo

Tarteel had experimented with various toolkits, but most were tailored for research or specific use cases, with poor inference latency and difficult-to-reproduce SOTA results.

“Many STT toolkits and libraries we explored failed to deliver on both the performance and accuracy requirements necessary to deliver our unique product experience. They also were not optimized for deploying models in a production environment.”

– Anas Abou Allaban, Co-Founder and CEO, Tarteel

On CoreWeave, Tarteel can access NVIDIA’s Riva and NVIDIA NeMo toolkits. NeMo gave the team the ability to quickly train and fine-tune SOTA ASR and TTS models on Tarteel’s own data and custom hyperparameters. This allowed the team to maximize WER performance and achieve amazing results recognizing Quran recitation.

The team then leveraged Riva to deploy NeMo models with Triton, delivering blazing-fast inference speeds compared to other frameworks. Riva also gave Tarteel the ability to modify the behavior of the models in production by adjusting parameters related to audio chunk size, padding, and Voice Activity Detection (VAD) to deliver the best end-user experience possible.

Post-Migration

With Riva and NeMo, Tarteel achieved real-time production performance and reproducible SOTA results by optimizing latency and streaming audio, and support for scaling AI model inferencing to multiple GPUs with NVIDIA Triton.

After migration, Tarteel recorded:

~1,600 requests/min. On average over 40 NVIDIA GPUs (A4000/5000s)
22% improvement in latency
Median request latency: 423.5ms
~56% cost reduction

Tarteel plans to use NVIDIA’s SDKs and toolkits for new use cases like NLP and TTS.

Can’t Beat Zeet

Tarteel uses Zeet to deploy services to multiple cloud providers in multiple regions (US, EU, AP).

Zeet made the migration to CoreWeave swift and simple, quickly resolving issues, like setting up load balancing and DNS, to ensure that migration time was minimal, at 1-2 days total.

“Zeet made the switch to CoreWeave a breeze. We had minimal downtime and did not need any K8s or CW expertise, we just clicked a few buttons and we were live.”

– Anas Abou Allaban, Co-Founder and CEO, Tarteel

Zeet made autoscaling and deploying incredibly straightforward. Without having to worry about deployments, Tarteel has been able to focus on improving its product and services.

“Just write a Docker container and your app is ready to be shipped on Zeet and CoreWeave. When you’re ready to scale, all you need is a few clicks to configure autoscaling or manually increase your replica count.”

– Anas Abou Allaban, Co-Founder and CEO, Tarteel

What’s Next for Tarteel?

As Tarteel continues to harness AI to help enrich Muslims' experience with the Quran, they seek to save ancient texts that are at risk of being lost to history and provide important context on verses of the Quran.

“Our goal is to continue leveraging AI to deliver unique and personalized experiences for Muslims to better practice & engage with their faith. This includes experiences related to semantic search, Q&A (NLP), forecasting/spaced repetition to find the best chapter to recite/revise, and digitizing Islamic/Arabic literature.”

– Anas Abou Allaban, Co-Founder and CEO, Tarteel

Zeet and CoreWeave will be with them every step of the way to help deliver maximum value to their users.

Tarteel Migrates Cloud Infrastructure to CoreWeave with Help from Zeet

Copied

Bridget Schrier

Published on

March 9, 2023

Copied

Tarteel Migrates Cloud Infrastructure to CoreWeave with Help from Zeet

Tarteel AI leveraged Zeet to smoothly move its deployment from AWS to CoreWeave, translating to a 22% improvement in latency and ~56% cost reduction.

Groundbreaking AI

Big Opportunity

NVIDIA Riva & NeMo

Post-Migration

Can’t Beat Zeet

What’s Next for Tarteel?

Tarteel Migrates Cloud Infrastructure to CoreWeave with Help from Zeet

Share article

Related Blogs

Announcing distributed AI on CoreWeave with fully managed Ray on Anyscale

Building Pennsylvania into the Mid-Atlantic AI Hub

CoreWeave Launches the First Generally Available NVIDIA RTX PRO 6000 Blackwell Server Instances

CoreWeave to Acquire Core Scientific

CoreWeave Leads the Way with First NVIDIA GB300 NVL72 Deployment

Benchmark Results: CoreWeave AI Object Storage Delivers 2+ GB/s per GPU Throughput Across any Number of GPUs

Accelerating AI Leadership: How CoreWeave’s MLPerf Results Unlock Customer Innovation

CoreWeave, NVIDIA, and IBM Set MLPerf Record with Largest NVIDIA GB200 Blackwell Cluster, Achieving Over 2× Faster Training

CoreWeave Expands its NVIDIA Blackwell Fleet with Generally Available NVIDIA HGX B200 Instances

Unlocking AI Inference at Scale: CoreWeave Joins Red Hat Open Source Project llm-d as Founding Member

Products

Solutions

AI Infrastructure

Why CoreWeave

Resources

About