Products
Resource Hub
.png)
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
.jpeg)

Blog
.jpeg)
Blog
Building A High Performance Data Access Layer for Model Training and Model Distribution for LLM at Zhihu
Bringing a large language model from its initial training to deployment requires numerous systems and components. At Zhihu, we grappled with a multi-cloud, cross-region AI platform, requiring an efficient solution to facilitate the rapid training and delivery of models for production use cases. This led us to adopt Alluxio, the high-performance data access layer for LLM. This blog provides an in-depth look at Zhihu’s challenges, journey, and solution for LLM training and deployment. Through adopting Alluxio, we’ve significantly enhanced model training performance by 2 to 3 times and can deploy updated models every minute instead of hours or days. Also, our GPU utilization has doubled, infrastructure and operation costs have been halved, and we have established a resilient, efficient infrastructure capable of meeting our escalating AI demands.
Model Training Acceleration
Model Distribution
GPU Acceleration
Cloud Cost Savings
Your selections don't match any items.
.jpeg)

.jpeg)

.jpeg)
.jpeg)
.jpeg)

.jpeg)


