Optimizing Cloud Infrastructure for On-Demand LLM Deployment

Project scope
Categories
Cloud technologies Information technology Machine learning Artificial intelligence DatabasesSkills
cloud computing cloud infrastructure aws bedrock qdrant large language modeling machine learning amazon elastic compute cloud database management cost benefit analysis operationsGluu.Repair is seeking to enhance the efficiency of its cloud infrastructure by optimizing the deployment of large language models (LLMs) on Amazon EC2 instances.
Currently, the models are running continuously, leading to unnecessary costs. The goal of this project is to explore and implement solutions that allow these models to be triggered only on demand, thereby reducing operational expenses. Students will investigate the potential of using Amazon Bedrock or other cloud providers to host the LLMs more efficiently. Additionally, the project involves gaining a deeper understanding of embedding techniques in QDrant, a vector database, to improve data handling and retrieval processes. This project provides an opportunity for learners to apply their knowledge of cloud computing, machine learning, and database management in a practical setting.
The project deliverables include a comprehensive report detailing the proposed solution for on-demand LLM deployment, including cost-benefit analysis and implementation steps. Additionally, students will provide a demonstration of the optimized infrastructure setup using a selected cloud provider. The final deliverable will also include documentation on the embedding process in QDrant, highlighting any improvements made. This will ensure that Gluu.Repair can efficiently manage its LLM operations while minimizing costs.
Providing specialized, in-depth knowledge and general industry insights for a comprehensive understanding.
Sharing knowledge in specific technical skills, techniques, methodologies required for the project.
Direct involvement in project tasks, offering guidance, and demonstrating techniques.
Providing access to necessary tools, software, and resources required for project completion.
Scheduled check-ins to discuss progress, address challenges, and provide feedback.
About the company
Simplify your product research for the most streamlined trading, and sharing marketplace tool. You may delete, edit, and share/trade your individual items to a friend or potential client.
Take photos of your item and the platform will provide clients with information on the product, and the option to either trade and or repair to a list of marketplaces and experts required.
Whether our clients simply want to know more about their item, or use the information for a fair market valuation of the item, users, may list up to 10 items for free.