Ram Vegiraju – TechToday

Optimized Deployment of Mistral7B on Amazon SageMaker Real-Time Inference | by Ram Vegiraju | Feb, 2024

By Ram Vegiraju February 21, 2024AINo Comments

[ad_1] Utilize large model inference containers powered by DJL Serving & Nvidia TensorRTImage from Unsplash by KommersThe Generative AI space continues to expand at an unprecedented rate, with the introduction…

An Introduction To Fine-Tuning Pre-Trained Transformers Models | by Ram Vegiraju | Feb, 2024

By Ram Vegiraju February 17, 2024AINo Comments

[ad_1] Simplified utilizing the HuggingFace trainer objectImage from Unsplash by Markus SpiskeHuggingFace serves as a home to many popular open-source NLP models. Many of these models are effective as is,…

Building an LLMOPs Pipeline. Utilize SageMaker Pipelines, JumpStart… | by Ram Vegiraju | Jan, 2024

By Ram Vegiraju January 18, 2024AINo Comments

[ad_1] Utilize SageMaker Pipelines, JumpStart, and Clarify to Fine-Tune and Evaluate a Llama 7B ModelImage from Unsplash by Sigmund2023 was the year that witnessed the rise of various Large Language…

Augmenting LLMs with RAG. An End to End Example Of Seeing How… | by Ram Vegiraju | Oct, 2023

By Ram Vegiraju October 10, 2023AINo Comments

[ad_1] An End to End Example Of Seeing How Well An LLM Model Can Answer Amazon SageMaker Related QuestionsImage from UnsplashI’ve written quite a few blogs on Medium around different…

Deploying Large Language Models With HuggingFace TGI | by Ram Vegiraju | Jul, 2023

By Ram Vegiraju July 14, 2023AINo Comments

[ad_1] Another way to efficiently host and scale your LLMs with Amazon SageMakerImage from UnsplashLarge Language Models (LLMs) continue to soar in popularity as a new one is released nearly…

Deploying Multiple Models with SageMaker Pipelines | by Ram Vegiraju | Mar, 2023

By Ram Vegiraju March 23, 2023AINo Comments

[ad_1] Applying MLOps best practices to advanced serving optionsImage from Unsplash by GrowtikaMLOps is an essential practice to productionizing your Machine Learning workflows. With MLOps you can establish workflows that…

Deploying SageMaker Endpoints With Terraform | by Ram Vegiraju | Mar, 2023

By Ram Vegiraju March 14, 2023AINo Comments

[ad_1] Infrastructure as Code With TerraformImage from Unsplash by Krishna PandeyInfrastructure as Code (IaC) is an essential concept to optimize and take your resources and infrastructure to production. IaC is…