Krakow, Poland, 11 - 13 June 2025

Sylwester Lewandowski
DATEV eG

Sylwester is a Software Engineer with over 15 years of professional experience, primarily in the telecommunications and financial domains. Throughout his career, he has coordinated several projects where improving performance and reducing resource consumption were key success factors. Currently, he works as a Tech Lead at DATEV eG, where he is involved in DevOps and MLOps projects.

Monitoring resource consumption of on-premises deployed LLMs
Conference (INTERMEDIATE level)
Room 4A

Large Language Models (LLMs) have gained significant popularity, with industry giants offering them as Software-as-a-Service (SaaS) solutions. However, deploying AI models on your private cloud, data center, or personal hardware can be a smart choice when considering security, data privacy, and cost-effectiveness.

In this session, we will examine in detail the resources that LLMs consume. We will begin with a concise introduction to MLOps, which provides a foundation for efficient model operations and management. We will then use command-line tools to reveal what happens behind the scenes when an LLM is prompted. Finally, we will set up a simple observability pipeline that collects GPU usage statistics and makes resource consumption visible.

Join us to gain valuable insights and practical skills for running LLMs securely and cost-effectively.

Venue address

ICE Krakow, ul. Marii Konopnickiej 17

Phone

+48 691 793 877

Email

info@devoxx.pl
