Krakow, Poland, 17 - 19 June 2026

Small Language Models are finally practical for everyday applications: they're cheaper to run, fast enough for real-time tasks, and easy to ship. This session shows how to build one end-to-end with open tooling: selecting a data corpus, reusing a well-established tokenizer, defining a model architecture, and training with knowledge distillation. We'll walk through the whole training process: how to fine-tune, how to compare outputs between the teacher and student, and how to use standard techniques like batching, learning rate tuning, and checkpointing to keep training stable and efficient. You'll learn how to distill the teacher model's knowledge into your smaller student network, then compress it with dynamic int8 quantization to make it run faster on standard CPUs.
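To make the distillation step concrete, here is a minimal sketch of a commonly used distillation objective for a single token position: a temperature-softened KL term between teacher and student, blended with ordinary cross-entropy on the true label. It is not the speaker's code. The function names, the `temperature` and `alpha` defaults, and the toy logits are illustrative assumptions, and it uses only the Python standard library instead of a training framework.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; a higher temperature gives a flatter distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def cross_entropy(logits, target_index):
    """Standard (temperature 1) cross-entropy against the true token index."""
    return -math.log(softmax(logits)[target_index])


def distillation_loss(student_logits, teacher_logits, target_index,
                      temperature=2.0, alpha=0.5):
    """Blend a soft-target term (teacher vs. student) with a hard-label term.

    temperature and alpha are illustrative defaults (assumptions), not
    values recommended by the talk.
    """
    soft_teacher = softmax(teacher_logits, temperature)
    soft_student = softmax(student_logits, temperature)
    # Scaling by temperature**2 keeps the soft-term gradients on a comparable
    # scale when the temperature changes.
    soft = kl_divergence(soft_teacher, soft_student) * temperature ** 2
    hard = cross_entropy(student_logits, target_index)
    return alpha * soft + (1.0 - alpha) * hard


# Toy logits over a 4-token vocabulary (made-up numbers for illustration).
teacher = [4.0, 1.0, 0.5, 0.1]
close_student = [3.8, 1.1, 0.4, 0.2]
far_student = [0.1, 0.5, 1.0, 4.0]

loss_close = distillation_loss(close_student, teacher, target_index=0)
loss_far = distillation_loss(far_student, teacher, target_index=0)
```

A student whose logits track the teacher's gets a lower loss than one that disagrees, so `loss_close` comes out smaller than `loss_far`. Both models must share the same vocabulary ordering for the two distributions to be comparable, which is why a tokenizer mismatch between teacher and student breaks this setup.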

We'll also talk about what breaks in practice and how to avoid it: tokenizer mismatches, unstable learning rates, temperature choices, over-checkpointing, and memory pressure on single-GPU rigs. Expect concrete code samples, reproducible scripts, and tips for running the pipeline locally or in your homelab. You'll leave with a working recipe for training and deploying a compact language model that delivers strong accuracy while running faster and cheaper than its larger counterparts.
David vonThenen
NetApp
David is a Senior AI/ML Engineer within the Office of the CTO at NetApp, where he’s dedicated to empowering developers to build, scale, and deploy AI/ML solutions in production environments. He brings deep expertise in building and training models for applications, including NLP, vision, real-time analytics, and even classifying debilitating diseases. His mission is to help users build, train, and deploy AI models efficiently, making advanced machine learning accessible to users of all levels.

Before NetApp, he was heavily involved in the AI/ML community, specifically in conversational AI solutions and driving AI platform growth in a DevRel and pre-sales role. David frequently shares his insights at industry conferences and events, offering hands-on guidance for implementing AI/ML in cloud environments. David's prior experience includes contributing to the Kubernetes and CNCF ecosystems, working hands-on with VMware virtualization, implementing backup/recovery solutions, and developing hardware storage adapter firmware and drivers.

Venue address

ICE Krakow, ul. Marii Konopnickiej 17

Phone

+48 691 793 877

Email

info@devoxx.pl
