The 10^th international conference on Machine Learning and Artificial Intelligence applications,
taking place in person in Prague and online.

Machine Learning Prague 2025

April 28 – 30, 2025

Registration

World class expertise and practical content packed in 3 days!

You can look forward to an excellent lineup of 45 international experts in ML and AI business and academic applications at ML Prague 2025. They will present advanced practical talks, hands-on workshops, and other forms of interactive content to you.

What to expect

1000+ Attendees
3 Days
45 Speakers
10 Workshops

Phenomenal Confirmed speakers

Hava Siegelmann

Provost Professor, University of Massachusetts Amherst

Senior faculty for bio-inspired AI, Dr. Siegelmann is an internationally known UMass Provost Professor in Computer Science and a recognized expert in neural networks. She is a core member of the University of Massachusetts Neuroscience and Behavior Program and director of the Biologically Inspired Neural and Dynamical Systems (BINDS) Laboratory. She has been particularly acclaimed for her groundbreaking work in computation beyond the Turing limit, and for achieving advanced learning capabilities through a new type of Artificial Intelligence: Lifelong Learning. Siegelmann conducts highly interdisciplinary research in next-generation machine learning, neural networks, intelligent machine-human collaboration, and computational studies of the brain - with application to AI, data science, and high-tech industry. Prof. Siegelmann is a co-inventor of the Support Vector Clustering (SVC) algorithm, which is widely used across industry and government. Among her recent Nature publications is Biological Underpinning of Lifelong Learning AI, a bio-inspired replay algorithm for advanced lifelong learning, dual fractal structure & function of the human brain, and identification of a previously unknown brain connectome mechanism, which enables cognitive abstraction.

Stanislav Fort

Senior Research Scientist, Google DeepMind

Dr Stanislav Fort is a prominent artificial intelligence researcher specializing in Large Language Models (LLMs), interpretability, and AI safety. His career includes the position of a language model lead at Stability AI, a contribution towards the Anthropic's AI system Claude, and research roles at Google Brain and DeepMind. He is currently working on AI security on the Gemini team as a senior research scientist at Google DeepMind. He obtained his PhD in artificial intelligence at Stanford University in California, USA, and studied physics for his Bachelor's and Master's degrees at Cambridge University, eventually specializing in black holes. He is an author of over 30 academic publications with over 5000 citations.

Iryna Gurevych

Full professor, Technical University of Darmstadt

Iryna Gurevych is a Full Professor (W3) at the Computer Science Department of the Technical University Darmstadt, Germany and head of the UKP Lab. She has a strong background in information extraction, semantic text processing, machine learning and innovative applications of NLP to social sciences and humanities.

Since 2014, she is co-director of the Centre for the Digital Foundation of Research in the Humanities, Social, and Educational Sciences (CEDIFOR[3]), which is funded by the Federal Ministry of Education and Research.[4] The following year, she founded the research training group AIPHES[5] (Adaptive Information Preparation from Heterogeneous Sources) funded by the German Research Foundation. Since 2020, Gurevych is the director of CA-SG,[6] a research initiative "Content Analytics for the Social Good" of the Rhine-Main Universities and co-director of the Natural Language Processing (NLP) program of ELLIS, a European Network of Excellence in Machine Learning.

In 2020, Gurevych was awarded as a Fellow of the international scientific Association for Computational Linguistics (ACL) for her outstanding contributions in the field of Natural Language Processing and Machine Learning.[7] On January 1, 2021, Gurevych has taken over the office of Vice-president-elect and becomes president of the most important international organization in computational linguistics in 2023: the Association for Computational Linguistics (ACL).[8]

Gurevych receives the first LOEWE-professorship of the LOEWE programme, a Hessian research funding programme in Germany, in March 2021.

Gurevych's research interests include Natural Language Processing, Machine Learning, Multimodal Data Analysis, Digital Humanities, and Computational Social Science.

Johan Loeckx

Assistant Professor, Vrije Universiteit Brussel

Johan Loeckx first experimented with AI in 1992 when he was 12, and created his first startup at 16, delivering application software for lawyers. After his MSc and PhD engineering studies, he co-designed the Belgian encryption system for exchanging health information for which a patent was filed. Johan currently manages the applied R&D team at the Artificial Intelligence Lab at the Vrije Universiteit Brussels with a dedicated focus on trustworthy AI engineering and Cybersecurity.

Ariel Azia

Distinguished Data Scientist, Similarweb

20+ years of programming experience.
BSc in Biophysics from Bar Ilan University, Israel
Msc In Computational Biophysics from Weizmann Institute, Israel
PhD in Computational Proteomics from Bar Ilan University / Weizmann Institute, Israel
Data Scientist as SentinelOne and Similarweb
Team leader of several data science teams and Tech lead of data science in similarweb for the last 4 years.

Jon McLoone

Director of Technical Communication & Strategy, Wolfram Research

Jon McLoone is central to driving the company's technical business strategy and leading the consulting solutions team. With over 35 years of experience working with Wolfram Technologies, Jon has helped in directing software development, system design, technical marketing, corporate policy, business strategies and much more. Jon gives regular keynote appearances and media interviews on topics such as the Future of AI, Enterprise Computation Strategies and Education Reform, across multiple fields including healthcare, fintech and data science. He holds a degree in mathematics from the University of Durham.

Jon is also Co-founder and Director of Development for computerbasedmath.org, an organisation dedicated to fundamental reform of maths education and the introduction of computational thinking. The movement is now a worldwide force in re-engineering the STEM curriculum with early projects in Estonia, Sweden and Africa.

Ondřej Dušek

Assistant Professor, MFF Charles University

Ondřej Dušek is an Assistant Professor at Charles University in Prague, focusing on natural language generation and human-computer dialogue. His recent research focuses on generative language models, mostly applied to the data-to-text and dialogue response generation tasks. He is specifically interested in semantic accuracy and grounding in language generation, as well as ways of evaluating generation accuracy. After obtaining his PhD in Prague, Ondřej spent 2 years as a postdoc at Heriot-Watt University in Edinburgh in 2016-2018, where he also co-advised the university team in the Amazon Alexa Prize chatbot competition. He is currently the PI of an ERC Starting Grant titled Next-Generation Natural Language Generation, which aims to adapt neural models in order to produce fluent, accurate and explainable NLG systems.

Alexander Jesser

Full Professor, University Heilbronn

Prof. Dr. Alexander Jesser holds the diploma degree in Computer Engineering from the University of Paderborn, Germany and the Ph.D. in computer engineering from the Johann-Wolfgang Goethe University of Frankfurt a.M. Since 2013 he is a full professor for embedded systems and communications engineering at the University of Applied Sciences Heilbronn, Germany. Since 2021 he is the head of the Institute of Intelligent Cyber-Physical Systems ICPS at the University Heilbronn, Germany. He is conducting research in the field of Cyber-Physical Systems, Signal- Image-, and Voice Processing in industrial and medical technology applications. Besides this he is represented on several international scientific committees, such as RICOTED (South America International Network for Cross-Border Telehealth Cooperation in Emergency and Disaster Situations). This group is part of the CYTED Networks. He was also a visiting professor at the Paraguayan-German University (UPA) in Asuncion, the Shenzhen University of Technology (SZTU) and the Dulaty University in Kazakhstan. The ICPS maintains intensive contact with numerous international universities, particularly in Asia. In 2024, he was awarded the additional title of professor at the Paraguayan-German University (UPA) in Asuncion due to his extraordinary relationships.

Joseph Pareti

AI Consultant, Joseph Pareti's AI consulting

I am a long time industry expert having worked for SKF, Digital Equiment, Compaq and Hewlett Packard Enterprise in various roles including R&D Engineer, Application Engineer, CAE and HPC Consultant, and Pre-Sales solutions architect.

Ondřej Filip

Research Team Lead, Seznam.cz

Ondřej's current mission is training Czech LLMs at Seznam.cz. At the same company, he gained extensive experience prototyping machine learning models for various search verticals, including web, images, and maps. He holds a degree in Artificial Intelligence from Charles University in Prague.

Raid Arfua

Head of AI, GR8 Tech

I’m the Head of AI and Technology Consultant with extensive experience in ML Engineering.
My work bridges Machine Learning, Data Science, Product Management, and Strategic Communication. I prioritize aligning cutting-edge AI with business objectives, delivering scalable solutions in Recommendation Systems, Generative AI, Computer Vision, and Advanced Data Analysis.
I emphasize continuous learning, open collaboration, and a balance of strategic oversight with hands-on involvement, ensuring practical solutions and tangible results.

Tomas Pevny

Researcher and Assistant Professor, Czech Technical University

Tomas Pevny is an associate professor at FEE, CTU in Prague. While his main interest was machine learning, an offer to study in USA at Binghamton University introduced him to steganography, in which he pursued his PhD. After one year of post-doc in Grenoble, France, he returned in fall 2009 to CTU and remained there since then, with part-time excursions to Cognitive security, Cisco systems and Avast (now Gen digital). His current research interest is in tailoring machine learning for computer security. He has co-authored more than 50 papers and holds approximately 20 patents.

Vladimir Macko

Research Engineer, GrizzlyTech, former Google AI

With over a decade of experience in machine learning, Vladimir began his journey in the field working with startups. He then joined Google AI as a Machine Learning Researcher, working on large scale ML for optimization problems and autoML.

Over the past six years, Vladimir has collaborated with a wide range of organizations to bring their machine learning visions to life. Among others, he contributed to privacy-preserving authentication systems for a biometric company, time series classification for clinical studies, resume grading system for career portal and achieving top 50 NIST with facial recognition client.

Currently, Vladimir is pursuing a late PhD, driven by his passion for advancing machine learning research. His doctoral work focuses on pruning of neural networks, making them smaller and faster. Contrary to common academic practice, he is also applying these findings directly in the industry.

Alessandro Crimi

Professor of Machine Learning, AGH University of Krakow

Dr. Alessandro Crimi after completing his studies in engineering at the university of Palermo, obtained a PhD in machine learning applied for medical imaging by the University of Copenhagen, and an MBA in healthcare management by the University of Basel.
Alessandro worked as post-doctoral researcher at the French Institute for Research in Computer Science (INRIA), Technical School of Switzerland (ETH-Zurich), Italian Institute for Technology (IIT), and University Hospital of Zurich. He is currently a professor at AGH Krakow.

Ondřej Finke

Senior Data Scientist, O2/Dataclair

Ondřej Finke is a Senior Data Scientist at Dataclair, where he has been working since 2023. His primary focus is on the practical application of Large Language Models in various projects. Ondřej holds a master’s degree in laser physics from Czech Technical University. Before joining Dataclair, he gained experience in experimental physics and data processing on large scale experiments, which has influenced his approach to problem-solving in AI and machine learning.

Martin Dlask

Principal ML Engineer, King

Martin is a Principal ML Engineer at King, Sweden, specializing in recommender systems, interpretable learning, contextual multi-armed bandits, and off-policy learning. He is currently involved in projects aimed at enhancing recommendation relevance in Candy Crush Saga and improving the experience for millions of players worldwide. Martin has experience in software engineering, data science, and AI/ML across various industries, including pharmaceuticals, financial services, and, most recently, gaming. He holds an M.Sc. in Software Engineering and a Ph.D. in Computational Statistics, both from the Czech Technical University in Prague.

Martin Neznal

Senior Data Scientist, Productboard

A senior data scientist at Productboard, Martin focuses on applying natural language processing (NLP) techniques to help companies process, analyze, and make sense of customer feedback. He is passionate about taking business problems and developing and deploying models that solve the underlying customer needs. In addition to NLP, Martin’s experience spans network security and customer churn, and he has a Master’s degree in Applied Mathematics from FNSPE CTU.

Jakub Sochor

Chief Technology Officer, Innovatrics

Jakub Sochor is the Chief Technology Officer at Innovatrics, a world-class provider of biometric technology and solutions, consistently ranked in top positions by NIST. Before joining Innovatrics, Jakub worked at Google, contributing to the development of Google Assistant. He holds a PhD from Brno University of Technology, specializing in computer science and artificial intelligence.

Jan Čurn

CEO, Apify

Jan is the founder and CEO of Apify. He has a lifelong passion for software engineering, which earned him an MSc and PhD in computer science and eventually led him to founding Apify, a full-stack web scraping platform for developers. Jan is active in the Prague tech community, talks about software, startups, or AI, and regularly hosts related events in their rooftop office.

Ivan Cimrák

Lead Researcher, University of Zilina

Ivan Cimrák is a researcher and university lecturer at the University of Žilina in Slovakia, specializing in applied mathematics and informatics with a focus on modeling the separation of circulating cancer cells and the application of artificial intelligence in biomedicine.

He has held postdoctoral positions at Ghent University in Belgium and St. Pölten University of Applied Sciences in Austria.

Dr. Cimrák has been the recipient of an individual Marie Curie EU grant and has successfully secured several national grants, underscoring his contributions to the scientific community.

In 2018, he co-authored the book "Computational Blood Cell Mechanics: Road Towards Models and Biomedical Applications," which presents a comprehensive study on modeling blood cells and their behavior under flow conditions.

His research includes work on classifying red blood cells using time-distributed convolutional neural networks from simulated videos, as well as developing curated datasets for red blood cell tracking in microfluidic devices.

Dr. Cimrák leads a research lab at the University of Žilina, where he mentors a team of researchers and students in advancing computational methods in biomedicine.

He has been recognized as an exceptional figure in Slovak science, being nominated to be between four finalists for the ESET Science Award in the category of Outstanding Personality of University Education.

The award Science and Technology Award 2024 in the category Personality of Science and Technology is a prestigious recognition of Dr. Ivan Cimrák's significant contributions to the field of applied mathematics, biomedicine, and artificial intelligence. This honor awarded by Ministry of Education, Research, Development and Youth of the Slovak Republic highlights his dedication to advancing scientific understanding and innovation, solidifying his status as a leading figure in Slovak science and technology.

Dr. Cimrák actively participates in conferences and discussions related to technology in oncology, contributing to the advancement of biomedical applications of artificial intelligence.

His work continues to bridge the gap between computational modeling and practical biomedical applications, enhancing the understanding and treatment of complex medical conditions.

Philipp Wendland

AI Consultant, Deloitte Consulting

Philipp Wendland is a Senior Consultant in the Deloitte AI Institute, Germany. He is an expert in AI, with a focus on particularly Generative AI. His approach to AI is multifaceted ranging from co-authoring thought-leadership pieces, conducting workshops and C-level briefings to implementations of (Generative) AI applications. Making AI approachable and the inherit complexities understandable is one of his key undertakings. In this effort he has trained small groups of executives on AI as well as over 1,500 employees on e.g., Prompt Engineering techniques. Philipp holds a bachelor’s degree in physics from the university of Heidelberg and a masters in “Robotics, Cognition and Intelligence” from the Technical University of Munich.

Jakub Tomasz Gnyp

Computational Scientist, International Centre for Theory of Quantum Technologies

Jakub Tomasz Gnyp is a lab technician with tenure at the Condensed Matter Spectroscopy Division of the University of Gdańsk. Since 2022, he has also conducted research on quantum key distribution optimization at the International Centre for Theory of Quantum Technologies (ICTQT), as part of the Quantum Cybersecurity and Communications Group. He has contributed to projects such as "Theoretical Study of Time-Bin Entanglement Properties for Quantum Internet" and "Development of a Quantum Repeater in Optical Fiber Networks for Quantum Internet," in collaboration with the Electronics and Telecommunications Research Institute (ETRI) in South Korea.

His current research focuses on the applications of computational intelligence, machine learning, and stochastic analysis in physics. He is particularly interested in QKD error correction algorithms, near-infrared spectroscopy, and modeling weather at sea.

Jakub is a mathematician and physicist with data science experience. He worked on offshore wind farm construction projects in Antwerp, Belgium, with Ultra NDT, and has completed various assignments for companies in Poland, including Jubitom, QCG, StatXplorer, 7willows, and Talkersi. Currently, as an employee of the University of Gdańsk, he is continuing his education there.

Agata Gurzynska

Senior Analyst, PricewaterhouseCoopers

Agata is a senior analyst in the Financial Crime Unit at PricewaterhouseCoopers in Poland. She holds a master's in experimental physics from the University of Gdańsk, Poland. Additionally, she pursued further postgraduate education in data science. While she studied, she also worked at the University as a lab technician at the Experimental Physics Institute. Agata has experience with LLMs and physical systems simulations, including computational science tools, from freelance work as a data scientist. Outside of work, she enjoys hiking and playing the violin.

Tobias Kietreiber

Junior Researcher, St. Pölten University of Applied Sciences

Tobias Kietreiber is a Junior Researcher at the University of Applied Sciences St. Pölten, specializing in reinforcement learning, imitation learning, and explainable AI. With a strong foundation in mathematics, Tobias explores practical applications of these techniques, including hate speech detection and user group imitation on websites. His work involves developing intelligent systems that not only learn complex tasks through demonstration but also identify harmful online content and mimic user behaviors to enhance website interaction. Tobias also focuses on explainable AI, ensuring transparency and interpretability in machine learning models, contributing to ethical and effective AI development.

Sebastian Eresheim

Junior Researcher, St. Pölten University of Applied Sciences

Sebastian Eresheim is a Junior Researcher at University of Applied Sciences St. Pölten. He has a master's degree in Information Security from UAS St. Pölten, but also studied Technical Mathematics at Technical University Vienna. His research interests are in Reinforcement Learning and its real-world applications. His work involves finding optimal defense strategies in cyber systems to prevent and/or mitigate cyber threats.

Alexander Buchelt

Junior Researcher, St. Pölten University of Applied Sciences

Alexander Buchelt is a Junior Researcher at University of Applied Sciences St. Pölten and a PhD Student at the University for Natural Resources and Life Sciences Vienna, specializing in artificial intelligence and digital twins. His current research revolves around developing advanced AI algorithms for autonomous drones, aiming to optimize their application in forestry management to improve data collection and analysis in forest environments. His work is at the forefront of integrating AI-driven solutions into sustainable natural resource management.

Filip Roskovec

Head of Data Science, O2/Dataclair

As a senior leader of the data science team at AI centre at O2 Czech Republic, Filip specializes in internal projects, including next best action prediction and developing machine learning models for mobile network optimization. He also organizes events to promote cross-team collaboration and communication, often focusing on the latest advancements in large language models (LLMs). Filip holds a Ph.D. in Numerical and Computational Mathematics from Charles University in Prague.

Stefan Josef

Senior Data Scientist, O2/Dataclair

Stefan is a Senior Data Scientist at Dataclair, AI centre at O2 Czech Republic, where he has been leading research efforts in applying advanced NLP and Deep Learning techniques to geospatial data. Drawing on years of experience in working with language models, Stefan recently shifted his focus on improving production RAG systems through a combination of synthetic data generation and model fine-tuning. Stefan holds a M.Sc. in Economics from Stockholm University, where he specialized in empirical macroeconomics.

Ondřej Čermák

Data Scientist, O2/Dataclair

Ondřej Čermák is a Data Scientist at Dataclair, O2 Czech Republic, specializing in Retrieval-Augmented Generation (RAG) systems. He focuses on optimizing search pipelines, fine-tuning embedding models, and generating high-quality synthetic data. He is also a PhD candidate at the Czech Technical University in Prague, researching deep learning applications in quantum computing. Passionate about advancing AI, Ondřej combines research with practical implementations to push the boundaries of intelligent systems.

Kryštof Šaml

Researcher, Emplifi

Krystof Saml is a Researcher at Emplifi who develops practical AI solutions using data-centric approaches. He works on synthetic data generation systems and evaluation solutions for language models, focusing on improving data quality. Currently, he explores ways to combine LLMs with smaller specialized models to create more efficient AI systems.

Tomáš Sikora

Researcher, Emplifi

Tomas Sikora is a Researcher in Emplifi's Innovations department. He focuses on large language models (LLMs), agentic systems, and machine reasoning and their practical implementation through model training and tuning. He uses his expertise as part of the Search and Recommendation team, where he scales up solutions to address complex user requirements.

Jérémy Cochoy

CEO, Redstone Solution OÜ

Jérémy Cochoy is an expert in technology with a strong academic background. Holding a PhD in Computer Science and Mathematics with a focus on Persistent Homology, he leveraged his expertise to co-found Symphonia, an app that creatively transforms voices into music. Currently, as CEO of Redstone Solutions, Cochoy applies his skills in deep learning to the field of financial market forecasting. His career is a testament to the fusion of advanced scientific knowledge and practical technological applications, underscoring his commitment to driving innovation in complex fields. Beyond his professional realm, Cochoy's interests in music and other artistic pursuits reflect a multifaceted personality, equally engaged in intellectual and creative endeavors.

Szymon Bubak

CTO, Jiai

CTO at Jiai since 2024, focusing on Python, MLOps, and tech leadership. Founder leader of Silesia AI, fostering a strong AI community. I studied mathematics and spent nearly 8 years at Roche, where I led software development for drug discovery, improving processes across labs, enhancing developer efficiency, and integrating biotech tools critical for research on diseases like cancer and hemophilia. I also led rapid prototyping efforts at POC Now! in various industries.

Humera Noor Minhas

CTO, Digital Munich

Dr. Humera developed her love for data early on in her career and is still fascinated by its simplicity and power. As a PhD in Machine Learning, she has delivered several ML based projects successfully. Notably, she led the team that pioneered the use of machine learning for commercial-scale ad filtering. She is an entrepreneur and brings with herself twenty five years of professional experience with majors in computer vision and machine learning. Currently, she is the CTO of Digital Munich Tech GmbH, which is aimed to deliver Innovation as a Service and develop solutions for artificial intelligence, mixed reality, IOT, and data analytics. Dr. Humera loves to be in the nature and starts her day at sunrise by indulging in a book on a bench by the fields. Of course, with coffee! She can be reached at: https://www.linkedin.com/in/humeranoor/.

Artem Moroz

Researcher, CIIRC, Czech Technical University

Since joining the CIIRC RICAIP TESTBED in 2022, Artem has been applying his expertise in computer vision, machine learning, and deep learning to develop algorithms for a range of industrial applications. His main focus lies in designing solutions that enhance automation capabilities, particularly in object manipulation—where his algorithms enable robotic systems to handle and interact with objects precisely—and in quality inspection, improving the ability to detect defects and maintain high product standards.

Varun Burde

Research Assistant, CIIRC, Czech Technical University

Varun Burde is a Ph.D. student at the Czech Technical University in Prague, where he is advised by Dr. Torsten Sattler and Dr. Pavel Burget. In addition to his doctoral studies, he serves as a Research Assistant at the CIIRC, CTU, within the Research Group at the Testbed for Industry 4.0. Varun holds a master’s degree in Cybernetics and Robotics from the same institution. His research focuses on advancing 3D reconstruction techniques to enhance robotic manipulation, aiming to push the boundaries of industrial automation. By leveraging cutting-edge computer vision algorithms, Varun seeks to drive innovation in automated systems, contributing to greater precision and efficiency in Industry 4.0 applications.

Vit Zeman

Researcher, CIIRC, Czech Technical University

I’m a researcher at CTU CIIRC Testbed for Industry 4.0, where I also interned during my studies. My focus is on the usage of computer vision and machine learning in industrial applications. I have completed my Master of Science degree in Cybernetics and Robotics at the Faculty of Electrical Engineering, Czech Technical University in Prague, where I also finished my Bachelor’s in the same field.

Tun Shwe

Data & AI Consultant, Freelance

Tun is focused on helping companies imagine and implement their strategic data vision with high volume real-time data. He was previously a VP of Data and Data Engineer at high growth startups and has led cross-functional data teams in developing analytics platforms and data-intensive AI applications. In his spare time, Tun organises PyData Cornwall meetups, goes surfing, plays guitar and tends to his analogue cameras.

Ben Gamble

Field CTO, Ververica

A long-time builder of AI powered games, simulations and collaborative user experiences. Ben has previously built a global logistics company, large scale online games and augmented reality apps. Ben currently works to make fast data and AI a reality for everyone.

Tomáš Tomeček

Senior Principal Software Engineer, Red Hat

With over a decade of experience as a Software Engineer at Red Hat, Tomas is passionate for leveraging technology to simplify life. He's focused on automation, integration, and solving complex problems. In the past year, Tomas works on using AI to boost efficiency and productivity. After work, he's a hiker, gardener and snowboarder.

Cedric Clyburn

Senior Developer Advocate, Red Hat

Cedric Clyburn (@cedricclyburn), Senior Developer Advocate at Red Hat, is an enthusiastic software technologist with a background in Kubernetes, DevOps, and container tools. He has experience speaking and organizing conferences including DevNexus, WeAreDevelopers, The Linux Foundation, KCD NYC, and more. Cedric loves all things open-source, and works to make developer's lives easier! Based out of New York.

Karel Piwko

Senior Principal Software Engineer, RedHat

Karel Piwko is a Senior Principal Software Engineer with extensive experience in both management and technical roles within the software industry. After spending 10 successful years in management roles, Karel recently transitioned back to an individual contributor position, reflecting his passion for hands-on technical work and innovation.

In his current role, Karel focuses on developer productivity, championing improvements in DevOps practices and leveraging artificial intelligence to enhance software development processes. His expertise spans across Java, JavaScript, TypeScript, Python as well as agile methodologies, change transformation, coaching and mentoring.

Practical & Inspiring Program

Friday
Workshops

O2 Universum, Českomoravská 2345/17a, 190 00, Praha (workshops won't be streamed)

Registration 08:00 – 08:45

	Room D2	Room D3	Room D4	Room D6	Room D7
09:00 – 12:30 coffee break 10:30 – 11:00	Utilizing Large Language Models for improved anti-tracking in web browsers Room D2 Humera Noor Minhas, Digital Munich Online tracking remains a significant privacy concern for internet users. Current solutions while effective have limitations in terms of coverage maintenance and precision. This workshop aims to leverage the power of LLMs to create a more robust adaptive and efficient anti-tracking system. We will explore the architecture of an LLM-based anti-tracking system developing the data pipeline and exploring how these models can be fine-tuned to analyze network requests page content and user interactions in real-time. The system's ability to understand the semantic context of web elements allows for more accurate identification of tracking attempts reducing false positives while improving detection rates of sophisticated trackers. A key focus will be on the practical challenges of implementing such a system within the constraints of a web browser environment. We'll discuss strategies for optimizing LLM inference to meet the real-time demands of browsing balancing accuracy with performance.	Beyond Real-World Limitations - Mastering Synthetic Data Generation for Enhanced ML Performance Room D3 Tomáš Sikora, Emplifi Kryštof Šaml, Emplifi As machine learning pushes into new frontiers the demand for diverse and representative datasets frequently outpaces the availability of real-world data. This workshop explores a spectrum of advanced techniques for synthetic data generation from traditional methods to cutting-edge AI agent-driven approaches. By embracing the principles of data-centric AI we'll start with traditional data generation techniques and progress to innovative AI agent-driven strategies. Throughout the workshop we'll demonstrate techniques that promise to overcome limitations in real-world datasets reshape the data landscape in ML and rigorously evaluate the quality of the generated data. Workshop Overview: - Setting the Stage: The Data Challenge in ML We'll begin with a brief historical context touching on pre-LLM approaches to synthetic data generation. This background will highlight the impact of recent advancements and set the stage for our deep dive into modern techniques. - Data-Centric AI: A Paradigm Shift We'll explore how the data-centric AI movement is reshaping our approach to machine learning. Participants will learn why focusing on data quality and targeted synthetic data generation can often yield better results than simply increasing dataset size or model complexity. -Evolution of Synthetic Data Generation with AI -LLM-Powered Data Creation: Leveraging large language models for synthetic data generation. -Multi-Agent Systems: Advancing to manually designed agent ecosystems for nuanced and accurate data production. -Automated Agent Workflows: Exploring the cutting edge with self-optimizing agent interactions for superior data quality. -Robust Evaluation in the Data-Centric Paradigm We'll emphasize rigorous evaluation techniques aligned with data-centric AI principles ensuring the effectiveness of synthetic data in real-world ML applications. This workshop is ideal for ML practitioners researchers and data scientists looking to overcome data quality and scarcity challenges. Participants should have a basic understanding of machine learning concepts and some experience with programming in Python. By the end of this workshop attendees will have a comprehensive understanding of how data-centric AI principles can be applied to synthetic data generation from LLM-based techniques to state-of-the-art agent-driven approaches. They'll be equipped with practical skills to implement these strategies potentially revolutionizing how they tackle data-related challenges in ML projects.	Introduction to Algorithmic Trading: Hands-On Strategy Implementation with Real-World Data Room D4 Szymon Bubak, Jiai Jérémy Cochoy, Redstone Solution OÜ Algorithmic trading has become a cornerstone of financial markets with automation and data-driven strategies driving the majority of transactions. This workshop provides a comprehensive hands-on introduction to the world of algorithmic trading aimed at students and professionals interested in financial markets who want to move beyond academic exercises and engage with real-world scenarios. Participants will develop a trading strategy using real-world data such as Bitcoin prices or stocks from the S&P 500 with the initial focus on achieving profitability under the assumption of no transaction fees. The 3-hour session will guide participants through the end-to-end process of designing implementing and backtesting a deep learning-based trading strategy. The workshop will be structured as follows: - Data Preparation: Participants will learn how to source and preprocess financial data preparing it for model input. - Feature Extraction: We will introduce simple features from the data to feed into a deep learning model. - Loss Function Design: The workshop will cover how to design a loss function tailored to trading strategies and objectives. - Model Training: Participants will implement and train a deep learning model using PyTorch. The workshop will focus on how to properly train models in low-data environments ensuring the model generalizes effectively by employing robust train/validation/test splits. Backtesting: We will backtest the model’s performance allowing participants to understand its strengths and limitations in various market conditions. Rather than dividing the session into theoretical and practical segments the entire workshop will seamlessly integrate theory and implementation. Participants will work on a live trading scenario continuously applying new knowledge as they progress through the workshop. By the end they will have created a complete training pipeline—from data preparation to model training and backtesting. A critical aspect of this workshop will be understanding the practical challenges of implementing algorithmic trading strategies in real-world markets. We will discuss the limitations of deep learning models when applied to financial data including how to mitigate overfitting in low-data environments. Additionally participants will explore the impact of transaction fees slippage and other market inefficiencies learning how these factors affect profitability. The workshop is designed for participants with a basic understanding of Python and machine learning but no prior experience with algorithmic trading is required. By the end of the session attendees will have gained practical skills in designing and implementing a trading strategy along with valuable insights into the intricacies of algorithmic trading in financial markets. Participants will be provided with all necessary code templates and data and they are expected to bring their own laptops to engage fully in the hands-on aspects of the workshop. This workshop offers a unique opportunity to bridge the gap between theoretical learning and real-world financial applications equipping participants with the tools and knowledge to pursue further exploration in algorithmic trading.	InstructLab: plug your knowledge into a model easily Room D6 Tomáš Tomeček, Red Hat Cedric Clyburn, Red Hat Karel Piwko, RedHat During this hands-on exercise you will learn what is InstructLab and how you can leverage it to easily extend Large Language Models with your data and run them on your infrastructure. The tool makes it easy to download run and chat with models locally on your laptop. InstructLab is a fully open-source project from Red Hat and the MIT-IBM Watson AI Lab that introduces Large-scale Alignment for chatBots (LAB). The paper behind it: https://arxiv.org/abs/2403.01081 The LAB method is driven by taxonomies which are largely created manually and with care. For a taxonomy you supply InstructLab then can generate synthetic data used to train a model. Everyone who has experience with LLMs can greatly benefit from this workshop. We will create our own knowledge documents use InstructLab to generate synthetic data out of them train a model from the data and chat with them.	Accelerating AI Through Human Knowledge: Teaching to Imitate Experts and Win on the Race Track Room D7 Alexander Buchelt, St. Pölten University of Applied Sciences Tobias Kietreiber, St. Pölten University of Applied Sciences In this 3-hour hands-on workshop participants will explore the exciting world of Imitation Learning a powerful technique in artificial intelligence that allows agents to mimic expert behavior and excel in complex environments. Building on the fundamentals of Reinforcement Learning this workshop introduces the theory behind Imitation Learning and demonstrates how it can be applied to solve real-world problems efficiently. By guiding AI through expert demonstration imitation learning accelerates training especially in environments where traditional reinforcement learning might be time-consuming or difficult. Imitation Learning is crucial for AI systems that need to learn from limited data or human expertise such as autonomous driving robotics and gaming. In contrast to trial-and-error methods in reinforcement learning imitation learning allows models to replicate the strategies of experienced individuals drastically reducing training time and improving performance. Attendees will gain a deep understanding of how this approach combines the best of both supervised and reinforcement learning creating smarter faster decision-making systems.
12:30 – 14:00	Lunch
14:00 – 17:30 coffee break 15:30 – 16:00	3D reconstruction from Images and their application Room D2 Varun Burde, CIIRC, Czech Technical University Artem Moroz, CIIRC, Czech Technical University Vit Zeman, CIIRC, Czech Technical University This workshop will delve into recent advances in 3D computer vision and provide participants with practical hands-on experience in generating 3D reconstructions from image data. Attendees will explore cutting-edge techniques including neural radiance fields (NeRFs) Gaussian splatting multi-view stereo (MVS) and structure from motion (SfM) for surface reconstruction. The session will cover the fundamentals of 3D reconstruction focusing on how modern algorithms transform 2D images into detailed and accurate 3D models. Additionally participants will learn the essential steps of dataset creation and optimization for training advanced 3D reconstruction methods. A key feature of the workshop will involve capturing a set of images of objects and demonstrating how to systematically collect and organize data to ensure high-quality 3D model generation. Attendees will gain experience in building datasets tailored to different 3D reconstruction techniques such as NeRF and Gaussian splatting and optimizing them for improved accuracy and visual fidelity. This workshop is ideal for researchers engineers and enthusiasts seeking to understand the latest in 3D vision technologies with applications ranging from augmented reality and robotics to digital content creation.	A practical guide to LLM-based AI agents Room D3 Philipp Wendland, Deloitte Consulting This hands-on workshop is designed to provide participants with an in-depth practical understanding of how to leverage Large Language Models (LLMs) to create intelligent AI agents. As the rise of generative AI continues to transform industries it’s becoming increasingly important for both AI professionals and business leaders to understand the capabilities and implementation strategies for LLM-based systems. The Deloitte AI Institute focusses on brining AI expertise to clients across all industries ranging from innovation over strategy to capability building and scaling. Phillips’s strong technical background in physics / computer science enables him to bridge the gap between business and technology. 1: Introduction to LLM-based agents - Overview of the concept and architecture of AI agents - Introduction to popular frameworks to expedite agent development 2: Hands-on implementation - Guide participants through building a simple LLM-based AI agent using a given framework - Allow for customisation to demonstrate the flexibility and effectiveness of AI agents 3: Industry-Application - Outlook on newest developments of AI agents across various industries - Outlook on the potential of generative Ai and AI agents in particular across industries By the end of the workshop participants will have a solid understanding of the concepts behind LLM-based AI agents and hands-on experience in building these systems themselves. Led by an experienced facilitator from Deloitte’s AI Institute this workshop promises to provide valuable skills and knowledge that participants can leverage in their careers. This workshop is ideal for AI practitioners developers and researchers eager to explore the latest advancements in generative AI and apply LLM-driven automation in their respective fields.	Synthetic Data Generation for Embedding Model Fine-Tuning Room D4 Stefan Josef, O2/Dataclair Filip Roskovec, O2/Dataclair Ondřej Čermák, O2/Dataclair Retrieving information from documents in non-English and domain-specific languages presents a challenge for many organizations. While general embedding models are powerful they often fall short when dealing with specialized terminology not encountered in their training data. This workshop offers a practical approach to addressing these issues: using a combination of real and synthetic data to build robust datasets for fine-tuning open embedding models. The workshop consists of two parts. First we provide an overview of embedding models fine-tuning techniques and methods for generating synthetic data tailored to these approaches. In the second part participants will engage in a hands-on session to generate synthetic data for fine-tuning their own models.	Parallel Genetic Algorithms in Python Room D6 Jakub Tomasz Gnyp, International Centre for Theory of Quantum Technologies Agata Gurzynska, PricewaterhouseCoopers In this workshop we delve into the construction and implementation of parallel genetic algorithms (PGAs) using Python. Genetic algorithms (GAs) and evolutionary algorithms in general are powerful tools for solving optimization problems and when parallelized they offer significant speedups and efficiency improvements. Participants who learn PGAs will also be able to apply them in reinforcement learning. The workshop will have a limited amount of mathematics - instead the focus will be on both the idea behind PGAs and practical coding skills. Starting with the PyGAD library its uses and limitations will be discussed and presented with easy-to-understand examples. Later key aspects of parallel programming will be introduced such as recognizing CPU- and I/O-bound operations and the use of processes and threads respectively. Global lock in Python will be addressed as well as racing conditions. Therefore a basic understanding and implementation of locks barriers flags and shared memory in general will be achieved. To illustrate the practical applications of parallel genetic algorithms apart from minor examples the workshop features three major case studies. The first involves solving a labyrinth demonstrating how a parallel genetic algorithm can efficiently navigate complex search spaces and de facto interact with an environment. Participants will observe how the parallelization of GAs can lead to faster convergence on optimal paths compared to sequential approaches. Diversity in population will be addressed as well. The second case study explores the application of parallel genetic algorithms in quantum cryptography. In this domain GAs can optimize parameters for quantum key distribution protocols enhancing especially efficiency. By parallelizing the algorithm we can tackle the computational challenges of the vast solution spaces inherent in quantum cryptographic systems. The BB84 protocol will be the protocol in question explained without the quantum mechanics' mathematical rigor and the essentials of the protocol will already be implemented. The third and last case study will be a neural network in which hyperparameters will be optimized by a PGA in a Genetically Reinforced Learning scheme. Knowing how the PGA may interact with an environment and work on even very complicated functions this optimization task will be an easy step for those who have already seen the neural network. By the end of the workshop participants will have a solid understanding of how to implement and apply parallel genetic algorithms in Python with practical insights into their strengths and limitations. They will be equipped with the knowledge to extend these techniques to other domains fostering innovation in computational problem-solving.	Real-Time Anomaly Detection and Alerting in Financial Markets Using Stream Processing Room D7 Ben Gamble, Ververica Tun Shwe, Freelance In the world of financial markets the ability to detect and act on anomalies in real-time is crucial. This workshop will explore how to build a stream processing system that not only detects rapid changes in stock prices but also calculates key stock market indicators like the Relative Strength Index (RSI) Moving Average Convergence Divergence (MACD) or Bollinger Bands in real-time. Attendees will learn how to calculate these indicators in real-time to identify potential buy or sell signals and trigger instant alerts such as Slack messages to notify users of significant market movements or even directly call API to buy/sell instruments. At the end we will discuss and later build a stream processing pipeline in the IDE using the ML model. Attendees will learn about stream processing and how to use it to implement a real-time system for calculating key stock market indicators like RSI MACD and Bollinger Bands and how to use these indicators to detect anomalies and act on them. On top of that they will learn how to use ML models in their pipelines to move decision-making to the next level.

Room D2

Room D3

Room D4

Room D6

Room D7

09:00 – 12:30 coffee break
10:30 – 11:00

Utilizing Large Language Models for improved anti-tracking in web browsers

Room D2

Humera Noor Minhas, Digital Munich

Online tracking remains a significant privacy concern for internet users. Current solutions while effective have limitations in terms of coverage maintenance and precision. This workshop aims to leverage the power of LLMs to create a more robust adaptive and efficient anti-tracking system. We will explore the architecture of an LLM-based anti-tracking system developing the data pipeline and exploring how these models can be fine-tuned to analyze network requests page content and user interactions in real-time. The system's ability to understand the semantic context of web elements allows for more accurate identification of tracking attempts reducing false positives while improving detection rates of sophisticated trackers. A key focus will be on the practical challenges of implementing such a system within the constraints of a web browser environment. We'll discuss strategies for optimizing LLM inference to meet the real-time demands of browsing balancing accuracy with performance.

Beyond Real-World Limitations - Mastering Synthetic Data Generation for Enhanced ML Performance

Room D3

Tomáš Sikora, Emplifi
Kryštof Šaml, Emplifi

As machine learning pushes into new frontiers the demand for diverse and representative datasets frequently outpaces the availability of real-world data. This workshop explores a spectrum of advanced techniques for synthetic data generation from traditional methods to cutting-edge AI agent-driven approaches. By embracing the principles of data-centric AI we'll start with traditional data generation techniques and progress to innovative AI agent-driven strategies. Throughout the workshop we'll demonstrate techniques that promise to overcome limitations in real-world datasets reshape the data landscape in ML and rigorously evaluate the quality of the generated data. Workshop Overview: - Setting the Stage: The Data Challenge in ML We'll begin with a brief historical context touching on pre-LLM approaches to synthetic data generation. This background will highlight the impact of recent advancements and set the stage for our deep dive into modern techniques. - Data-Centric AI: A Paradigm Shift We'll explore how the data-centric AI movement is reshaping our approach to machine learning. Participants will learn why focusing on data quality and targeted synthetic data generation can often yield better results than simply increasing dataset size or model complexity. -Evolution of Synthetic Data Generation with AI -LLM-Powered Data Creation: Leveraging large language models for synthetic data generation. -Multi-Agent Systems: Advancing to manually designed agent ecosystems for nuanced and accurate data production. -Automated Agent Workflows: Exploring the cutting edge with self-optimizing agent interactions for superior data quality. -Robust Evaluation in the Data-Centric Paradigm We'll emphasize rigorous evaluation techniques aligned with data-centric AI principles ensuring the effectiveness of synthetic data in real-world ML applications. This workshop is ideal for ML practitioners researchers and data scientists looking to overcome data quality and scarcity challenges. Participants should have a basic understanding of machine learning concepts and some experience with programming in Python. By the end of this workshop attendees will have a comprehensive understanding of how data-centric AI principles can be applied to synthetic data generation from LLM-based techniques to state-of-the-art agent-driven approaches. They'll be equipped with practical skills to implement these strategies potentially revolutionizing how they tackle data-related challenges in ML projects.

Introduction to Algorithmic Trading: Hands-On Strategy Implementation with Real-World Data

Room D4

Szymon Bubak, Jiai
Jérémy Cochoy, Redstone Solution OÜ

Algorithmic trading has become a cornerstone of financial markets with automation and data-driven strategies driving the majority of transactions. This workshop provides a comprehensive hands-on introduction to the world of algorithmic trading aimed at students and professionals interested in financial markets who want to move beyond academic exercises and engage with real-world scenarios. Participants will develop a trading strategy using real-world data such as Bitcoin prices or stocks from the S&P 500 with the initial focus on achieving profitability under the assumption of no transaction fees. The 3-hour session will guide participants through the end-to-end process of designing implementing and backtesting a deep learning-based trading strategy. The workshop will be structured as follows: - Data Preparation: Participants will learn how to source and preprocess financial data preparing it for model input. - Feature Extraction: We will introduce simple features from the data to feed into a deep learning model. - Loss Function Design: The workshop will cover how to design a loss function tailored to trading strategies and objectives. - Model Training: Participants will implement and train a deep learning model using PyTorch. The workshop will focus on how to properly train models in low-data environments ensuring the model generalizes effectively by employing robust train/validation/test splits. Backtesting: We will backtest the model’s performance allowing participants to understand its strengths and limitations in various market conditions. Rather than dividing the session into theoretical and practical segments the entire workshop will seamlessly integrate theory and implementation. Participants will work on a live trading scenario continuously applying new knowledge as they progress through the workshop. By the end they will have created a complete training pipeline—from data preparation to model training and backtesting. A critical aspect of this workshop will be understanding the practical challenges of implementing algorithmic trading strategies in real-world markets. We will discuss the limitations of deep learning models when applied to financial data including how to mitigate overfitting in low-data environments. Additionally participants will explore the impact of transaction fees slippage and other market inefficiencies learning how these factors affect profitability. The workshop is designed for participants with a basic understanding of Python and machine learning but no prior experience with algorithmic trading is required. By the end of the session attendees will have gained practical skills in designing and implementing a trading strategy along with valuable insights into the intricacies of algorithmic trading in financial markets. Participants will be provided with all necessary code templates and data and they are expected to bring their own laptops to engage fully in the hands-on aspects of the workshop. This workshop offers a unique opportunity to bridge the gap between theoretical learning and real-world financial applications equipping participants with the tools and knowledge to pursue further exploration in algorithmic trading.

InstructLab: plug your knowledge into a model easily

Room D6

Tomáš Tomeček, Red Hat
Cedric Clyburn, Red Hat
Karel Piwko, RedHat

During this hands-on exercise you will learn what is InstructLab and how you can leverage it to easily extend Large Language Models with your data and run them on your infrastructure. The tool makes it easy to download run and chat with models locally on your laptop. InstructLab is a fully open-source project from Red Hat and the MIT-IBM Watson AI Lab that introduces Large-scale Alignment for chatBots (LAB). The paper behind it: https://arxiv.org/abs/2403.01081 The LAB method is driven by taxonomies which are largely created manually and with care. For a taxonomy you supply InstructLab then can generate synthetic data used to train a model. Everyone who has experience with LLMs can greatly benefit from this workshop. We will create our own knowledge documents use InstructLab to generate synthetic data out of them train a model from the data and chat with them.

Accelerating AI Through Human Knowledge: Teaching to Imitate Experts and Win on the Race Track

Room D7

Alexander Buchelt, St. Pölten University of Applied Sciences
Tobias Kietreiber, St. Pölten University of Applied Sciences

In this 3-hour hands-on workshop participants will explore the exciting world of Imitation Learning a powerful technique in artificial intelligence that allows agents to mimic expert behavior and excel in complex environments. Building on the fundamentals of Reinforcement Learning this workshop introduces the theory behind Imitation Learning and demonstrates how it can be applied to solve real-world problems efficiently. By guiding AI through expert demonstration imitation learning accelerates training especially in environments where traditional reinforcement learning might be time-consuming or difficult. Imitation Learning is crucial for AI systems that need to learn from limited data or human expertise such as autonomous driving robotics and gaming. In contrast to trial-and-error methods in reinforcement learning imitation learning allows models to replicate the strategies of experienced individuals drastically reducing training time and improving performance. Attendees will gain a deep understanding of how this approach combines the best of both supervised and reinforcement learning creating smarter faster decision-making systems.

12:30 – 14:00

Lunch

14:00 – 17:30 coffee break
15:30 – 16:00

3D reconstruction from Images and their application

Room D2

Varun Burde, CIIRC, Czech Technical University
Artem Moroz, CIIRC, Czech Technical University
Vit Zeman, CIIRC, Czech Technical University

This workshop will delve into recent advances in 3D computer vision and provide participants with practical hands-on experience in generating 3D reconstructions from image data. Attendees will explore cutting-edge techniques including neural radiance fields (NeRFs) Gaussian splatting multi-view stereo (MVS) and structure from motion (SfM) for surface reconstruction. The session will cover the fundamentals of 3D reconstruction focusing on how modern algorithms transform 2D images into detailed and accurate 3D models. Additionally participants will learn the essential steps of dataset creation and optimization for training advanced 3D reconstruction methods. A key feature of the workshop will involve capturing a set of images of objects and demonstrating how to systematically collect and organize data to ensure high-quality 3D model generation. Attendees will gain experience in building datasets tailored to different 3D reconstruction techniques such as NeRF and Gaussian splatting and optimizing them for improved accuracy and visual fidelity. This workshop is ideal for researchers engineers and enthusiasts seeking to understand the latest in 3D vision technologies with applications ranging from augmented reality and robotics to digital content creation.

A practical guide to LLM-based AI agents

Room D3

Philipp Wendland, Deloitte Consulting

This hands-on workshop is designed to provide participants with an in-depth practical understanding of how to leverage Large Language Models (LLMs) to create intelligent AI agents. As the rise of generative AI continues to transform industries it’s becoming increasingly important for both AI professionals and business leaders to understand the capabilities and implementation strategies for LLM-based systems. The Deloitte AI Institute focusses on brining AI expertise to clients across all industries ranging from innovation over strategy to capability building and scaling. Phillips’s strong technical background in physics / computer science enables him to bridge the gap between business and technology. 1: Introduction to LLM-based agents - Overview of the concept and architecture of AI agents - Introduction to popular frameworks to expedite agent development 2: Hands-on implementation - Guide participants through building a simple LLM-based AI agent using a given framework - Allow for customisation to demonstrate the flexibility and effectiveness of AI agents 3: Industry-Application - Outlook on newest developments of AI agents across various industries - Outlook on the potential of generative Ai and AI agents in particular across industries By the end of the workshop participants will have a solid understanding of the concepts behind LLM-based AI agents and hands-on experience in building these systems themselves. Led by an experienced facilitator from Deloitte’s AI Institute this workshop promises to provide valuable skills and knowledge that participants can leverage in their careers. This workshop is ideal for AI practitioners developers and researchers eager to explore the latest advancements in generative AI and apply LLM-driven automation in their respective fields.

Synthetic Data Generation for Embedding Model Fine-Tuning

Room D4

Stefan Josef, O2/Dataclair
Filip Roskovec, O2/Dataclair
Ondřej Čermák, O2/Dataclair

Retrieving information from documents in non-English and domain-specific languages presents a challenge for many organizations. While general embedding models are powerful they often fall short when dealing with specialized terminology not encountered in their training data. This workshop offers a practical approach to addressing these issues: using a combination of real and synthetic data to build robust datasets for fine-tuning open embedding models. The workshop consists of two parts. First we provide an overview of embedding models fine-tuning techniques and methods for generating synthetic data tailored to these approaches. In the second part participants will engage in a hands-on session to generate synthetic data for fine-tuning their own models.

Parallel Genetic Algorithms in Python

Room D6

Jakub Tomasz Gnyp, International Centre for Theory of Quantum Technologies
Agata Gurzynska, PricewaterhouseCoopers

In this workshop we delve into the construction and implementation of parallel genetic algorithms (PGAs) using Python. Genetic algorithms (GAs) and evolutionary algorithms in general are powerful tools for solving optimization problems and when parallelized they offer significant speedups and efficiency improvements. Participants who learn PGAs will also be able to apply them in reinforcement learning. The workshop will have a limited amount of mathematics - instead the focus will be on both the idea behind PGAs and practical coding skills. Starting with the PyGAD library its uses and limitations will be discussed and presented with easy-to-understand examples. Later key aspects of parallel programming will be introduced such as recognizing CPU- and I/O-bound operations and the use of processes and threads respectively. Global lock in Python will be addressed as well as racing conditions. Therefore a basic understanding and implementation of locks barriers flags and shared memory in general will be achieved. To illustrate the practical applications of parallel genetic algorithms apart from minor examples the workshop features three major case studies. The first involves solving a labyrinth demonstrating how a parallel genetic algorithm can efficiently navigate complex search spaces and de facto interact with an environment. Participants will observe how the parallelization of GAs can lead to faster convergence on optimal paths compared to sequential approaches. Diversity in population will be addressed as well. The second case study explores the application of parallel genetic algorithms in quantum cryptography. In this domain GAs can optimize parameters for quantum key distribution protocols enhancing especially efficiency. By parallelizing the algorithm we can tackle the computational challenges of the vast solution spaces inherent in quantum cryptographic systems. The BB84 protocol will be the protocol in question explained without the quantum mechanics' mathematical rigor and the essentials of the protocol will already be implemented. The third and last case study will be a neural network in which hyperparameters will be optimized by a PGA in a Genetically Reinforced Learning scheme. Knowing how the PGA may interact with an environment and work on even very complicated functions this optimization task will be an easy step for those who have already seen the neural network. By the end of the workshop participants will have a solid understanding of how to implement and apply parallel genetic algorithms in Python with practical insights into their strengths and limitations. They will be equipped with the knowledge to extend these techniques to other domains fostering innovation in computational problem-solving.

Real-Time Anomaly Detection and Alerting in Financial Markets Using Stream Processing

Room D7

Ben Gamble, Ververica
Tun Shwe, Freelance

In the world of financial markets the ability to detect and act on anomalies in real-time is crucial. This workshop will explore how to build a stream processing system that not only detects rapid changes in stock prices but also calculates key stock market indicators like the Relative Strength Index (RSI) Moving Average Convergence Divergence (MACD) or Bollinger Bands in real-time. Attendees will learn how to calculate these indicators in real-time to identify potential buy or sell signals and trigger instant alerts such as Slack messages to notify users of significant market movements or even directly call API to buy/sell instruments. At the end we will discuss and later build a stream processing pipeline in the IDE using the ML model. Attendees will learn about stream processing and how to use it to implement a real-time system for calculating key stock market indicators like RSI MACD and Bollinger Bands and how to use these indicators to detect anomalies and act on them. On top of that they will learn how to use ML models in their pipelines to move decision-making to the next level.

Saturday, March 21
Workshops

O2 Universum, Českomoravská 2345/17a, 190 00, Praha (and on-line)

Registration from 9:00

09:50 – 10:00

Welcome to ML Prague 2025

10:00 – 10:30

Towards Production-Ready Czech LLMs with Continuous Pretraining

Ondřej Filip, Seznam.cz

Seznam has been developing custom large language models (LLMs) with a focus on the Czech language and the wide range of Internet services we provide. At the dawn of open-weight models, these early systems lacked several critical capabilities, such as generating fluent and natural Czech text efficiently, understanding nuanced cultural contexts, and managing long-form content. This talk will explore how we addressed these limitations during that formative period and share insights from our journey toward production-ready Czech LLMs.

10:30 – 11:00

Data, your worst enemy?

Johan Loeckx, Vrije Universiteit Brussel

Ever more decisions are driven by advanced, nonlinear data analysis, where the validity, correctness, and fairness of the outcomes are often assumed but difficult to guarantee in practice. We increasingly rely on the output of algorithmic systems (broader than just LLMs) without fully understanding how they arrive at their results. Although much attention has been paid to the validity and fairness of individual predictions or models, the broader topic of AI engineering and its impact remains relatively unexplored.

AI system design is primarily performed by humans tasked with ensuring that operationalization aligns with business objectives. Currently, alignment is handled opaquely by Data Scientists and/or quantified through performance and fairness metrics.

However, many mistakes occur during design—such as violating causality, linearity, or independence constraints, or introducing bias through seemingly minor engineering choices—due to ignorance or the inability to manage complexity. These issues are typically undetectable by metrics and difficult for humans to identify because of the complex interactions between decisions.

In this talk, we will demonstrate how existing approaches fall short and explain how we believe a human-centric, knowledge-driven AutoML architecture and methodology can make the design process more scientific and systems more trustworthy.

Our goal is to combine the strengths of humans and machines, making the AutoML process explainable and leveraging domain knowledge in the synthesis of pipelines and features to ensure alignment. The architecture explores several novel ideas: first, the construction of pipelines and deep features is approached in a unified way. Next, synthesis is driven by a shared knowledge system, interactively queried to determine which pipeline operations to use or features to compute. Lastly, the synthesis process makes decisions at runtime using partial solutions and the results of their application on data. This approach enables interactive collaboration between humans and machines.

11:00 – 11:30

Lies, Damn Lies and Gen AI

Jon McLoone, Wolfram Research

While there has been much excitement about the potential of large language models (LLMs) to automate tasks that previously required human intelligence or creativity, many early projects have failed because of LLMs’ innate willingness to lie. The presentation explores the nature, cause and consequences of this “hallucination” issue and proposes a solution.

By combining generative AI with more traditional symbolic AI, reliability can be maintained, explainability improved and private knowledge and data injected. The talk will show simple examples of combining language-based thinking with computational thinking to generate solutions that neither could achieve on its own.

An example application of an AI scientific research assistant will be shown that brings together the ideas presented in a most demanding real-world task, where false information is not acceptable.

11:30 – 13:00

LUNCH & POSTER SESSION

Poster session:

- Generating 360-Degree Videos for Immersive Therapeutic, Educational, and Tourism Experiences: Techniques and Challenges (Lakshmi Babu Saheer)

- Vehicle Recommendations using Heterogeneous Data (Matej Škrabić)

- Promptbook: A Paradigm Shift in AI Native Development (Jirka Jahn)

- Prove What You Compute: Cryptographic Verification for Inference Pipelines (Michele Dallachiesa)

- Contrastive Forecasting: Latent-Space Time Series Prediction Using Contrastive Divergence (Jeremy Cochoy)

- Deep Learning Surrogates for Efficient Upscaling in 3D Fractured Media (Martin Špetlík)

- Generating Data Insights with LLMs by Querying Tables (Kristýna Onderková)

- How reliable are LLM-generated summaries? (Patrícia Schmidtová)

- Discovering Action Rules for Counterfactual Explanations in Python (Lukas Sykora)

- NV-Retriever - How to train state-of-the-art embedding models for information retrieval (Yauhen Babakhin)

13:00 – 13:30

An introduction to protein structure prediction

Joseph Pareti, Joseph Pareti's AI consulting

Protein structure prediction has become a cornerstone of modern drug discovery, offering crucial insights for drug-target interactions. This report examines three major computational approaches within an integrated drug development workflow. The primary focus is on AlphaFold 3's architecture, illustrated through simplified implementations of its key components - the Pairformer and Diffusion modules. These demonstration programs provide practical insight into how the system transforms amino acid sequences into accurate three-dimensional structures. RosettaFold is analyzed as a complementary approach, highlighting its comparable performance and distinct advantages in protein design applications. The report also evaluates lightweight protein language models as resource-efficient alternatives for organizations with limited computational infrastructure. Through comparative analysis of these approaches, different prediction strategies can be optimally deployed within the drug development pipeline based on specific requirements for accuracy, speed, and available computational resources.

13:30 – 14:00

Mammography solved by AI?

Ivan Cimrák, University of Zilina

Breast cancer screening is an indispensable tool in the early detection of one of the most prevalent cancers among women worldwide. However, one persistent challenge lies in accurately diagnosing clusters of microcalcifications (MCs), which are small calcium deposits within the breast tissue. These clusters are notoriously diverse in appearance, making it difficult to distinguish benign from malignant cases with precision. This diagnostic uncertainty often results in unnecessary biopsies, causing undue stress for patients and placing additional burdens on healthcare systems. To address this, we turned to the transformative potential of artificial intelligence (AI), specifically convolutional neural networks (CNNs), to refine the accuracy of breast cancer screening.

Our study investigates two distinct classification approaches using CNNs: a traditional binary method (classifying MCs as either benign or malignant) and a more advanced three-class method (introducing a third category for non-MC cases). Leveraging two robust datasets—the Curated Breast Imaging Subset of the Digital Database for Screening Mammography and the Optimam Database—we identified ResNet-101 as the most effective CNN architecture for this task. To deepen our understanding of the model’s decision-making process, we employed Grad-CAM visualizations, which highlight the regions of mammogram images that most influence the AI’s predictions.

The results revealed a stark contrast between the two approaches. The binary classification model achieved an accuracy of 74.7% and a Matthews correlation coefficient (MCC) of 0.458. While promising, the model displayed notable limitations in interpretability, often relying on surrounding breast tissue rather than focusing directly on the MCs. Additional challenges arose from benign abnormalities, imaging artifacts, breast implants, and excessive black backgrounds in mammogram patches, further complicating accurate predictions.

In contrast, the three-class model brought a leap in performance, achieving an impressive accuracy of 91.7% and an MCC of 0.767. By introducing a third classification category, this model demonstrated an enhanced ability to isolate and evaluate microcalcifications with greater precision. However, this advancement came with a new challenge: vascular calcifications, which were not adequately represented in the training datasets, were occasionally misclassified. These findings underscore the importance of robust, comprehensive datasets to ensure accurate and reliable AI models.

Our research highlights the immense potential of AI to revolutionize breast cancer screening by improving diagnostic accuracy, reducing unnecessary procedures, and enhancing the overall efficiency of healthcare delivery. The significant improvement achieved with the three-class classification approach offers a glimpse into a future where AI can act as a trusted partner for radiologists. However, our findings also emphasize the critical need for continuous refinement of these systems. Addressing the misclassification of vascular calcifications and expanding datasets to encompass a broader range of imaging scenarios will be pivotal in unlocking the full potential of this technology. In this era of rapid technological advancement, our study takes a vital step toward combining cutting-edge AI with human expertise to create a more precise and patient-centered approach to breast cancer detection.

14:00 – 14:30

End-to-end Stroke imaging analysis, using reservoir computing-based effective connectivity, and interpretable AI

Alessandro Crimi, AGH University of Krakow

We present an End-to-end AI framework for directed graphs including explainable AI.

This is a machine learning pipeline combining reservoir computing and directed graph analysis to model brain connectivity in stroke patients using MRI data. Effective connectivity is derived via reservoir computing, enabling the creation of directed graph representations. These graphs are classified using a directed graph convolutional networ. Explainable AI tools provide insights into disrupted brain networks, elucidating biomarkers for stroke classification and enhancing clinical interpretability. This approach highlights the potential of machine learning to improve patient stratification in stroke and other brain diseases. The technical innovations are related to reservoir computing networks, directed graph analysis, and explainable AI of effective brain connectivity.

14:30 – 15:00

COFFEE BREAK

15:00 – 15:30

Attentive interpretable models for scalable content recommendation in mobile games

Martin Dlask, King

Candy Crush Saga, a popular mobile game with millions of monthly players, leverages AI to ensure gameplay remains engaging, relevant, and fair. The talk consists of two parts. The first part focuses on introducing a novel approach for content recommendation using attentive networks, which was published in RecSys in 2024. We describe the architecture, experimental results, and a scale-adaptive algorithm for fair and relevant recommendations of offers and rewards for completing in-game quests. The second part outlines the implementation of a scalable prediction system for millions of players. We introduce a serving architecture for mobile games, including a typical client-server model. Alongside the talk, we present practical advice on ensuring a robust implementation of recommender systems: detecting degenerate feedback loops, preventing loss of relevance over time, overcoming cold-start problems, and removing bias.

15:30 – 16:00

Evolution of Recommendation System: from ANN to Ensemble of Scorers

Raid Arfua, GR8 Tech

This talk draws from my recent article, which chronicles the evolution of a sports event recommendation system from its early neural network based approach — implemented well before Deep Learning gained mainstream popularity — into a more effective ensemble of simpler scoring methods. These methods range from traditional statistical formulas to sophisticated similarity algorithms leveraging sparse matrix structures.

In this session, we will delve into the technical details of working with sparse matrices, as well as the development and application of the “Hyperbolic Score” evaluation metric, examining why it can be particularly effective under certain conditions. We will also discuss the importance of building a robust Data Platform as a foundation for rapid experimentation, ultimately enabling more efficient development and iteration of recommendation engines.

Beyond the established techniques, I will introduce several thought-provoking ideas for integrating large language models (LLMs) into recommender systems. Additionally, we will explore strategies for balancing model complexity with interpretability, ensuring that both accuracy and transparency remain central concerns in model design.

My hope is that these insights and practical considerations will provide the ML and Data Science community with valuable perspectives on building robust, scalable, and explainable Recommendation Systems.

16:00 – 16:30

Estimating online behavior of ad hoc cohorts using context-dependent weighing of panel participants

Ariel Azia, Similarweb

As much of human activity has moved to the internet in the last few decades, businesses must take into account behavior of individuals and groups online; trends of website or mobile application usage, volume of terms used in search engines, and the popularity pages of specific products and services can all contribute to an understanding of online activity patterns, allowing companies to make informed strategic decisions. Similarweb is one such company to provide these digital insights, termed digital business intelligence, for a myriad of customers and use cases. In the process of generating these metrics, Similarweb combines a small set of actual data with more extensive information from a selected panel of online users.

A particularly difficult task for Similarweb is the estimation of ad hoc cohorts of users, as clients require the freedom to query and compare their relevant online behavior metric for their specific use case. As it is almost impossible to generate a priori estimations of every possible cohort, it is substantially useful to measure the interaction between each specific user within the cohort and the cohort itself and assign it an individual weight. This weighing problem is similar in essence to problems in recommendation, whose solutions are well established, but is further complicated by sampling biases within the available actual data, biases within the panel users, the sparsity of interactions between users and cohorts, and the multitude of possible cohorts. We briefly present an earlier approach we have adopted i.e. assigning each panel user a singular weight and applying simple rescaling within a given cohort. This singular weight is useful under strong constraints, but yields poor results on ad hoc cohorts as it cannot account for the non linear nature of interaction between users and cohorts of which they are a part.

We introduce a new multistep approach to create the context dependent weighing. First, we obtain a representative embedding of both common types of cohorts (websites, search terms, product views etc.) and a respective embedding representing users using the same encoder. Second, we train a recommendation-like neural network that learns the non linear interactions between users and their cohorts. This new approach allows us to obtain both an overall sum representation as well as the inner weight distribution for an ad hoc cohort. We demonstrate the usefulness of this new approach on several examples of ad hoc cohorts. It is interesting to note that as an additional byproduct of this training process, we can extract a useful intermediate from the network that embeds both users and cohorts under the constraint of actual data and panel biases.

16:30 – 17:00

COFFEE BREAK

17:00 – 17:30

The Evolution of Virtual Buddy: From Concept to Deployment

Ondřej Finke, O2/Dataclair

Virtual Buddy, a Retrieval-Augmented Generation (RAG) system, has transformed customer care operations for the largest telecom provider in the Czech Republic. Built on O2's knowledge base, it is designed to support customer service representatives in addressing customer requests and needs. This talk presents the system's journey from its conceptualization in 2023 to its production deployment at the beginning of 2024, highlighting the technological challenges overcome to develop this state-of-the-art AI assistant. In addition to the technological aspects, significant emphasis will be placed on the business side of adoption. This includes defining precise business cases, steering development to meet specific objectives, and gradually transforming the company from merely adapting to the use of Virtual Buddy to actively reshaping other business areas to accommodate future needs for its development. Although the talk focuses on the current Virtual Buddy, we will also briefly explore future prospects and advancements for the system.

17:30 – 18:00

Intelligent Manufacturing Assistant Bot

Alexander Jesser, University Heilbronn

In a joint research project of the Institute for Intelligent Cyber-Physical Systems ICPS at Heilbronn University and the industrial partners, SABO Mobile IT GmbH and Grossebacher Systeme AG, research is being conducted into a middleware for voice assistants for global auditory human-technology interaction in the field of industrial plants.
Interactive voice bots are becoming increasingly common in consumer devices and have great potential to support operators and service personnel in operating complex machines. Due to their lack of user-friendliness, flexibility, security and integration into existing industrial solutions, conventional command controls have so far only met with limited acceptance in industrial environments. Chatbots as well as voicebots both involve so-called Natural Language Understanding (NLU) and Natural Language Processing (NLP). However, with voicebots, much more emphasis must be placed on conversation design in order to convey the conversation content in a compact yet understandable way. In addition, a robust method for recognizing voice commands is required. Thanks to their comparatively lower complexity chatbots are already widely used in a variety of applications. Chatbots are mainly used for processes that are easy to automate and always follow a similar pattern. Voicebots, on the other hand, are not or hardly ever used in industrial production environments. The ambient noise level and process reliability are major hurdles. Furthermore, natural communication is not possible with current voicebots. It can be summarized that voicebots are mainly used in acoustically quiet and less critical.
The aim of the joint research is to realize an assistant for machines that informs the operator, takes over his instructions, checks the exchanged information and thus ensures that a machine works according to the operator's wishes and within its technical capabilities. The deep integration of the intelligent manufacturing assistant bot (IMAB) into the machine software and the understanding of natural language communication is the key to controlling the machine, in contrast to a limited set of commands that humans, not machines, have to learn. The scientific innovation lies in an intelligent language assistant capable of natural language dialogs beyond the level of regular chat bots. A new intermediate layer between a language chat bot-oriented user interface and machine control enables hands-free language communication in combination with classic user interfaces.
The research path will be demonstrated in this talk. Aspects of the resource-related hardware architecture required in an industrial environment and the intelligent software architectures will be presented. In particular, intelligent noise cancellation in the speech signal under a wide range of industrial ambient noise is of crucial importance. Furthermore, the integrated speech-to-text (STT) and text-to-speech (TTS) processes are presented.
In addition, the implementation of an intelligent speaker recognition based on neural networks with third-party speaker detection and an intelligent user authentication based on voice input is demonstrated.
The presentation will cover the latest research results of the process steps within the project requirements. In particular, the scientifically innovative core issues of the research, which are essentially based on machine learning methods, will be demonstrated.

18:00 – 20:30

NETWORKING & DRINKS

Sunday, June 3rd
Conference day 1

O2 Universum, Českomoravská 2345/17a, 190 00, Praha (and on-line)

Doors open at 08:30

09:00 – 09:30

Adversarial attacks on the largest language and vision models

Stanislav Fort, Google DeepMind

Adversarial attacks pose a significant challenge to the robustness, reliability and alignment of deep neural networks from simple computer vision to hundred-billion-parameter language models. Despite their ubiquitous nature, our theoretical understanding of their character and ultimate causes, as well as our ability to successfully defend against them, are noticeably lacking. This talk examines the robustness of modern deep learning methods and the surprising scaling of attacks on them, and showcases several practical examples of transferable attacks on the largest closed-source vision-language models out there. I will conclude with a direct analogy between the problem of adversarial examples and the much larger task of general AI alignment.

09:30 – 10:00

Training AI Models for Crime Scene Fingerprint Recognition

Jakub Sochor, Innovatrics

This talk delves into critical aspects of latent fingerprint recognition in crime scene investigations, focusing on fingerprint analysis and system evaluation. We will discuss the challenge of training minutiae detectors without ground truth annotations and introduce innovative approaches using synthetically generated fingerprint data. By leveraging AI and synthetic data, we will show how these advancements can significantly enhance the accuracy and efficiency of fingerprint recognition in forensic science.

10:00 – 10:30

Understanding the neural networks through rule extraction

Tomas Pevny, Czech Technical University

Neural networks are ubiquitous yet they remain opaque for most of its users, who has very little understanding of how they store the knowledge and how the information propagates through. In this talk, I would like to share our findings from our quest to understand these phenomena. Specifically I will show the decision rules realized by neural networks and why it might be difficult to understand them without the knowledge of the data distribution. This will give us intuition why neural networks are robust yet why adversarial samples are so easy to create. Finally, we will use these tools to understand, how the decision rules compose during inference.

10:30 – 11:00

COFFEE BREAK

11:00 – 11:30

Evaluating LLM outputs with humans and LLMs

Ondřej Dušek, MFF Charles University

How well do LLMs perform on text generation tasks, and how can we tell? We present approaches based on annotating individual errors, using human evaluators as well as LLMs. For humans, we introduce our efficient annotation framework and schema. For LLM-based evaluation, we show a metric using an ensemble of open-source LLMs, which includes a reasoning for each annotated error, evaluated on various generation tasks and evaluation aspects (such as accuracy or fluency) and showing high correlation with human annotators. Both approaches allow us to use benchmarks with recent data unseen to LLMs during training, bypassing the data leakage problem that artificially inflates LLMs' performance on commonly used benchmarks.

11:30 – 12:00

Advances and Challenges in Topic Modeling of Text Documents

Martin Neznal, Productboard

In this talk, we will explore the field of topic modeling for text documents, focusing on its challenges and practical applications. I will highlight various methods for clustering text documents, enhancing clustering quality, validating results, and integrating solutions into users' daily workflows. Our presentation will share insights and lessons learned from building topic modeling inside a product that serves real customers, emphasizing the challenges of creating scalable systems that deliver real value.

I will emphasize the importance of preprocessing input documents to improve clustering quality. This includes extracting relevant elements from the text, such as entities, key phrases, or summaries. Subsequently, I will demonstrate and compare methods for clustering these text representations, such as hierarchical clustering or directly using LLMs to cluster a large set of documents of text.
In real-world scenarios, new text documents are created daily, and new clusters emerge over time. We will explore techniques to detect new clusters as they appear while maintaining the integrity of existing clusters.

Assessing the quality of discovered topics is a key challenge in topic modeling. I will provide an overview of validation techniques, ranging from traditional machine learning metrics to methods that use LLM as a judge. Furthermore, we will discuss the importance of human-in-the-loop validation processes in ensuring the relevance and accuracy of the topics.

Finally, I will share insights on improving the usability and user experience of topic modeling, including effective naming and description of clusters, and we will discuss whether users are willing to provide feedback on AI models and how this feedback can be used to refine and enhance the existing solutions.

12:00 – 12:30

Towards Real-World Fact-Checking with Large Language Models

Iryna Gurevych, Technical University of Darmstadt

Misinformation poses a growing threat to our society. It has a severe impact on public health by promoting fake cures or vaccine hesitancy, and it is used as a weapon during military conflicts to spread fear and distrust. Current research on natural language processing (NLP) for fact-checking focuses on identifying evidence and predicting the veracity of a claim. People's beliefs, however, often do not depend on the claim and rational reasoning but on credible content that makes the claim seem more reliable, such as scientific publications or visual content that was manipulated or stems from unrelated contexts. To combat misinformation, we need to show (1) "Why was the claim believed to be true?", (2) "Why is the claim false?", (3) "Why is the alternative explanation correct?". In this talk, I will zoom in on two critical aspects of such misinformation supported by credible though misleading content. Firstly, I will present our efforts to dismantle misleading narratives based on fallacious interpretations of scientific publications. Secondly, I will show how we can use multimodal large language models to (1) detect misinformation based on visual content and (2) provide strong alternative explanations for the visual content.

12:30 – 14:00

LUNCH & POSTER SESSION

Poster session:

- Shedding some light on black-box models with ML Classifier Copying (Muriel Rovira-Esteva)

- Learning Representations from Ultrasound Signals (Immanuel Roßteutscher)

- 1D-CNN for Automated Characterisation of Surface-Breaking Cracks Using Ultrasonic A-Scan Data (Thomas Beckingham)

- Leveraging Point Transformers for Detecting Anatomical Landmarks in Digital Dentistry (Kateřina Trávníčková)

- Agentic AI for Sustainability Modelling: a Graph-based Retrieval-augmented System for LCA Advisory (Loris Rodigari)

- Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning (Vasilije Markovic)

- Efficient and High-Fidelity Synthetic Data with TabularARGN: An Open-Source SDK with Imputation Capabilities (Ivona Krchova)

- Automatic Prompt Engineering with llm-as-a-customer loss function (Marcin Koralewski)

- From Data to Discovery: GeoAI Topographic Objects Change Detection (Maciej Adamiak)

- Monte Carlo Neutron Event Simulation for AI-Based Data Processing (Camille Gillespie )

14:00 – 14:30

Distributed Collaborative AI with Applications to Drones

Hava Siegelmann, University of Massachusetts Amherst

How come drones are still mainly human controlled and have such limited autonomy? First, drones operate under significant constraints, including limited computational power, energy capacity, and communication bandwidth. Reinforcement Learning fail to maintain optimal performance under such constraints. We propose sequence AI algorithms that significantly improving compute and energy efficiency. Among the key features are rapid onboard responses and adaptability in dynamic environmental changes, robustness to missing inputs, minimization of sensor usage and the ability to use cheaper sensors to greater effect, as well as making possible the use of cheaper hardware while maintaining peak effectiveness. Second issue is the need of communication and cooperation among drones. Distributed AI is known to suffer explosion of communication needs, and this is not available in realistic swarms of drones. We propose a cooperative AI where the agents are lifelong learners. On the go, they are able to update, learn from failures, and become more expert with more experience. This paradigm enables both collaborative AI without explosive communication as well as a great reduction in the required labeled data (teacher), since the agents peer-teach each other. We suggest that these two directions of research will advance us towards true safe autonomy.

14:30 – 15:00

How to feed your LLMs with data from the web

Jan Čurn, Apify

All major generative AI and Large Language Models (LLMs) have been trained using data scraped from the web.

Additionally, LLM applications often extract web data to provide up-to-date context for answers using Retrieval Augmented Generation (RAG).

However, reliably collecting online data at scale is challenging due to issues like blocking, dynamic content rendering, and the sheer volume of data.

In this talk, we will explain how you can establish an efficient web data extraction pipeline, clean the HTML to circumvent the “garbage in, garbage out” problem, and present examples of successful applications.

15:00 – 15:30

Fitting LLMs into a single GPU: Making neural networks smaller

Vladimir Macko, GrizzlyTech, former Google AI

As neural networks continue to grow in size and complexity, the demand for efficient models has never been greater. Neural network pruning and quantization have emerged as two of the most promising techniques for reducing model size and improving computational efficiency. But how do these techniques translate from theoretical research to real-world applications?

In this talk, we will explore the state of the art in neural network pruning and quantization, presenting key findings from academia alongside lessons learned from industry implementations.

Using examples from real-world projects, we will discuss practical approaches to algorithm selection, toolchain optimization, and model evaluation. Whether you're a machine learning researcher or a practitioner, this session will equip you with actionable strategies to make neural networks faster, smaller, and more efficient without compromising performance.

Join us to bridge the gap between cutting-edge research and applied machine learning, and discover how to make the most of these transformative techniques in your work.

15:30 – 16:00

COFFEE BREAK

16:00 – 17:00

PANEL DISCUSSION

Stanislav Fort, Google DeepMind
Iryna Gurevych, Technical University of Darmstadt
Jon McLoone, Wolfram Research

17:00 – 17:05

CLOSING REMARKS

Have a great time Prague, the city that never sleeps

You can feel centuries of history at every corner in this unique capital. We'll invite you to get a taste of our best pivo (that’s beer in Czech) and then bring you back to the present day at our networking event.

Venue ML Prague 2025 will run hybrid, in person and online!

The main conference as well as the workshops will be held at O2 Universum.

We will also livestream the talks for all those participants who prefer to attend the conference online. Our platform will allow interaction with speakers and other participants too. Workshops require intensive interaction and won't be streamed.

Conference building

O2 Universum
Českomoravská 2345/17a, 190 00, Praha 9

Workshops

O2 Universum
Českomoravská 2345/17a, 190 00, Praha 9

Now or never Registration

Early Bird

Sold Out

Conference days € 270
Only workshops € 200
Conference + workshops € 440

Standard

Sold Out

Conference days € 290
Only workshops € 230
Conference + workshops € 490

Late

Sold out

Conference days € 320
Only workshops € 260
Conference + workshops € 520

What You Get

Practical and advanced level talks led by top experts.
Networking and drinks with speakers and people from all around the world.
Delicious food and snacks throughout the conference.

They’re among us We are in The ML Revolution age

Machines can learn. Incredibly fast. Faster than you. They are getting smarter and smarter every single day, changing the world we’re living in, our business and our life. The artificial intelligence revolution is here. Come, learn and make this threat your biggest advantage.

Our Attendees What they say about ML Prague

That's awesome. Few days ago I attended #mlprague and it was a great success. Much diverse sessions about using LLM in outer space exploration, medical diagnosis and autonomous driving.
— Waleed El-Badry 🇪🇬 (@wbadry) May 6, 2024
#mlprague is dead, long live the #mlprague!

Fantastic workshops this year. LLM-agents in computer games (GoodAI), textual similarity for Czech (Seznam), explainability explained by Seldon&Amazon, pineapple flambé 🍍🔥, liquid cocaine🍺, 🙏 microsoft (and Hrnek) for all ☕. pic.twitter.com/tLsdphd6Jc
— Petr Simecek (@simecek) June 4, 2023
Thank you a lot #MLPrague (whole conference team) for absolutely awesome #AI #MLevent!

+doing it in style (during these hard COVID times) it really gives us all hope that by using our brains (within #MachineLearning or #DL don’t have anything stronger) we can overcome anything!
— Radovan Kavicky @radovankavickyFebruary 28, 2021
Thank you @JiriMaterna for bringing the top #MachineLearning #AI professionals together even in these tough times! #MLPrague
— Jen Bleha (@JanBleha) February 27, 2021
Woohoo! This was a lot of fun! :-D Thank you, @MLPrague , for a great hackathon and an amazing first conference day. I look forward to tomorrow :)
— Yurij Mikhalevich (@theyurij) February 27, 2021

Thank you to Our Partners

Strategic Partners

Platinum Partners

Gold Partners

Silver Partners

Communities and Further support

Would you like to present your brand to 1000+ Machine Learning enthusiasts? Send us an email at info@mlprague.com to find out how to become a ML Prague 2025 partner. Our basic partnership offer can be found here.

Become a partner

Happy to help Contact

If you have any questions about Machine Learning Prague, please e-mail us at
info@mlprague.com

Organizers

Jiří Materna
Scientific program & Co-Founder
jiri@mlprague.com

Teresa Pulda
Event production
teresa@mlprague.com

Gonzalo V. Fernández
Marketing and social media
gonzalo@mlprague.com

Jona Azizaj
Partnerships
jona@mlprague.com

Ivana Javná
Speaker support
ivana@mlprague.com

Barbora Toman Hanousková
Communication
barbora@mlprague.com

Jan Romportl
Moderator

Machine Learning Prague 2025

World class expertise and practical content packed in 3 days!

What to expect

Phenomenal Confirmed speakers

Hava Siegelmann

Stanislav Fort

Iryna Gurevych

Johan Loeckx

Ariel Azia

Jon McLoone

Ondřej Dušek

Alexander Jesser

Joseph Pareti

Ondřej Filip

Raid Arfua

Tomas Pevny

Vladimir Macko

Alessandro Crimi

Ondřej Finke

Martin Dlask

Martin Neznal

Jakub Sochor

Jan Čurn

Ivan Cimrák

Philipp Wendland

Jakub Tomasz Gnyp

Agata Gurzynska

Tobias Kietreiber

Sebastian Eresheim

Alexander Buchelt

Filip Roskovec

Stefan Josef

Ondřej Čermák

Kryštof Šaml

Tomáš Sikora

Jérémy Cochoy

Szymon Bubak

Humera Noor Minhas

Artem Moroz

Varun Burde

Vit Zeman

Tun Shwe

Ben Gamble

Tomáš Tomeček

Cedric Clyburn

Karel Piwko

Practical & Inspiring Program

Friday Workshops

Utilizing Large Language Models for improved anti-tracking in web browsers

Beyond Real-World Limitations - Mastering Synthetic Data Generation for Enhanced ML Performance

Introduction to Algorithmic Trading: Hands-On Strategy Implementation with Real-World Data

InstructLab: plug your knowledge into a model easily

Accelerating AI Through Human Knowledge: Teaching to Imitate Experts and Win on the Race Track

3D reconstruction from Images and their application

A practical guide to LLM-based AI agents

Synthetic Data Generation for Embedding Model Fine-Tuning

Parallel Genetic Algorithms in Python

Real-Time Anomaly Detection and Alerting in Financial Markets Using Stream Processing

Saturday, March 21 Workshops

Welcome to ML Prague 2025

Towards Production-Ready Czech LLMs with Continuous Pretraining

Data, your worst enemy?

Lies, Damn Lies and Gen AI

LUNCH & POSTER SESSION

An introduction to protein structure prediction

Mammography solved by AI?

End-to-end Stroke imaging analysis, using reservoir computing-based effective connectivity, and interpretable AI

COFFEE BREAK

Attentive interpretable models for scalable content recommendation in mobile games

Evolution of Recommendation System: from ANN to Ensemble of Scorers

Estimating online behavior of ad hoc cohorts using context-dependent weighing of panel participants

COFFEE BREAK

The Evolution of Virtual Buddy: From Concept to Deployment

Intelligent Manufacturing Assistant Bot

NETWORKING & DRINKS

Sunday, June 3rd Conference day 1

Adversarial attacks on the largest language and vision models

Training AI Models for Crime Scene Fingerprint Recognition

Understanding the neural networks through rule extraction

COFFEE BREAK

Friday
Workshops

Saturday, March 21
Workshops

Sunday, June 3rd
Conference day 1