The biggest European conference about ML, AI and Deep Learning applications
running in person in Prague and online.
Machine Learning Prague 2023
In cooperation with Kiwi.com
– , 2023Registration
World class expertise and practical content packed in 3 days!
You can look forward to an excellent lineup of 40 international experts in ML and AI business and academic applications at ML Prague 2023. They will present advanced practical talks, hands-on workshops and other forms of interactive content to you.
What to expect
- 1000+ Attendees
- 3 Days
- 40 Speakers
- 10 Workshops
- 1 Party
Alexander Del Toro BarbaML & Quantum Computing Lead, Google
Dr. Alexander Del Toro Barba is Machine Learning & Quantum Computing Practice Lead at Google Cloud. Alexander joined Google in 2018 as an AI Specialist, where he supported business & industry solving their most difficult problems with machine learning. Alexander contributed to DeepMind on machine learning for wind power optimisation, and Google Research for weather forecasting. Since 2020 Alexander is practice lead of the machine learning specialist team, and since 2022 Global Quantum Computing Practice Lead where he is closely collaborating with the Google Quantum AI research team.
Martin SchmidSenior Research Scientist, DeepMind
CEO & Co-Founder of EquiLibre Technologies. Previously Senior Research Scientist at DeepMind. Co-author of DeepStack and Player of Games.
Mireia Diez SánchezSenior Researcher, Brno University of Technology
Dr. Mireia Diez Sánchez is a researcher at the Speech@FIT group at Brno University of Technology. Mireia received her Electronic Engineering degree in 2009, and her Ph.D. in 2015, both from the University of the Basque Country, Spain. Her thesis focused on the study of features for Language and Speaker recognition. In 2016 she obtained an individual Marie Curie fellowship for the SpeakerDICE project dealing with diarization tasks. She has attended several international workshops dedicated to the field of speaker recognition and diarization.
Jan KulveitResearch Fellow, Future of Humanity Institute, Oxford University
Jan’s research is centered on studying the behaviour and interactions of boundedly rational agents and more generally on making AI aligned with human interests. He is also interested in modelling complex interacting systems, and strategies to influence the long-term future.
Previously he worked as a researcher at the Institute of Physics ASCR, was the Strategy Director for the Czech EA Association, and co-organized the Summer school on Human-aligned AI. His background is in theoretical physics, phase transitions, networks and complex systems.
Jon McLooneDirector of Technical Communication & Strategy, Wolfram Research
Jon McLoone is central to driving the company's technical business strategy and leading the consulting solutions team. With over 30 years of experience working with Wolfram Technologies, Jon has helped in directing software development, system design, technical marketing, corporate policy, business strategies and much more. Jon gives regular keynote appearances and media interviews on topics such as the Future of AI, Enterprise Computation Strategies and Education Reform, across multiple fields including healthcare, fintech and data science. He holds a degree in mathematics from the University of Durham. Jon is also Co-founder and Director of Development for computerbasedmath.org, an organisation dedicated to fundamental reform of maths education and the introduction of computational thinking.
Marek RosaCEO / Founder, GoodAI
Marek Rosa is the founder and CEO of GoodAI, a general artificial intelligence R&D company, and Keen Software House, an independent game development studio, started in 2010, and best known for its best-seller Space Engineers (nearing 5 million copies sold).
Marek has been interested in game development and artificial intelligence since childhood. He started his career as a programmer and later transitioned to a leadership role. After the success of Keen Software House titles, Marek was able to fund GoodAI in 2014 with a $10 Million personal investment.
Both companies now have over 100 engineers, researchers, artists, and game developers.
Marek's primary focus is the development of Space Engineers, VRAGE3 engine, AI Game, and Memetic Badger.
GoodAI's mission is to develop AGI - as fast as possible - to help humanity and understand the universe. One of the commercial stepping stones is the "AI game," which features LLM-driven NPCs grounded in the game world with developing personalities and long-term memory. GoodAI also works on autonomous agents that can self-improve and solve any task that a human can.
Olivier KochDirector of AI, Onfido
Olivier Koch is the Director of AI at Onfido, an online identity verification company. With his team, he builds new computer vision and machine learning techniques for fraud detection at a global scale, with a focus on performance and fairness. Prior to that, Olivier led the machine learning team for product recommendation at Criteo, where he led a major transformation of the Criteo engine using auto-encoders at the billion-user scale. He also led the computer vision team at Thales Optronics, building new algorithms for the French Rafale aircraft. Olivier holds a PhD in computer science from MIT and an engineering degree from ENSTA (Paris, France). His work has been published at international conferences such as ICCV, IJFR, ICRA and ICCV.
Mehrnoosh SamekiPrinciple Product Lead, Responsible AI Tooling, Microsoft
Mehrnoosh Sameki is a principal product lead at Microsoft, responsible for leading the product efforts on machine learning interpretability and fairness within the Open Source and Azure Machine Learning platform. She has cofounded Error Analysis, Fairlearn and Responsible AI Toolbox and has been a contributor to the InterpretML offering. She earned her PhD degree in computer science at Boston University, where she currently serves as an adjunct assistant professor, offering courses in responsible AI.
Uri RosenbergSpecialist Technical Manager of AI/ML Services, Amazon
Uri Rosenberg is the specialist technical manager of AI & ML services within enterprise support at Amazon Web Services (AWS) EMEA. Uri works to empower enterprise customers on all things ML: from underwater computer vision models that monitor fish to training models on satellite images in space; from optimizing costs to strategic discussions on deep learning and ethics. Uri brings his extensive experience to drive success of customers at all stages of ML adoption. Before AWS, Uri led the ML projects at at&t innovation center in Israel, working on deep learning models with extreme security and privacy constraints. Uri is also an AWS certified Lead Machine learning subject matter expert and holds an MsC. in computer science from Tel-Aviv Academic college, where his research focused on large scale deep learning models.
Alex AthorneResearch Engineer, Seldon
Alex Athorne is a Research Engineer at Seldon, where he works on open-source libraries for explainability and drift detection. He studied mathematics at Warwick and went on to do a PhD at Imperial College London in dynamical systems. He's passionate about open-source development and writing about his experiences in ML.
Lars RuddigkeitAccount Technical Strategist Swiss FedGov, Microsoft
Dr. Lars Ruddigkeit is an Account Technical Strategist at Microsoft, where he works on digital sovereignty questions with governments like the state of Switzerland. He is also an AI Speaker for Executive Briefings in Redmond, where he inspires C-Level board members to achieve more with Artificial Intelligence and take ownership of Responsible AI. He co-authored the whitepaper “Security Implications of ChatGPT” with the Cloud Security Alliance and is a global security expert in various working groups, including their Artificial Intelligence one. He holds a PhD in Chemistry from the University of Bern and an MBA from the Swiss Business School.
Martina BekrováML Team Lead, Melown Technologies
Martina Bekrova is a ML Team Lead in Melown Technologies. Her background is in mathematics, she studied Mathematical modelling in physics at Charles university in Prague. But shortly after graduation she found her passion for machine learning. In Melown Technologies she is responsible for ML Team which is applying (not only) machine learning techniques in context of 3D city reconstruction from images and point clouds in order to make the 3D city meshes a little bit smarter every day.
Martin NeznalSenior Data Scientist, Productboard
A senior data scientist at Productboard, Martin focuses on applying natural language processing (NLP) techniques to help companies process, analyze, and make sense of customer feedback. He is passionate about taking business problems and developing and deploying models that solve the underlying customer needs. In addition to NLP, Martin’s experience spans network security and customer churn, and he has a Master’s degree in Applied Mathematics from FNSPE CTU.
Alexander HagerfML Engineering Lead, Emplifi
Alexander Hagerf has over 10 years of experience in software development, having worked in many various industries e.g. retail, banking, media, and government. For the last 6 years, he's been working as a data engineer in Prague, Czech Republic. After working in GoodData & Jumpshot (Avast) he is currently the ML Engineering Lead in Emplifi working with both MLOps and Data engineering. He has talked at several meetups/conferences both in Stockholm and in Prague (e.g. Hadoop User Group, Avast Data summit) Education: MSc in Engineering Physics at the Royal Institute of Technology (KTH), thesis in ML - Statistical Relational Learning.
Fabian KovacResearch Assistant, St. Pölten University of Applied Sciences
Fabian Kovac is a 30-year-old Research Assistant at the St. Pölten University of Applied Sciences in Austria with a strong focus on Reinforcement Learning as well as sensor data and time series analysis. He finished his bachelor’s degree with distinction in Data Science & Business Analytics at the St. Pölten University of Applied Sciences in 2022 and after finishing his master’s degree, Fabian is pursuing a PhD to further work towards his dream to leave his footprints in the world.
Before starting his research career, he had a diverse background in both professional sports playing American football as well as software engineering, having worked as a Full-Stack Developer with a strong backend focus at an international company for nearly 10 years, where he also had the opportunity to lead a small software development team and to manage data-driven projects.
Piotr SkalskiML Growth Engineer, Roboflow
My name is Piotr Skalski and I am a Computer Vision Engineer with over 5 years of experience in the field. My primary focus in my work is on detection, segmentation, and tracking, but my passion for sports is what truly sets me apart. I am always looking for ways to connect sports and computer vision, and I am constantly exploring new and innovative ways to use technology to enhance sports analysis. As an active open-source contributor, I have created several repositories that have collectively gathered around 4k GitHub stars. You can find my contributions in top computer vision repositories, and I am always looking for new ways to contribute and make a positive impact in the community.
Aimira BaitievaComputer Vision Engineer, Valeo
Aimira Baitieva obtained her degree in Computer Science at the Faculty of Electrical Engineering at the Czech Technical University in Prague. During her studies, she was working at the Multi-robot Systems group at CTU. She currently works as a research engineer at Valeo, focusing on machine learning and computer vision. She is part of the team which specializes in practical applications for the automotive industry. Current projects involve creating an automated visual inspection system on production lines and usage of lidar detection to secure workstations.
Foad VafaeiCo-founder, Bladetrotter
Foad Vafaei has worked at a number of leading software companies, including Oracle, SAP, Nuance, Microsoft, and JetBrains. Foad has a BS in Electrical Engineering and a BA in Economics from the University of Massachusetts in the US. He is currently working with a stealth-mode startup in the edge Machine Learning space.
Michal DufekHead of Research & Co-Founder, Analytical Platform
Michal is the head of research & development in the field of data analytics. He graduated from the University of Economics in Prague, Faculty of Informatics and Statistics. His research interests include data analysis, Bayesian statistics, and time series modeling.
Michal KubištaHead of Data Science, Dataclair
Michal Kubišta is a passionate data scientist who is keen on process optimization and driving ethical AI practices. As the Head of Data Science at Dataclair, he leads the Data Science Guild, a group dedicated to cross-team knowledge sharing and collaboration. He is currently focused on optimizing internal processes in O2 and streamlining the customer experience. Prior to joining Dataclair, Michal worked as a data scientist in the FMCG industry, where he concentrated on developing data-driven solutions for retailers’ pricing, loyalty, and assortment operations. He holds a Master's degree in Economic theory and modeling from the Institute of Economic Studies at Charles University.
Karel ŠimánekFounder & CEO, BigHub
Karel Šimánek is a co-founder of BigHub, a technology company that's making waves with its innovative AI solutions. Karel attended the Faculty of Nuclear Sciences and Physical Engineering at the Czech Technical University, where he also met his future business partner, Tomáš Hubínek. Before founding BigHub in 2017, Šimánek worked for several consulting and banking firms, where he honed his skills in business strategy and technology implementation.
Karel is passionate about staying up-to-date with the latest trends in AI and technology and is committed to making a positive impact on society through his work. That is also why he is an active member of the data community, lecturing, organizing DATA mesh meetups, and hosting the Data Talk podcast.
Matej MurínMachine Learning Engineer, Meteopress
Matej is a graduate student at FIT CTU and a Machine Learning Engineer at Meteopress. He has experience with generative deep learning for various tasks, including image inpainting and time-series predictions using generative adversarial networks.
Aisling O’SullivanSenior Data Scientist, Dataclair
Aisling O’Sullivan is a Senior Data Scientist at O2 AICentre/Dataclair where she uses machine learning to help accelerate medical research. She has worked with pharmaceutical companies to help discover novel targets for cancer immunotherapy and other applications. She previously received her PhD in Computational Neuroscience from Trinity College Dublin where she used machine learning models to understand how the brain processes speech and language. She also spent time as a visiting PhD researcher at the University of Rochester, USA and received her degree and master’s in Biomedical Engineering from Trinity College Dublin.
Michal MarusanSenior Cloud Solution Architect, Data and AI, Microsoft
Michal Marusan is a senior cloud solution architect at Microsoft, focusing on Data & AI services, responsible for adoption and design of solutions and projects for the enterprise customers starting AI & ML workloads on Azure. He drove adoption of Azure ML and related AI services in various industries - Telco, Banking, Gaming, Retail.
Michal ŠtefánikNLP team lead, Gauss Algorithmic
Michal Štefánik is a senior language specialist in the NLP team at Gauss Algorithmic and a researcher at Masaryk University. Throughout the last six years in NLP, he led the deployment of Attention-based models to numerous NLP applications, ranging from Named Entity Recognition to Machine Translation.
Michal conducts research on enhancing the robustness of large language models, including generalization to unseen tasks. He is also a founder of the students' Transformers Club, whose members received international prizes, including first place in Meta's NAACL DADC competition.
Nikola GroverováNLP data scientist, Gauss Algorithmic
Specialist in natural language processing, graduate of applied mathematics and stochastic methods. Recreational climber.
Thomas BrowneSenior Data Scientist, Kiwi.com
Senior data scientist at Kiwi.com where he focuses on mathematical theory to address travel search-related problems with machine learning. In the past he graduated from Paris Cité University, France, with a PhD in probability and statistics for numerical simulators. He also has experience with applying machine learning in the fields of energy - reliability in nuclear plants - and pharmaceutical industry - identification of key features in cancer drug development. On a much lighter note, he is a huge fan of indie/punk music and loves cooking.
Lucie BlechováMachine Learning Engineer, Kiwi.com
ML Engineer at Kiwi.com working on predictive models and their technical implementation in production environment. She has a Master's degree in Economics from Charles University in Prague. She has worked in data teams her entire career, first working for the Ministry of Health and then a pharmaceutical company, afterwards starting full-time at an energy company in Prague and then moving to the Netherlands to work for a commodity trading company. She has experience with delivering data science and machine learning solutions in all those fields. Despite the fact that they might seem unrelated, data is what connects them all. In her personal life, she loves science, sci-fi, martial arts, yoga, hiking, and, during summers, wild swimming.
Martin PlajnerHead of Research and Development, Logio
Martin Plajner is the Director of the Research and Development department in the consultancy company Logio. This department's goal is to keep the company at the technological edge and to provide new methods and methodology. This is done by seeking novel approaches, prototyping, and defining new products. An inseparable part of the R&D team are trainees; students, who represent the company's future. Consequently, he desires to preserve his link to academia and is a junior researcher at the Institute of Information Theory and Automation (UTIA) in the field of decisions making theory with mathematical modeling background from Ph.D. studies and the Czech Technical University. These two parts provide an opportunity to combine the business and the academic world and to challenge both theoretical concepts as well as established practices.
Theodor PetříkConsultant in Research and Development, Logio
Theodor Petřík is an R&D consultant in the company Logio. He has worked in the company as a trainee over most of his university studies and then naturally became a full-time consultant. Theodor is also a Ph.D. candidate at the Institute of Economic Studies (IES) at the Faculty of Social Sciences, Charles University where he is researching how companies should conduct strategic operations planning to utilize their scarce resources in the most efficient way. The knowledge gained from theoretical research can be applied to real-world applications and the experience obtained from the real-world projects provides in return a unique perspective on theoretical concepts.
Petr ŠimánekSenior Researcher, FIT CTU
Petr is a mathematician and senior researcher at FIT CTU. Petr is involved in many ML projects and is focused on merging ML with known or unknown physics and dynamics.
Radovan KavickyData Science Instructor, Datacamp
Radovan Kavicky joined DataCamp among its first employees (historically 1st Data Science Instructor from CEE region & is still historically the only person worldwide who have made successful transition from regular student to DataCamp instructor and employee after being #1 worldwide @ DataCamp platform for a year, back in 2017).
Radovan is Data Science Polyglot (R, Python, Julia ++more) and Data Science Veteran with 11+ years of experience in Data Science and Applied AI/ML Consulting & extensive knowledge in the area (Data Science consulting, education & community building with successful cooperation together with global leaders within our industry, like f.e. H2O.ai, Anaconda or Tableau). Radovan is also co-founder of Slovak.AI (Slovak Research Center for Artificial Intelligence), member of AIslovakIA (National platform for the AI development in Slovakia) and various international professional societies within Data Science & AI/ML industry, like f.e. IEEE Computer Society, CLAIRE (Confederation of Laboratories for Artificial Intelligence Research in Europe), European AI Alliance (European Commission/Futurium), TAILOR network (Trustworthy AI - Integrating Learning, Optimisation and Reasoning), UDSC (United Data Science Communities), PyData Global Network, Global Tableau #DataLeader network & The Python Software Foundation (PSF).
Radovan is Founder of PyData Slovakia/Bratislava (#PyDataSK #PyDataBA), R <- Slovakia (#RSlovakia), Julia Users Group Slovakia (#JUGSlovakia) & SK/CZ Tableau User Group (#skczTUG) that you are all welcome to join.
Jiří PihrtResearcher, FIT CTU
Jirka is a graduate student and researcher at FIT CTU. Currently, he is involved primarily in machine learning for spatiotemporal predictions, but he also has previous experience in augmented/virtual reality and web development.
Jozef ReginacData Science Lead, STRV
Jozef was the first data engineer at STRV and is now leading data science team. Previously long-time data analyst in many fields including forensic, supply chain and e-commerce. He got pissed by the traditional tech stack and turned into analytics engineer thanks to dbt. He likes good filter coffee, open source projects, and filmography.
Matej ChomaSenior Researcher, Meteopress
Matej is a Data Scientist at Meteopress and a Ph.D. student at FIT CTU. Specializing in spatiotemporal prediction and physics-informed deep learning. Nature lover and mountaineer.
Pavel JezekData Engineer, STRV
Pavel is a data and analytics engineer at STRV who turns great coffee into business value. He enjoys problem solving and finding efficient solutions to help push the projects forward. He is a fan of unix and FOSS and enjoys spending his time in the command line.
Aneta HavlínováData scientist, Workday
Aneta Havlinova is currently a data scientist/Python developer in Workday, contributing to the development of People Analytics, an application providing insights into HR data in areas such as diversity, attrition, skills, and more. She gained experience in HR analytics also in public sector, namely during her internship for the Council of the European Union in Brussels. Previously, she worked as a data scientist in MSD, where she used her knowledge for example to help lab scientists with biological processes modelling, or to provide oncology marketing teams with insights based on financial data. She has a master’s degree from the Institute of Economic Studies at Charles University in Prague.
Martin KoryťákData scientist, Workday
Martin Korytak is a data scientist and Python developer at Workday. He is one of the key contributors to its proprietary engine which provides enthralling insights into HR data in a narrative form. Prior to joining Workday, Martin was an IBM Great Minds intern at IBM Research in Zurich working on accelerated inference of tree-based models capable of handling large-scale data sets. He holds an M.Sc. degree in data science with specialization in artificial intelligence from Czech Technical University in Prague. His interests span algorithms, neural networks and interpretability of machine learning algorithms. He is also a member of the local AI community and an enthusiastic teacher of Python programming language.
Stepan KadlecML and Data Engineering specialist, ForML
ML and Data Engineering specialist focusing on operational architectures of ML solutions - enabling their smooth transition from research to production through appropriate ML lifecycle management. Previously leading research in ML pipeline formalization at Oracle AI Apps. Co-author of the ForML framework.
Mike PearmainChief Data Officer, VietcomBank
Currently Chief Data Officer at VietcomBank with a special interest in products with ML components, the architectures to support these, and organizational separation of concerns to deliver them. Previously a data scientist at Google, Kaggle competitions master, and co-author of the ForML framework.
Practical & Inspiring Program
O2 Universum, Českomoravská 2345/17a, 190 00, Praha (workshops won't be streamed)
|Room D2||Room D3||Room D4||Room D6||Room D7|
Operationalizing Responsible AI in Practice
Mehrnoosh Sameki, Microsoft
Are you a data scientist looking to author machine learning solutions responsibly using the latest tooling? Our brand-new Responsible AI dashboard is designed to help you by providing a single pane of glass bringing together a variety of model assessment and responsible decision-making capabilities under one roof. The dashboard enables you to easily assess and validate your models by looking into a variety of model performance fairness and error analysis components interpret your models (including blackbox ones) to understand how they are making their predictions perform perturbations via what-if analysis and counterfactual analysis and understand/fix data imbalance issues. By the end of this session you will have gained hands-on experience in the utilization of these tools and how you can use the outputs to identify diagnose and mitigate your models’ issues and communicate their value to your stakeholders across the organization.
Learning to Learn: Hands-on Tutorial on Using and Improving Few-Shot Language Models
Michal Štefánik, Gauss Algorithmic
As AI models become an increasingly common element of many applications we more notoriously face practical limitations of specialized models working well only for a single training task and data. Huge language models like OpenAI's GPT-3 showed that models could be much more versatile and adapt to new tasks without updating the model provided only with natural instructions and a small number of input-output examples of the desired task. In practice Few-shot learners can solve your new task with accuracy comparable to the supervised models trained on hundreds to thousands of samples. Our workshop will give you an overview of the existing models able of Few-shot learning including their limitations. We will experiment with creative ways of utilizing in-context Few-shot learning such as customizing the model's predictions to specific users. Finally we will provide some recipes for training Few-shot learners for new languages or further scaling up the accuracy of existing Few-shot models.
Reproducible, portable, and distributable ML solutions in Python
Stepan Kadlec, ForML
When achieved the combination of reproducibility portability and distributability in ML solutions constitutes a powerful faculty unlocking a number of operational opportunities. While reproducibility is a well-established pathway for conducting scientific research it is not always receiving the same recognition within the data product industry. Similarly portability and distributability are typically regarded as irrelevant for bespoke solutions and only pursued in case of explicit demands. This might be reasonable given the extra cost incurred by conventional development; but with modern tooling these properties can be easily achieved without much extra effort. In return this brings significant benefits in the form of highly collaborative R&D inherent lifecycle management effective model troubleshooting carefree and flexible deployment (latency/throughput-optimal runtime modes) and even potential commoditization (market of turnkey solutions). In this workshop we will dive deeper into these concepts examining carefully the available technologies and reviewing some of the existing tools. A significant amount of the time will be spent working with the ForML framework implementing a practical end-to-end ML solution demonstrating all of these declared principles.
Gaussian process regression when it comes to numerical simulators
Thomas Browne, Kiwi.com
While numerical simulators are often used by heavy industries to model complicated phenomenons their complexity makes them sometimes slow and harder to exploit. Gaussian process regression (GPR) provides an accurate framework where based on a limited amount of calls to the simulator one can have a prediction on any of the simulator's output together with confidence bounds. GPR can then be extended to solve optimization and sensitivity analysis tasks with a parsimonious approach. In this workshop the attendants will be given a walk through the basics of GPR in Python. Besides they will be provided with implemented examples how GPR can help.
Drug discovery using NLP
Aisling O’Sullivan, Dataclair
NLP is an important and rapidly growing field. While its application in fields such as language translation and chatbots is well-known the use of NLP in the billion-dollar pharmaceutical industry is less commonly cited. NLP is particularly appealing to drug discovery since these models are capable of capturing complex medical concepts that are difficult for humans to grasp as well as understanding the structure of molecules which are key to discovering novel drugs. In this workshop you will be introduced to the world of using machine learning for drug discovery with a focus on NLP. We'll show you how to apply ML techniques to discover novel drug candidates using NLP on the text and also by applying NLP to the "language of molecules."We will do this through a use-case of classifying molecules that can or cannot cross the blood-brain barrier. This use-case is important for developing drugs that target diseases of the central nervous system (such as Alzheimer's) as well as for identifying potentially toxic drugs. We'll also explain the applicability of these approaches to other important problems such as identifying antibiotics and cancer drugs.
Transform Your Data Game: Mastering Data Modeling and Analytics with dbt
Jozef Reginac, STRV
Dbt has gained significant traction in the analytics engineering community and is on the quest to become the go-to tool for data teams. With the latest addition Python models it’s becoming relevant even for machine learning engineers. We would like to walk you through the basic project setup the first data model all the way up to creating the Python model. Our goal is for you to be confident in using dbt in your team and to help you merge the work of all data team members into one environment.
ML with a Large Set of Variables: Feature Selection Techniques for Regression in Python
Aneta Havlínová, Workday
In many ML applications we encounter a situation when datasets have a large amount of potential features but relatively few observations—from an analysis of genetics data with thousands of gene expressions through financial data modelling with voluminous data that flows in from capital markets and economies to HR analytics area with extensive data on employees such as their personal information skills job histories and more. In these cases feature selection is crucial to prevent overfitting and to improve model performance. This workshop provides participants with an overview of some of the useful feature selection methods including linear models such as Orthogonal Matching Pursuit or tree-based methods such as Random Forest or Boruta. First a theoretical background is presented. Afterwards the participants are guided step-by-step through implementation of these methods in Python with the practical use-case being tied to the HR data analytics context.
LIME & SHAP: Explainable AI (xAI)
Radovan Kavicky, Datacamp
In this workshop led by Radovan Kavicky from Datacamp Basecamp.ai and GapData Institute you will get familiar with principles and tools of Explainable AI (xAI) like LIME SHAP and others. Complex modern-day ML algorithms where deep learning and ensemble methods dominate are really hard to fully understand but the decision process behind them can and need to be transparent and trustworthy for decision makers within critical domains as finance healthcare or public sector and governmental services where TRUST is a MUST. In fact with growing regulatory pressure also outside these areas Explainable AI (xAI) will be necessity for any organization soon. You will learn how to understand the inner workings of these ML algorithms and how to design systems that imitate intelligence in a transparent way. You will also get an overview of current trends in Explainable AI/ML and the challenges that are ahead of us.
Bayesian Networks in business planning and risk management
Martin Plajner, Logio
Explore with us a complex and powerful family of models Bayesian Networks. In our workshop you will have a chance to i) understand the Bayesian Network models and their strengths drawbacks and application areas ii) build a data-based model which you will use to answer business planning questions and what-if scenarios and iii) create an expert-knowledge model to handle risk management infer posterior probabilities and construct emergency scenarios. In this workshop you will have an opportunity to get hands-on experience with Bayesian Networks modeling using R language. No prior Bayesian Networks knowledge is required bring a laptop with the current R version ready to use.
Predicting weather with deep learning
Petr Šimánek, FIT CTU
In this workshop we will implement train and test machine learning models that analyze satellite and weather radar data. You will get hands-on experience with the most common deep neural nets used for spatiotemporal predictions (e.g. UNet with some bells and whistles and convolutional recurrent nets). You will play with PyTorch implementation and analyze the results. You will understand the common pitfalls and reasons why the prediction fails.
O2 Universum, Českomoravská 2345/17a, 190 00, Praha (and on-line)
Registration from 9:00
Welcome to ML Prague 2023
Player of Games - Search in Imperfect Information GamesMartin Schmid, DeepMind
From the very dawn of the field, search with value functions was a fundamental concept of computer games research. Turing’s chess algorithm from 1950 was able to think two moves ahead, and Shannon’s work on chess from 1950 includes an extensive section on evaluation functions to be used within a search. Samuel’s checkers program from 1959 already combines search and value functions that are learned through self-play and bootstrapping. TD-Gammon improves upon those ideas and uses neural networks to learn those complex value functions — only to be again used within search. The combination of decision-time search and value functions has been present in the remarkable milestones where computers bested their human counterparts in long standing challenging games — DeepBlue for Chess and AlphaGo for Go. Until recently, this powerful framework of search aided with (learned) value functions has been limited to perfect information games. We will talk about why search matters, and about generalizing search for imperfect information games.
Boosting Investment Decisions with Graph Attention Reinforcement LearningMichal Dufek, Analytical Platform
Are you tired of using traditional methods for asset pricing? Look no further than our cutting-edge research on Graph Attention Reinforcement Learning! By utilizing graph neural networks with attention mechanisms and a deep reinforcement learning framework, we have developed a new approach that outperforms existing methods in terms of accuracy and efficiency. Our GARL approach is evaluated using synthetic simulated data and shows that it is an effective approach, particularly when the problem is redesigned as a multi-class classification problem. Don't miss out on this exciting new development in asset pricing!
Standing Still Is Not An Option: Alternative Baselines for Attainable Utility PreservationFabian Kovac, St. Pölten University of Applied Sciences
The rapid development of machine learning and artificial intelligence in general has led to growing concerns about the potential impact of AI on society. Ensuring that AI systems behave safely and beneficially is a major challenge, particularly in the context of Reinforcement Learning, where an agent learns by interacting with an environment and receiving feedback in the form of rewards.
Avoiding negative side-effects is one of those challenges, where the agent should not cause unintended harm while trying to achieve its primary objective. A promising way to accomplish this task in an implicit way without telling the agent what not to do, is Attainable Utility Preservation (AUP). AUP is a safe Reinforcement Learning approach that minimizes side-effects by optimizing for a primary reward function while preserving the ability to optimize auxiliary reward functions. However, AUP's applicability is limited to tasks where a no-op action (e.g., standing sill) is available in the agent's action space. Depending on the environment, this cannot always be guaranteed.
To overcome this limitation, we introduce new baselines for AUP, which are applicable to environments with or without a no-op action in the agent's action space. We achieve this by regularizing the primary reward function in different ways with respect to auxiliary goals, depending on the used variation. This enables designers of environments to define simple reward functions, which then get extended by our introduced baselines to induce safer behavior.
We evaluate all introduced variants on multiple AI safety gridworlds, which were specifically designed to test the agent's ability to solve a primary objective while avoiding negative side-effects. These effects include e.g., facing the agent in front of several options where only one solution without a side-effect is imminent, refraining from causing damage or interfering with the environment's dynamics, rescuing items without destroying them, or to learn how to mitigate delayed effects to some extent and to not complete the primary objective on purpose.
We show how our approach induces safe, conservative, and effective behavior, even when a no-op action is not available for the agent. An additional benefit lies in the variation-based approach, which allows to consider multiple variants depending on the tasks to solve.
In conclusion, our work addresses critical challenges in AI safety related to Reinforcement Learning and proposes an updated approach to achieve safe behavior implicitly by avoiding negative side-effects, contributing to the broader effort of designing safe and beneficial AI systems for the future.
LUNCH & POSTER SESSION
Multi-Model Machine Learning based Industrial Vision Tool for Assembly Part Quality ControlAimira Baitieva, Valeo
Creating a visual inspection tool in the automotive industry can be challenging due to having many different types of defects, including ones we have not seen before. Physical setup constraints, importance of missed defects and subjectivity of labels adds even more complexity to this task. I will illustrate all this on a Valeo project, which is aimed at helping the operator detect bad sensors on the production line using visual information. To tackle this complex problem we have combined different computer vision models, extracting features from the segmented anomaly map alongside with the supervised classifier score and using them for the final classification.
3D Pose Estimation in SportPiotr Skalski, Roboflow
Are you ready to take your sports analysis to the next level? Look no further! In this talk, we will dive deep into the exciting world of 3D pose estimation using multiple cameras and the powerful YOLOv7 model. From detection to post-processing, calibration to visualization, I'll be walking you through every step of the process and providing you with the professional insights you need to improve your analysis. But don't worry, this talk won't be all work and no play - I promise to add some humor to keep things interesting.
Neural fields in aerial 3D reconstructionMartina Bekrová, Melown Technologies
Methods using neural fields for novel view synthesis are hot research topic. Training of neural fields was initially very computationally demanding and could take days to create one scene and minutes for rendering each view. But soon after came invention of Instant NeRF from Nvidia which fasten the computation rapidly and we decided to test how it works with images from aerial scanning. With few modifications it is possible to also create mesh of the scene. In this talk we will share our experiments with various neural field based methods applied on aerial data and comparison of the results with our traditional SfM algorithm for 3D reconstruction.
ChatGPT and Wolfram|Alpha, a tale of two AIsJon McLoone, Wolfram Research
In the rapidly evolving field of artificial intelligence (AI), two distinct paradigms have emerged: statistical AI and symbolic AI. Statistical AI, exemplified by models such as ChatGPT, excels at natural language understanding and generation, leveraging vast amounts of data to make predictions and generate responses. Symbolic AI, exemplified by systems like Wolfram Alpha, excels at formal reasoning, knowledge representation, and symbolic computation. Each paradigm has its own strengths and weaknesses, and the integration of both approaches has the potential to unlock new capabilities and applications. In this talk, we explore the unique characteristics of statistical and symbolic AI, highlighting their respective advantages and limitations.
The talk will delve into the power of ChatGPT's language modeling and Wolfram Alpha's computational knowledge engine, and discuss how Wolfram is pioneering efforts to synergistically combine these two AI paradigms. By leveraging the complementary strengths of both statistical and symbolic AI, we aim to create a unified AI system capable of providing human-like language understanding, precise reasoning, and dynamic computation.
LLM-driven game charactersMarek Rosa, GoodAI
We present AI Game, a novel type of role-playing sandbox game that leverages LLM-powered agents to enhance player experience. Our game features agents with long-term memory and autonomous goal pursuit, enabled by large language models that emulate their personalities, behaviors, thoughts, actions, and dialogues. These agents can observe and interact with their environment, communicate with each other, and make decisions based on their individual goals. Our game offers a unique and immersive gameplay experience that challenges traditional notions of game design and opens up exciting new avenues for exploration in AI and game development. In this talk, we will discuss the technical and design aspects of our game, and highlight some of the key challenges and opportunities in this emerging field.
Bridging the Gap between Large Language Models and Human IntelligenceJan Kulveit, Future of Humanity Institute, Oxford University
As we continue to develop Language Models (LLMs), it raises the question of whether they are just "stochastic parrots" or a stepping stone toward Artificial General Intelligence (AGI). In this talk, we will explore the similarities and differences between state-of-the-art LLMs and predictive processing, a neurologically plausible theory of function of human brain. By comparing these two systems, we can gain insights into how ML research can converge with natural evolution, leading to more human-like AI solutions.
Deep learning approaches to speaker diarizationMireia Diez Sánchez, Brno University of Technology
Speaker diarization is the task of determining the speaker turns in a recording of a conversation, automatically finding “who speaks when”.
Speaker diarization is one of the most challenging tasks in the automatic speech processing field: it deals with voice activity detection (VAD), speaker recognition, segmentation of the speech into speaker turns, handling of overlapped speech and it needs to infer the number of speakers in the input conversation.
In this talk, we will focus on the recent neural network-based state-of-the-art methods, such as end-to-end diarization (EEND) and target-speaker VAD systems and will explain how these architectures tackle the speaker diarization problem.
Mastering Summarization Techniques: A Practical Exploration with LLMMartin Neznal, Productboard
In this talk, we would like to focus on the summarization of collections of feedback and describe all its challenges. We will focus on the state-of-the-art summarization models, such as GPT-3, open source GPT variants, Bart, and other transformers as well as some extractive approaches such as Gensim. We will show how they perform for summarization of different types of text such as conversations, reviews, long & short texts, etc.
We will present what are the industry standard methods for the evaluation of summaries such as ROUGE, BLEU, BLANC, BERTscore, or Supert, and use them to evaluate the summarization models. We will show how we use these approaches in Productboard to automatically and without supervision evaluate the quality of thousands of summaries daily.
We will talk about techniques to apply to summarization models to achieve significantly better summaries such as for example fine-tuning, ways how to query GPT models, text cleaning, etc.
We will also focus on multi-document summarization. We will describe what are the state-of-the-art models for this task, how to evaluate the multi-document summary, and which techniques we use to preprocess the input documents when we need to summarize a collection comprising hundreds or thousands of texts into one paragraph (such as clustering, text relevancy or pre-summarization of single documents)
In the last section of our talk, we will share our experience of implementing the summarization feature in Productboard, how we incorporate the user feedback into our summarization pipeline, how we connect summaries with other ML features and also which tech stack we use, and how we scale it to deploy an independent solution for thousands of companies (each with thousands of text/feedback).
Conference day 1
O2 Universum, Českomoravská 2345/17a, 190 00, Praha (and on-line)
Doors open at 08:30
State and Future of Quantum Computing & Quantum Machine LearningAlexander Del Toro Barba, Google
In this talk, Alexander will give an overview of recent developments in quantum computing and quantum machine learning, tackle potential applications in the near term and for fault tolerant quantum computers, and provide some tips on how to start in this field.
Probabilistic Precipitation Nowcasting with Deep Physics-Constrained Neural NetworksMatej Murín, Meteopress
In Meteopress, we have developed a neural network for precipitation nowcasting, achieving state-of-the-art quantitative performance in our geographic area but producing blurry predictions. In this contribution, we go over the shortcomings of traditional, regression-based neural networks for nowcasting and showcase why and how they fail to produce realistically looking and physically sound predictions. We then propose a new type of physics-constrained generative adversarial network, named PhyDGAN, and explain the decisions that lead to this architecture's design. We show how this type of network has better probabilistic qualities than a GAN without any physical constraints while still producing accurate, realistic predictions. We study how the introduced physical constraints influence the model and explore the possibility of creating new physically-based output features that would be interesting from a meteorological perspective.
Using TensorFlow for data processingMichal Kubišta, Dataclair
The first thing that very likely pops into your head when you hear TensorFlow is neural networks. In this talk, we aim to demonstrate that it is not only a powerful open-source machine-learning library, but a whole ecosystem of tools and, to a certain degree, a programming language on its own. While introducing concepts such as a computational graph, functions, and datasets we will illustrate how you can leverage generators for incremental data handling, how to build simple preprocessing pipelines, and most importantly how to make all of these run on multiple CPUs or even GPUs. And, if you are particularly brave - or bored - even how to use TensorFlow for visualizations.
Bringing automation and fairness to identity verification on the internet with deep learningOlivier Koch, Onfido
This talk will cover the technical challenges of leveraging the latest computer vision and machine learning techniques in a context of fraud detection. In particular, we will show how the latest advancements in deep learning allow us to significantly automate industrial processes that heavily rely on manual labeling.
We will start with a presentation of our system leveraging biometrics and identity document data. We will present the key constraints of fraud detection, such as constantly evolving threats, massively unbalanced data, and unreliable labels. We will discuss how we learned to navigate them, while finding the best possible balance between false acceptance and false rejection.
We will then move on to the trade-offs of supervised and unsupervised learning and how deep learning can be used most effectively in the fraud detection setting.
Finally, we will focus on how we address bias in our system. Making identity verification as fair as possible is a core objective for Onfido. We will present concrete steps that practitioners can use in a real-world setting to reduce bias in their systems.
Open Source Explainability - Understanding Model Decisions using AlibiAlex Athorne, Seldon
Explainable AI, or XAI, is a rapidly expanding field of research that aims to supply methods for understanding model predictions. We will start by providing a general introduction to the field of explainability, introduce the Alibi library and focus on how it helps you to understand trained models. We will then explore the collection of algorithms provided and the types of insight they each provide, looking at a broad range of datasets and models, and discussing the pros and cons of each. In particular, we'll look at methods that apply to any model. The aim is to give the ML practitioner a clear idea of how explainability techniques can be used to justify, explore and enhance their use of ML, especially for models in deployment.
Explainable AI for Computer Vision and NLP modelsUri Rosenberg, Amazon
With organizations leveraging AI/ML solutions to transform their businesses, comes the need to ensure that models are trustworthy and understandable. For structured tabular data, known techniques (SHAP, LIME) have been proven effective in providing model and inference explainability. However, gaining model interpretability for unstructured, computer vision and NLP tasks requires innovative approaches. In this talk we will deep dive into the theory, methods and examples used in AWS Clarify to gain explainability on computer vision and NLP models and expand how they fit in the MLOps pipeline.
LUNCH & POSTER SESSION
Building a Framework for Easy Model Deployments at ScaleAlexander Hagerf, Emplifi
This talk will go through how we at Emplifi created and use a framework that now enables us to deploy any of our ML models as HTTP endpoints, stream consumers, or batch jobs in Spark. All of this with no (or almost no) custom coding. This in turn has also helped with automatic pipelines for retraining, deployments, and testing for us.
We will show how adopting a standard for all models (MLflow) enabled us to abstract away the models' implementations and write code that works for any and all models, regardless of the underlying technology and/or dependencies. This separation of concerns also makes clear the line between where the data scientist's work ends, and where the data engineer's begins. Having one single code base for all deployments also means that updates or extensions are fast and easy to do.
From Prototype to Production: Best Practices for AI/ML Model ImplementationKarel Šimánek, BigHub
As we all know, creating an AI/ML model is just the first step toward developing a successful data product. The real challenge lies in integrating the model into the company's systems and workflows, ensuring sustainability and observability, and simplifying the model production process.
In this presentation, we will focus on practical solutions to these challenges using cutting-edge technologies such as MLFlow, Azure ML Studio, SageMaker, Databricks, and Kubeflow. We will explore how to integrate AI/ML models with the rest of the IT ecosystem, how to ensure the quality and functionality of the infrastructure, and how to avoid hidden defects and illegal libraries.
We will also discuss the best practices for creating a data product using feature stores, frameworks, and custom libraries to simplify the model production process.
Furthermore, we will dive into the principles of DevOps and MLOps, discussing how to achieve them in MS Azure environments. We will explore the different environments to use and what to use them for, as well as the tools that can help us achieve these principles.
In short, this presentation will provide practical and professional insights into the world of MLOps and DevOps in AI/ML and shows how to create sustainable and observable AI/ML models that integrate seamlessly into the company's systems and workflows.
How to Lead a Data Science Team: Practical solutions for a more streamlined workflowFoad Vafaei, Bladetrotter
When stakeholders see the tangible benefits driven from large datasets, they are fascinated by data science. At other times, a chasm separates the data science team from business domain experts. How do we cross the chasm and develop a strong vision? How do we foster a collaborative culture so that data science and analytics projects will not run into costly delays?
If data science is to be transformational, it must be democratized in the organization. There are many ways to democratize data science in your organization and foster collaboration.
In this talk, we'll discuss practical solutions to overcome the hurdles and work towards a more efficient data science workflow.
PANEL DISCUSSIONMarek Rosa, GoodAI
Martin Schmid, DeepMind
Jan Kulveit, Future of Humanity Institute, Oxford University
Lars Ruddigkeit, Microsoft
Have a great time Prague, the city that never sleeps
You can feel centuries of history at every corner in this unique capital. We'll invite you to get a taste of our best pivo (that’s beer in Czech) and then bring you back to the present day to party at one of the local clubs all night long!
Venue ML Prague 2023 will run hybrid, in person and online!
The main conference as well as the workshops will be held at O2 Universum.
We will also livestream the talks for all those participants who prefer to attend the conference online. Our platform will allow interaction with speakers and other participants too. Workshops require intensive interaction and won't be streamed.
Českomoravská 2345/17a, 190 00, Praha 9
Českomoravská 2345/17a, 190 00, Praha 9
Now or never Registration
What You Get
- Practical and advanced level talks led by top experts.
- Party in the city with people from around the world. Let’s go wild!
- Delicious food and snacks throughout the conference.
They’re among us We are in The ML Revolution age
Machines can learn. Incredibly fast. Faster than you. They are getting smarter and smarter every single day, changing the world we’re living in, our business and our life. The artificial intelligence revolution is here. Come, learn and make this threat your biggest advantage.
Our Attendees What they say about ML Prague
Are you attending too? Do you have tips for what not to miss?February 27, 2021
Guys, job more than well done 👍 thanks for great conference🙂— Ivan Kasanický (@IvanKasanicky) February 28, 2021
Thank you to Our Partners
Communities and Further support
Would you like to present your brand to 1000+ Machine Learning enthusiasts? Send us an email at firstname.lastname@example.org to find out how to become a ML Prague 2023 partner.
Become a partner
Happy to help Contact
If you have any questions about Machine Learning Prague, please e-mail us at
Scientific program & Co-Founder