Imagine how much you can learn from the person behind the algorithms of Facebook’s News Feed or the person who trained Yahoo’s algorithm with billions of clickstreams.
14 hours full of hands-on information. This is not an academic event, this will not just be theories. You will see real-life applications of machine learning.
Our speakers as well as the audience will be joining us from countries around the world. The conference will be held in English.
It's not all about the lectures! Meet interesting people, discuss your ideas and enjoy parties in the center of Prague!
We will be organizing tours, where you will get to discover the most beautiful places of Prague, taste the best local beers, and discuss the hottest topics in machine learning. Don't worry about accommodation, transport, or information during the conference. We will be there to help you with anything you need.
Reinforcement learning (RL) is a technique that goes way back to the early 80s. It has become very popular again recently because of the use of deep neural networks as the function approximators, in what is called DQN (deep Q-network). The broad application of DQNs ranges from playing ATARI games from raw pixels or playing a board game of Go to optimizing large cooling systems in data centers.
In the first session of this workshop, we will cover the basic theory of Q-learning and DQN, recent advances and expected future development. In the second part, we will go through the source codes of an RL algorithm and train it using a problem from the popular OpenAI gym. The convergence of harder problems can take several days so we will be training with a simpler task that can be done during the workshop.
An intermediate level of Python is required for this workshop.Michal Sustr, Czech Technical University & Milan Le, Charles University
The goal of this workshop is to give attendees a blueprint for building an end-to-end enterprise-ready ML solution and demonstrate its usage on typical ML corporate use cases (telco, digital marketing).
In the first session, we will create a base for analyzing the data using Cloudera’s CDH platform. The platform will be set up for data science workloads, including the integration of necessary tooling. We will also point out best practices and pitfalls to avoid when running the platform with ML workloads in enterprise environments.
The second session will be dedicated to real-time ingestion and analysis of live data against prepared models on top of our previously created platform.
In the third session, we will go into detail on some specific machine learning techniques applied in telco industry use cases like churn prediction or community detection.Johnson Darkwah & Jiri Polcar & Karel Vaculik, Gauss Algorithmic
In malware detection, an extremely low false positive rate is one of the most important targets. This paradigm results in a funnel of multiple different algorithms, in which each component is designed for achieving a very high precision rate with a potentially low recall. Another specific aspect is that in malware detection, big data is really big (hundreds of terabytes of files) while posing the challenge of dealing with a very imbalanced class representation (typically much more benign software than malware).
In the first two sessions, we will offer a theoretical look at various algorithms used for malware detection. In the latter two sessions, we will provide the participants with a direct hands-on experience with a real world dataset (more than 200.000 samples). We will also hold a hackathon-like competition in designing a malware detection pipeline that will result in the best recall with the highest precision. For this part of the workshop, a basic level of Python knowledge is recommended (although not obligatory).Martin Balek & Adam Hanka & Monika Seidlova & Ondrej Svec & Tomas Trnka, Avast
Session II of the workshop.Michal Sustr, Czech Technical University & Milan Le, Charles University
Session II of the workshop.Johnson Darkwah & Jiri Polcar & Karel Vaculik, Gauss Algorithmic
Session II of the workshop.Martin Balek & Adam Hanka & Monika Seidlova & Ondrej Svec & Tomas Trnka, Avast
At GoodAI, we believe that that gradual learning is a key aspect of AGI. In this hands-on workshop, we shall introduce and motivate the use of gradual learning throughout the lifetime of an agent. We will present a problem which is challenging for the current state of the art deep learning methods, and will demonstrate how it can be solved with the use of gradual learning. We will also share code samples for a gradual learner and work with the audience to construct their own gradual learning agent, which is applicable to both the presented problem and other examples.Joseph Davidson & Josef Strunc & Sungmin Aum, GoodAI
Session III of the workshop.Johnson Darkwah & Jiri Polcar & Karel Vaculik, Gauss Algorithmic
Session III of the workshop.Martin Balek & Adam Hanka & Monika Seidlova & Ondrej Svec & Tomas Trnka, Avast
Recent advances in artificial intelligence have been extraordinary. Despite this fact, the measure of our true progress towards human-level artificial intelligence has been elusive. The AI Roadmap Institute has been established to tackle this challenge and push our understanding of the AI landscape beyond our current comprehension. This workshop will offer a glimpse into one of the fundamental tasks of the institute, namely the comparison of disparate roadmaps for building intelligent machines. Such comparison is vital for evaluating where we truly stand relative to our goal, the discovery of potential dead-ends, and the identification of areas of research that the community should really focus on.Jan Feyereisl, GoodAI
In this talk, we will overview the topic of image retrieval. Initially, we will focus on retrieval methods based on Bag-of-Words (BoW) image representation, which exploit geometric constraints. We will then discuss how standard similarity-based image retrieval formulation uses BoW with geometry to satisfy only a single user requirement. Then, we will show how different formulations employ the same methods to fulfill different requirements in other tasks, such as online browsing of image collections, image analysis based on large collections of photographs, or 3D model construction. Finally, we will present how BoW with geometry dispenses with the need for costly manual image annotation to perform Convolutional Neural Network (CNN) training.Filip Radenovic, Czech Technical University
Session IV of the workshop.Martin Balek & Adam Hanka & Monika Seidlova & Ondrej Svec & Tomas Trnka, Avast
Low level task-based AI gets commoditized quickly and more general AI is decades off. In the meantime, will new AI startups succeed or will the value accrue to Google and Facebook?
While most of the machine learning talent works in consumer internet giants and related general tech companies, massive and timely problems are lurking in every major industry outside tech. If you believe the ‘software is eating the world’ hypothesis, then every company in every industry will need to become a tech company.
I will describe how Vertical AI startups solve full-stack industry problems that require subject matter expertise, unique data, and a product that uses AI to deliver its core value proposition.
I will share data about why Vertical AI startups are interesting, personal experience from my startups and DCVC, and big problems for the future in different verticals like finance, healthcare, transportation, energy, agriculture, and environment.Bradford Cross, DCVC (USA)
The Data Science group at The New York Times develops and deploys machine learning solutions to newsroom and business problems. Re-framing real-world questions as machine learning tasks requires not only adapting and extending models and algorithms to new or special cases but also sufficient breadth to know the right method for the right challenge. I'll first outline how unsupervised, supervised, and reinforcement learning methods are increasingly used in human applications for description, prediction, and prescription, respectively. I'll then focus on the 'prescriptive' cases, showing how methods from the reinforcement learning and causal inference literatures can be of direct impact in engineering, business, and decision-making more generally.Chris Wiggins, New York Times (USA)
The Microsoft Cognitive Toolkit is an open-source deep-learning toolkit that provides uncompromised scaling, speed and accuracy via an easy-to-use Python interface. In this session, you will learn that you do not have to sacrifice speed when using state of the art methods of machine learning in a high-level language.Willi Richert, Microsoft (DE)
We will explore the power of news sentiment to predict financial returns, in particular the returns of a set of European stocks. Building on past decision support work going back to the Delphi method we will describe a text analysis expert weighting algorithm that aggregates the responses of both humans and algorithms by dynamically selecting the best response according to previous performance.
The proposed system is tested through an experiment in which ensembles of experts, crowds and machines analyzed Thomson Reuters news stories and predicted the returns of the relevant stocks mentioned right after the stories appeared. The expert weighting algorithm was better than or as good as the best algorithm or human in most cases.Germán Creamer, Stevens Institute of Technology and Columbia University (USA)
Over the years, we have experimented with many methods such as SVMs, neural networks, decision tree forests, sentiment analysis or word vectors –all with the goal of predicting if the price of a popular tech stock (AAPL, GOOG, AMZN and others) will grow or sink in the next few minutes. Find out what we used and how it eventually turned out.Michal Illich, Wikidi (CZ)
Data has a value from which information and knowledge can be extracted. Within the talk, Wojciech Indyk, from Konica Minolta Laboratory Europe (KMLE), will describe tips and tricks on the methods for prototyping Machine Learning systems. Wojciech will make an overview of the lessons learnt in KMLE within the development of a predictive maintenance application for Multi-FunctionalPrinters.
The context for this application is characterized by time series data and peculiar features for machine learning such as unbalanced classes and data from MFPs counters that convey partial information on machine’s status.
After the identification of machine learning problems for predictive maintenance of MFPs, Wojciech will focus on the right tools, the scope of analytical processes and the availability of reusable components and functions, that could contribute to the development of a friendly context for streamlining specific machine learning pipelines.Wojciech Indyk, Konica Minolta Research Lab Europe (PL)
Feed ranking's goal is to provide people with over a billion personalized experiences. We strive to provide the most compelling content to each person, personalized to them so that they are most likely to see the content that is most interesting to them. Similar to a newspaper, putting the right stories above the fold has always been critical to engaging customers and interesting them in the rest of the paper. In feed ranking, we face a similar challenge, but on a grander scale. Each time a person visits, we need to find the best piece of content out of all the available stories and put it at the top of feed where people are most likely to see it. To accomplish this, we do large-scale machine learning to model each person, figure out which friends, pages and topics they care about and pick the stories each particular person is interested in. In addition to the large-scale machine learning problems we work on, another primary area of research is understanding the value we are creating for people and making sure that our objective function is in alignment with what people want.Lars Backstrom, Facebook (USA)
Seznam.cz is, among other things, a major player on the search engine market in the Czech Republic. A few years ago, we switched our image search service from a third-party provider to our own in-house solution. It started as a simple modification of our fulltext search engine. But, over time, it has gradually developed into a standalone system. This talk will be focused on the evolution of the search, the obstacles we faced and the solutions we implemented. We'll briefly discuss the models, machine learning techniques and features that are used in the image search pipeline. The focus will mostly be on our investigation into deep learning in order to further improve the relevancy of the system.Lukas Vrabel, Seznam.cz (CZ)
Free text presents a special challenge for data scientists: specifically, how to extract useful information without requiring Google-like powers of semantic analysis. In this talk, we use Latent Dirichlet Allocation to discover topics in the text of movie reviews, then use these topics to predict movie ratings. We also briefly discuss sentiment analysis using VADER, a simple rule-based model.David Gerster, BigML (USA)
TensorFlow is an open-source Python library for machine learning and neural networks. With this new tool, deep machine learning transitions from an area of research into the realm mainstream software engineering. In this session, learn how you can leverage TensorFlow to make maintainable, high performance machine learning systems.Yufeng Guo, Google (USA)
The NVIDIA DGX SATURNV taps into the compute power of 125 NVIDIA Pascal™-powered DGX-1™ server nodes to drive new levels of deep learning and AI analytics. 1,000 NVIDIA Tesla® P100 data center accelerators, coupled with NVIDIA’s deep learning software stack, provide high-performance deep learning training across multiple frameworks to deliver the best results in the shortest time. We will provide insights and details on this leap forward towards exascale computing.Ralph Hinsche, Nvidia (DE)
We will describe methods for designing feedforward and recurrent neural architectures and their effective applications to diverse problems in the life sciences. In particular, we will demonstrate deep learning applications for the prediction of the properties of small and large molecules (proteins) in bioinformatics, the analysis of circadian rhythms in systems biology, and the detection of cancer and other pathologies in biomedical imaging.Pierre Baldi, University of California (USA)
A key reason for giving a machine intelligence is to enable it to handle new unforeseen challenges. A system that can do this autonomously over extended periods of time meets an important requirement for what we call general intelligence. Predictions and promises notwithstanding, machines with general intelligence and high levels of autonomy have been slow in coming, and principles for how to design such systems have been missing. In this talk, I will present new methodological and engineering principles for designing a machine that becomes increasingly better at behaving in under specified circumstances with experience, in a goal-directed way, continuously, on the job, with minimal up-front information. Starting from a tiny amount of designer-specified code – a seed – the machine recursively self-improves, ultimately growing its knowledge to large multiples of the seed. I will show an example of how such a machine can learn a complex task such as a TV-style interview. Knowing nothing about sentence structure or grammar, interviews, multimodal gestures, or turn taking, it learns in an unsupervised manner how to effortlessly couple and coordinate multimodal gesture and speech, how to construct complex grammatically correct sentences, how to correctly interpret complex spoken utterances, resolve anaphora, interpret multimodal gestures, take turns smoothly to achieve the goal of cooperative dialogue, and properly and appropriately structure a whole interview in live interaction with a human.Kristinn R. Thórisson, Reykjavik University & Icelandic Institute for Intelligent Machines, (IS)
The talk will address interactive machine-learning systems in real-world environments, in which human-centered design of AI behaviours is crucial since robots need to react appropriately to human expectations and behavior. Examples of such adaptive systems will cover the fields of entertainment and education, where physical and virtual robots are explicitly developed for human-robot interaction to support human-like interaction: from robots used as tutors in schools or as support in elderly care to virtual robots, such as 3D augmented reality dinosaurs in a zoo or AI characters in PC games. I will introduce the vision of MATSUKO: to create a platform for mixed-reality intelligent companions, learning characters - that can sense and response, integrated into the natural world.Maria Vircikova, Matsuko (SK)
Many scientists, futurologists and philosophers have predicted that humanity will achieve a technological breakthrough and create Artificial General Intelligence (AGI). It has been suggested that AGI may be a positive or negative factor in the global catastrophic risk. In order to mitigate a dangerous AGI system it is important to understand how the system came to be in such a state. In this talk, I will survey, classify and analyze a number of pathways, which might lead to the arrival of dangerous AGI.Roman Yampolskiy, University of Louisville (USA)
After chess (DeepBlue – IBM) and go (AlphaGo – DeepMind), computers beat professional players in no-limit Poker, important milestone for AI. In contrast to previous games, Poker is a game of imperfect information – while this property is common in real world decision making, it makes the problem much more challenging for the computers. DeepStack is a revolutionary algorithm for such games that incorporates local search and deep neural nets.Martin Schmid, IBM (CZ)
Multi-instance learning applies to problems where samples are described by a set of vectors of arbitrary size. Such problems naturally occur in text analysis, image recognition, analysis of molecules, and network security. In this talk, I will first show how neural networks can be elegantly applied to MIL problems. Then we will move to modelling more complex tree structures describing structure of network traffic.Tomas Pevny, Cisco (CZ)
Our database contains hundreds of terabytes of executable files which can be used for training machine learning models. We introduce a flexible, in-house developed machine learning system for partitioning the space of fingerprints of executable files that automatically trains separate local models in each partial volume leveraging both CPU and GPU training of random forests.
This system allows for fast prototyping of new features both on local models (on order of seconds) and in successful cases, on the entire dataset (on order of hours), thereby giving our analysts a powerful tool to engineer new ways to stop malware.Martin Vejmelka, Avast (CZ)
First 50 tickets
Before April 17, 2017
From April 18, 2017
Lectures in the beautiful Lucerna Cinema with more than 100 years of history (one of the oldest cinema halls still in operation across Europe and the world), practical hands-on workshops, and showcases of the latest machine learning solutions at the stands of our partners.
Two amazing parties. One right after the workshops and the second in the La Loca club in the city center on Saturday. Great venues just for us!
Food provided by Freshland and Bakeland, a family company that bakes with traditional family recipes from the 1930s. And of course, we can’t forget to mention on the coffee from Doubleshot!
Most of the talks will be given by well-known invited speakers. However, we would also like to give students and startups an opportunity to present their work and research. It will be realized in the form of short lightning talks. Each speaker will get a free ticket to the conference.
If you are interested in becoming a lightning speaker, please send us a short paper (less than 1000 words) describing your machine learning project. Our committee will select up to 10 speakers for the lightning session.
Submission deadline: February 28, 2017
Notification: March 15, 2017
We understand that students usually have a limited budget for conference tickets. That is why we are offering several volunteer positions at Machine Learning Prague. Currently, we are looking for a camera operator. If you are interested in helping us out and getting a free ticket, please contact us.
If you have any questions about Machine Learning Prague, please
Copyright © 2017 Machine Learning Prague s.r.o.
Blanicka 589/5, Vinohrady, 120 00 Praha 2