MACHINE LEARNING / DATA SCIENCE / DATA ENGINEERING
March 27, 2024 @ Svenska Mässan

ABOUT THE CONFERENCE

GAIA organizes a one-day conference for people interested in artificial intelligence and all things data. The aim is to create an environment for learning, networking, and knowledge sharing among individuals, organizations, and academia around these common interests. The conference focuses on applied machine learning and real-world data science. It introduces diverse content from enthusiastic domain experts and typically covers what is happening within the field in Gothenburg as speakers often have local connections.

Our last conference can be seen here, and GAIA Conference 2024 is already around the corner. See you there!

WHAT TO EXPECT

Inspiration and knowledge

Fascinating talks will be held by representatives from academia and many different industries. We expect to get inspired and learn about techniques, strategies, and tools commonly used by people in the field. We hope to leave the conference with a long list of new things to explore further!

Location

This year, we are once again hosting the conference at Svenska Mässan, conveniently located near the Korsvägen stop. As usual, this prime venue will allow us to bring plenty of attendees, partners, and startups together.

Food and drinks

Of course, food and drinks are included in the ticket price. We’ll provide you with breakfast, lunch, and fika with infinite amounts of coffee and tea so that you can stay sharp during the whole day. We recommend you plan for some extra time after the closing remarks as we finish the day with bubbles!

One great conference

We’re honored to have so many representatives from Gothenburg sharing their knowledge and thoughts. They’ll tell us more about what’s happening on the west coast, and you’ll have a chance to meet with other local enthusiasts with similar problems and interests.

SCHEDULE

Breakfast

08:00

Opening Remarks

by Jakob Andersson

Chairman | GAIA

09:00

Why Take a Unidimensional Approach to Tools That Exist in High Dimensional Space?

by Allison Cohen

Senior Applied AI Projects Manager | Mila – Quebec AI Institute

09:10

Advancing Private Federated Learning: Insights and Innovations from Research at Apple

by Filip Granqvist

Machine Learning Research Engineer | Apple

09:45

Break

10:15

From Hypothesis to Reality: Designing a Superhuman Racing AI Agent Using a Deep Reinforcement Learning

by Alisa Devlic

Senior Research Scientist | Sony AI

10:30

Panel Discussion: “Navigating AI’s Labyrinth: Ethical and Legal Threads”

Daniel Gillblad, Jair Ribeiro, Julia Lemonte, Charlotta Kronblad, Sara Wrige
Moderated by Danila Petrelli

11:00

Lunch Break

12:00

Multimodal Generative AI Demystified

by Ekaterina Sirazitdinova

Senior Data Scientist | Nvidia

13:30

Zenseact Open Dataset for Advancing Autonomous Driving

by Mina Alibeigi

AI research lead and scientist | Zenseact

14:00

Scenario Extraction in the Real World

by Fabian Peng Kärrholm

Senior Data Scientist | Volvo Cars

14:30

Break

15:00

Retrieval Augmented Generation for Threat Intelligence

by Aron Lagerberg

Principle AI Scientist | Recorded Future

15:30

Lingo & GAIA: Transforming Autonomous Driving with Large Language Models and Generative AI

by Remi Tachet des Combes

Senior Applied Scientist | Wayve

16:00

Information Leakage of Neural Networks

by Johan Östman

Research Scientist | AI Sweden

16:30

Closing Remarks

by Josef Lindman Hörnlund, Jakob Andersson

Board Members | GAIA

17:00

The Role of Data and AI in Farming

by Robin Johansson

COO | Optima Planta

09:45

Break

10:15

Characterizing High-Needs High-Cost Patients with Segmentation

by Juulia Suvilehto

Senior data scientist | Sahlgrenska University Hospital

10:30

Evaluating LLM-based Solutions: Overcoming Challenges and Developing Effective Metrics

by Olena Nahorna

Analytical Linguist | Grammarly

11:00

Unveiling Precision: A Novel Machine Learning Framework for Accurate Probability Estimates in Financial Industries

by Abel Sancarlos González, Edgar Bahilo Rodríguez

Data Scientist and Lead AI Engineer | B2-Impact

11:30

Lunch Break

12:00

Key Insights (and Frustrations) Alongside the Technical Journey from a Fluffy AI Vision to a Complete ML-Product, Viewed from a Gritty Data Scientist Perspective

by Andrea Krogdal

Data Scientist | Eghed

13:30

Building the Future: Exploring the Fact-based Realities of AI-Assisted Coding

by Adam Tornhill

CTO | CodeScene

14:00

Do Chemformers Dream of Organic Matter? Evaluating Transformer Models for Synthesis Prediction in the Pharmaceutical Domain

by Samuel Genheden

Associate Director | AstraZeneca

14:30

Break

15:00

Pandas 2, Dask or Polars? Quickly Tackling Larger Data on a Single Machine

by Ian Ozsvald

Chief Data Scientist | Mor Consulting

15:30

GenAI: Summary of Breakthroughs in Image Generation

by Olof Harrysson

Machine Learning Engineer | Modulai

16:00

Adapting Like Humans: Embodied AI Beyond Datasets and Domains

by Pier Luigi Dovesi

Senior AI Engineer | Silo AI

16:30

SPEAKERS

Allison Cohen

Senior Applied AI Projects Manager @ AI for Humanity, Mila – Quebec AI Institute

Why Take a Unidimensional Approach to Tools That Exist in High Dimensional Space?

There is a problematic and yet ubiquitous phenomena in the field of AI research and product development: we’re too late to ask ourselves whether the algorithms we’ve been developing are built for purpose. Why? Because we’ve been misinterpreting “fit for purpose” to mean technically feasible. This definition obscures considerations of equal importance including the cultural, social, political and legal landscape that permeate the tool’s design and determine the tool’s utility.

When building AI products, researchers are only confronted with questions of multidisciplinary reflection as they’re about to submit a paper or launch their tool. However, at this point in the project, a host of relevant decisions have already been made, whether consciously or not, that influence the algorithms’ utility. These decisions begin before the algorithm has ever been trained or the data has ever been collected.

In this talk, I will discuss important points of inflection that are too often missed in the product development lifecycle. These inflection points present opportunities for AI researchers and product developers to ensure that the technology they’re building is fit for purpose.

Biography

Allison Cohen is a Senior Applied AI Projects Manager within Mila’s AI for Humanity team. In this role, Allison works closely with AI researchers, social science experts and external partners to professionalize and deploy socially beneficial AI projects. In this role, she has successfully delivered an LLM search engine for the OECD and a multi-modal misogyny dataset, which obtained a Spotlight recognition at NeurIPS. Allison was on InspiredMinds! Top 50 Influential Women in AI list and was the Runner Up for the Women in AI “Leader of the Year” Award in the category of Equity, Diversity and Inclusion. She holds an MA in Global Affairs from the University of Toronto and a BA in International Development from McGill University.

Charlotta Kronblad

Dr in digital transformation @ Gothenburg University

Panel Discussion: Navigating AI’s Labyrinth: Ethical and Legal Threads

Biography

Charlotta Kronblad is a former lawyer – with ten years of experience in the legal field – who researches the digital transformation of this very setting. Charlotta holds a Ph.D. in digital transformation from the Chalmers University of Technology, has completed a postdoc at the House of Innovation at Stockholm School of Economics, and is currently doing research at Gothenburg University at the Department of Applied IT, connected to the Swedish Center for Digital innovation. Her research focuses on the implementation of algorithmic decision making in the public sector and its social and legal justice implications.

 

Daniel Gillblad

Chief of AI @ Recorded Future

Panel Discussion: Navigating AI’s Labyrinth: Ethical and Legal Threads

Biography

Daniel Gillblad is the Chief of AI at Recorded Future and contributes as Senior Scientific Advisor to AI Sweden. Previously, he has served as Director of the Chalmers AI Research Center, co-Director for Scientific Vision of AI Sweden, Head of AI for RISE Research Institutes of Sweden, and Director of the Decisions, Networks, and analytics laboratory at the Swedish Institute of Computer Science. He serves on several boards of directors, is by the Swedish government the appointed representative in the Global Partnership on AI, and is a fellow of the Royal Swedish Academy of Engineering Sciences. He has led a large number of research and innovation projects, worked with digital strategy development for several large Swedish enterprises, and is the co-founder of several AI oriented startups. His research interests are focused around Machine Learning, large scale data analytics, and their real-world applications and impact on society.

Jair Ribeiro

Analytics and Insights Leader @ Volvo Group

Panel Discussion: Navigating AI’s Labyrinth: Ethical and Legal Threads

Biography

Jair Ribeiro, the Analytics and Insights Leader at Volvo Group, brings over 15 years of experience in advancing artificial intelligence and data-driven innovation within the automotive industry. His expertise focuses on the ethical application of AI, enhancing operational efficiency and sustainability. Jair’s leadership is defined by a commitment to leveraging technology responsibly, fostering collaboration, and driving Volvo Group’s vision for sustainable transport solutions. With a global perspective and a pragmatic approach, he plays a pivotal role in shaping the future of AI in mobility, emphasizing its positive impact on society and the environment.

Julia Lemonte

AI Governance Manager @ AstraZeneca

Panel Discussion: Navigating AI’s Labyrinth: Ethical and Legal Threads

Biography

Julia is a dedicated advocate of ethical AI, drawing from a wide range of experience across AI risk management, AI regulation, data governance, and responsible AI implementation within the pharmaceutical industry. With a focus on AI governance, Julia has spearheaded enterprise-wide initiatives, ensuring that we continue to leverage AI to drive innovation responsibly and compliantly.

 

Sara Wrige

Parish priest in the Church of Sweden

Panel Discussion: Navigating AI’s Labyrinth: Ethical and Legal Threads

Biography

Sara Wrige is a priest in the Church of Sweden and works in Gamlestaden, in Gothenburg. Her background is in Condensed Matter Theory, with a PhD in Physics from Chalmers 2001. After studying theology she was ordained as a priest in 2005. She has worked within the church ever since. Sara has a long experience of interdisciplinary relations between theology, science and technology.

 

Mina Alibeigi

AI research lead and scientist @ Zenseact

Zenseact Open Dataset for Advancing Autonomous Driving

In this talk, we will explore the critical role of comprehensive and varied driving data in the development of reliable and effective autonomous driving technology. A key focus will be on the Zenseact Open Dataset (ZOD), a large, diverse, multimodal dataset recently released to propel autonomous driving advancements. We will delve into the unique features of ZOD, outlining how it was created and the measures taken for its public release. This release aims to spur innovation in the field, pushing forward robust and safe autonomous driving while enhancing overall road safety. Additionally, the talk will briefly highlight our recent and ongoing research projects, showcasing how ZOD is being utilized in various potential applications.

Biography

Mina Alibeigi is an experienced AI research lead and scientist focused on contributing to the advancement of autonomous driving (AD) technology. Holding a Ph.D. in Robotics and Machine Intelligence, her contributions to the AI community are reflected in several real-world AI system implementations, patent applications, and scientific publications. In her role as an AD researcher at Zenseact, Mina leads projects like Zenseact Open Dataset. Additionally, as a visiting researcher at Cambridge Machine Learning Systems Lab, Mina delves into large-scale decentralized learning. Beyond her technical role, she actively shares knowledge in various forums, engages in talks, and advocates for the next generation’s empowerment at events like “Women in AI” and “Girls in Tech.”

Aron Lagerberg

Principle AI Scientist @ Recorded Future

Retrieval Augmented Generation for Threat Intelligence

The previous years’ advances in generative AI and Large Language Models (LLMs) have completely revolutionized the way the world thinks about AI and what problems it can solve. LLMs have been positioned as an interface to data and knowledge in various shapes or form, making knowledge queryable by natural language. In practice however, this comes with many critical challenges: how can you leverage real-time data and not just what the LLM was trained on? How can you make the answers reliable? How can you make such a system fast enough? In this talk, we will dive into how we at Recorded Future have built a retrieval augmented generation (RAG) system using LLMs to be used as an interface to our threat intelligence graph. The end goal of this system is for a user to get reliable, fact-based answers to questions posed in natural language concerning threat intelligence. In particular we will discuss how we approached the above fundamental questions.

Biography

With a mission to transform state-of-the-art methods and research in ML into useful things, Aron has spent the last 10 years building ML- and AI-products. Aron has a PhD in mathematics, and has been working in various fields, including life-sciences and the financial industry. Currently, he is working in the intersection of NLP, Generative AI, Graphs, and AI Engineering towards the goal of automating the threat-intelligence life-cycle for cyber- and geopolitical-security.

Filip Granqvist

Machine Learning Research Engineer @ Apple

Advancing Private Federated Learning: Insights and Innovations from Research at Apple

Private Federated Learning (PFL) is an approach to collaboratively train a machine learning model between edge devices with coordination by a central server, whilst preserving the privacy of the data on each edge device. PFL is an emerging field, with exponential growth in the number of papers published over the past few years. Several big tech companies invest heavily in the practical applications of PFL. 

In this talk, we will introduce Federated Learning as a subject and describe techniques to preserve the privacy of participating edge devices. We will discuss the unique challenges encountered in PFL, and advocate for further research in areas of great impact for PFL. We will briefly outline the methods Apple has chosen to implement a privacy-preserving Federated Learning system and present key results from deployed real-world applications and our published research.

Finally, we will present how to get started doing research in PFL using simulations with open source tools.

Biography

Filip Granqvist has been a core contributor to Apple’s Private Federated Learning (PFL) project since its inception in 2018, focusing on applied ML research, software design, and architecture. He is leading the development of Apple’s PFL framework for modelling, both in a simulation environment and for real-world applications using customer devices while preserving privacy. Filip’s expertise in PFL extends to consulting on a range of products at Apple, which span various domains such as language models, vision, and time series analysis. Some of his work has also been published at NeurIPS, ICASSP, and Interspeech. Filip is passionate about federated and decentralized technologies, software architecture for ML, and believes that privacy-preserving technologies are essential for a well-functioning society.

Adam Tornhill

CTO @ CodeScene

Building the Future: Exploring the Fact-based Realities of AI-Assisted Coding

Large Language Models have enabled machines to write code. The resulting movement, AI-assisted coding, promises to improve developer productivity. However, AI-assisted coding is still in its infancy. This implies that we should embrace it with caution, guardrails, and set realistic expectations beyond any marketing hype.

In this talk, Adam presents both the short- and long-term implications of using AI assistants to write code. We do so based on extensive CodeScene research analyzing over 100k AI refactorings in real-world codebases. Based on this data, we debunk the productivity claims of today’s AI assistants; it’s easy to mistake code-writing speed for productivity.

We then step out on a new path, showing how the same line of research introduces a revolutionary technology for supporting auto-generated code improvements. Using real-world demos, you will see the power of AI-assisted coding without the risks as we automatically improve existing code. In conclusion, we explore how these novel tools not only address industry challenges such as technical debt but also underscore the growing significance of comprehending code over mere writing in the age of AI. Join in!

Biography

Adam Tornhill is a programmer who combines degrees in engineering and psychology. He’s the founder of CodeScene, where he designs tools for software analysis. He’s also the author of the best-selling “Your Code as a Crime Scene” and three more books. Adam’s other interests include modern history, music, and martial arts.

Olena Nahorna

Analytical Linguist @ Grammarly

Evaluating LLM-based Solutions: Overcoming Challenges and Developing Effective Metrics

The introduction of LLMs has made solving complex tasks more widely available for companies and significantly expedited the development process. Ideas that previously would have taken months to implement can now be developed much faster. However, as tasks evolve in nature and pace, new challenges emerge. Specifically, the more diverse and human-like output a model can produce, the more difficult it is to assess its quality. In this talk, we will discuss the changing nature of tasks, the challenges this shift poses, and how to build an effective evaluation framework for your LLM-based project.

Biography

Olena is a linguist with experience in both academia and industry. She began her journey in 2008 as an ESL teacher and held the role for eleven years. The next phase of Olena’s professional journey involved delving into research: she completed her PhD in Philology at Kyiv National Linguistic University, where she also taught English and lexicology for several years. In 2019, Olena joined Grammarly as an analytical linguist, and ever since, she have worked on various projects aimed at solving problems ranging from grammatical error correction and ethical use of AI to those requiring the application of LLMs.

 

Edgar Bahilo Rodríguez

Lead AI Engineer @ B2-Impact

Unveiling Precision: A Novel Machine Learning Framework for Accurate Probability Estimates in Financial Industries

Nowadays, probability prediction models play an essential role in many industries. To cite only a few examples, these applications can range from models in the engineering sector, meteorology (weather forecasting) to modeling frameworks in the financial sector such as the prediction of the number of claims in insurance, credit scoring or propensity to pay forecasts.

Probability calibration is typically assessed graphically via reliability diagrams that plot an estimated version of the conditional event probability (CEP) against the forecast value, with deviations from the diagonal suggesting lack of calibration. Classical approaches to estimating CEP rely on binning and counting and have been hampered by ad hoc implementation decisions, instability under unavoidable choices regarding binning, and inefficiency.

This session shows a practical implementation of alternative calibration metrics applied to the debt management industry, where having a good calibration of probabilities can be critical and can significantly lead to increased cash flows (e.g. in the hundreds of thousands or millions of euros). An advanced machine learning framework is designed to address the proper scoring rules for the Hyperparameter Optimization (HPO) and the use of the ECCE-MAD and ECCE-R scores to have the best model selection in terms of calibration.

Biography

Edgar Bahilo works as Lead AI Engineer at B2Holding ASA. He focusses on designing, architecting, and implementing machine learning systems at scale, especially for time series data. His career journey has seen him contributing to major firms like Siemens AG and Siemens Energy AG. His work has spanned various applications, including industrial applications, distributed generation services and now the finance sector. He has been a speaker in several top industry conferences as AWS reinvent 2020 and Data Innovation Summit 2021/2022. Besides, he has contributed to several journals due to his collaboration as Industrial PhD supervisor in the UPC (Polytechnic University of Catalunya).

Abel Sancarlos González

Data Scientist @ B2-Impact

Unveiling Precision: A Novel Machine Learning Framework for Accurate Probability Estimates in Financial Industries

Nowadays, probability prediction models play an essential role in many industries. To cite only a few examples, these applications can range from models in the engineering sector, meteorology (weather forecasting) to modeling frameworks in the financial sector such as the prediction of the number of claims in insurance, credit scoring or propensity to pay forecasts.

Probability calibration is typically assessed graphically via reliability diagrams that plot an estimated version of the conditional event probability (CEP) against the forecast value, with deviations from the diagonal suggesting lack of calibration. Classical approaches to estimating CEP rely on binning and counting and have been hampered by ad hoc implementation decisions, instability under unavoidable choices regarding binning, and inefficiency.

This session shows a practical implementation of alternative calibration metrics applied to the debt management industry, where having a good calibration of probabilities can be critical and can significantly lead to increased cash flows (e.g. in the hundreds of thousands or millions of euros). An advanced machine learning framework is designed to address the proper scoring rules for the Hyperparameter Optimization (HPO) and the use of the ECCE-MAD and ECCE-R scores to have the best model selection in terms of calibration.

Biography

Dr. Abel Sancarlos is a seasoned data scientist with extensive experience in AI and ML research. He has contributed significantly to applied AI in the engineering sector with publications in Q1 scientific journals and magazines. His innovations have supported projects for major firms like Dassault Aviation. He’s spoken at international AI conferences and taught ML at top universities. Currently, he’s driving innovation in the financial sector, focusing on AI solutions for debt management. Dr. Sancarlos holds a PhD in Machine Learning and Engineering, with honors, and has received awards for his academic achievements

Ian Ozsvald

Chief Data Scientist @ Mor Consulting

Pandas 2, Dask or Polars? Quickly Tackling Larger Data on a Single Machine

Pandas 2 brings new Arrow data types, faster calculations and better scalability and even GPU acceleration in Pandas with CuDF is possible. Dask scales Pandas across cores and recently released a new “expressions” optimization. Polars is a new competitor to Pandas designed around Arrow with native multicore support. Which should you choose for modern research workflows? We’ll solve a “just about fits in ram” data task using the 3 solutions, talking about the pros and cons so you can make the best choice for your research workflow. You’ll leave with a clear idea of whether Pandas 2, Dask or Polars is the tool to invest in and how Polars fits into the existing numpy-focused ecosystem.

Do you still need 5x working RAM for Pandas operations (probably not!)? Can Pandas string operations actually be fast (sure)? Since Polars uses Arrow data structures, can we easily use tools like Scikit-learn and matplotlib (yes-maybe)? What limits do we still face? Could you switch to experimenting with Polars?

Biography

Ian is a Chief Data Scientist, has co-founded and built the annual PyDataLondon conference raising $100k+ annually for the open source movement along with the associated 13,000+ member monthly meetup. Using data science he’s helped clients find $2M in recoverable fraud, created the core IP which opened funding rounds for automated recruitment start-ups and diagnosed how major media companies can better supply recommendations to viewers. He gives conference talks internationally often as keynote speaker and is the author of the bestselling O’Reilly book High Performance Python (2nd edition). He has over 25 years of experience as a senior data science leader, trainer and team coach. For fun, he’s walked by his high-energy Springer Spaniel, surfs the Cornish coast and drinks fine coffee.

Remi Tachet des Combes

Senior Applied Scientist @ Wayve

Lingo & GAIA: Transforming Autonomous Driving with Large Language Models and Generative AI

In this talk, I will present LINGO and GAIA, two recent models developed at Wayve. LINGO is a pioneering open-loop driving commentator that utilises natural language processing to interpret and articulate driving scenes. Its unique “show and tell” feature employs referential segmentation to visually highlight areas of interest within a scene, enhancing the model’s interaction with its environment. This capability significantly advances autonomous vehicles (AV) by improving accuracy in describing surroundings, addressing model hallucinations, and bolstering safety communication. By integrating vision, language, and action, LINGO represents a critical step towards Vision-Language-Action Models (VLAMs), aiming for human-like communication and trustworthiness in AV technology.

GAIA is an advanced generative world model designed to simulate realistic driving scenarios. It leverages video, text, and action inputs to build representations of the environment and its future dynamics, enhancing AV decision-making and safety. Comprising an image encoder and a 6.5 billion parameter autoregressive transformer trained on extensive driving data, GAIA showcases scalability and superior video generation quality. Its ability to adjust scene features like weather and time, alongside predicting diverse futures and interactions with other agents, positions GAIA as a valuable tool for AV development.

Biography

Remi Tachet des Combes is a senior applied scientist at Wayve, a UK-based startup specialising in the development of artificial intelligence systems for self-driving vehicles. There, he focuses on world modelling and representation learning for autonomy, with the ultimate goal of solving Embodied AI. Prior to Wayve, Remi was a principal researcher at Microsoft Research where he made several contributions to the fields of reinforcement learning and deep learning. Remi holds a PhD in applied mathematics, and has worked at the MIT Senseable City lab, studying the impact and benefits of technology on urban planning.

Alisa Devlic

Senior Research Scientist @ Sony AI

From Hypothesis to Reality: Designing a Superhuman Racing AI Agent Using a Deep Reinforcement Learning

This work started out as a grand challenge to create an AI agent that could beat the world’s best Gran Turismo (GT) drivers. In order to develop an agent capable of competing against the world’s best drivers, GT Sophy was trained to master the following driving skills: race car control, racing tactics, and racing etiquette. This talk will tell you a story about the technical evolution of GT Sophy from a research outcome to an in-game feature that could be introduced as part of the Gran Turismo 7 (GT7) PlayStation racing game. Sony AI, in collaboration with Polyphony Digital and Sony Interactive Entertainment, designed novel deep reinforcement learning approaches with unique training and evaluation methods on a modern cloud computing platform to accommodate this project. In less than two years since GT Sophy appeared on the cover of Nature, the breakthrough AI agent has now become a permanent in-game feature of GT7.

Biography

Alisa Devlic is a Senior Research Scientist at Sony AI, working on applying and advancing the state-of-the-art in deep reinforcement learning in the gaming domain. She obtained her Ph.D. in Communication Systems from KTH, Stockholm, Sweden and did her postdoctoral work at IMT Atlantique, Rennes, France. Before joining Sony AI, she worked both in industry labs (Ericsson Research and Huawei Technologies) and academia (KTH, IMT Atlantique, and University of Zagreb). Alisa (co)authored more than 30 papers and obtained best paper awards at IEEE WoWMoM 2015, ICC 2017 and MMSP 2018. She was part of the Sony AI team who developed a GT Sophy, the first racing agent that can outperform the world’s best e-sports drivers and published results in the Nature journal. She started working in the field of Reinforcement Learning in 2020, when she joined Sony AI.

 

Ekaterina Sirazitdinova

Senior Data Scientist @ Nvidia

Multimodal Generative AI Demystified

Multimodal generative AI has recently seen significant advancements, enabling the creation of realistic images, videos, and audio from textual or other inputs. However, due to the complexity of these models, understanding how they function and how to apply them in practical settings can be challenging. During this talk, Ekaterina will shed light on the inner workings of multimodal generative AI models by discussing key concepts and techniques used in their development. She will also explore various applications and use cases of this technology. The talk is intended for anyone interested in the current state of AI and its potential to produce realistic and immersive multimedia experiences.

Biography

Ekaterina specializes in leveraging AI techniques, such as multimodal generative AI and large language models, to tackle computer vision and language processing challenges. She is skilled in end-to-end AI productization, encompassing the entire process from development to optimized deployment, whether it be in the cloud or at the edge. Previously, Ekaterina was a research engineer applying deep learning to medical image analysis. She has also authored several peer-reviewed journals and conference publications on various applications of image-based 3D reconstruction, localization, and tracking. Ekaterina received her Ph.D. in Computer Science and M.Sc in Media Informatics; she also holds a Diploma in Business Informatics.

Samuel Genheden

Associate Director @ AstraZeneca

Do Chemformers Dream of Organic Matter? Evaluating Transformer Models for Synthesis Prediction in the Pharmaceutical Domain

Language models like transformers have found a natural place in drug discovery, solving tasks such as property prediction, molecular optimization, and reactivity predictions. Transformer models trained on public data for synthesis prediction tasks, such as product and retrosynthesis prediction, have proven effective and sometimes outperform other approaches, including template-based retrosynthesis. In this contribution, we will outline our efforts of training and introducing transformer models for synthesis predictions into our production platform for synthesis planning that is used daily by chemists. We will discuss particular challenges faced when training on a large corpus of reaction data, comparison with existing models currently used by chemists for product prediction and retrosynthesis. We will show that transformer models trained on a diverse set of reactions can surpass existing models with impressive performance. Finally, we will outline outstanding issues preventing the full adoption of transformer models for synthesis prediction.

Biography

Samuel Genheden leads the Deep Chemistry team in Discovery Sciences, AstraZeneca R&D. He received his PhD in theoretical chemistry from Lund University in 2012, having studied computational methods to estimate ligand-binding affinities. He continued with postdocs at the Universities of Southampton and Gothenburg, where he simulated membrane phenomena using multiscale approaches. He joined the Molecular AI department at AstraZeneca in 2020 and became team leader in 2022. The team’s research focuses on the AiZynth platform for AI-assisted retrosynthesis planning. Samuel’s interests lie in studying chemical and biological systems with computers and using these approaches to impact drug development. He is a keen advocate for open-source software.

 

Johan Östman

Research Scientist @ AI Sweden

Information Leakage of Neural Networks

As machine learning is becoming a cornerstone of society, it increasingly encounters the challenge of handling sensitive data. This issue is magnified when trained machine learning models are shared with external entities, e.g., open-source or via API, which raises the critical question: Can sensitive information be extracted from the shared models?
In this talk, we will navigate the fascinating domain of information extraction attacks targeting trained machine learning models. We will dissect various attack vectors across different adversarial settings and their potential to compromise data. Additionally, we will discuss strategies to mitigate these attacks and showcase the effectiveness of such techniques.
Finally, we will touch upon the legal aspects and the importance in bridging between legal and technical definitions of risk.

Biography

Johan leads the privacy-preserving machine learning initiatives at AI Sweden, Sweden’s national center for applied AI. His team is dedicated to mitigating information leakage from machine learning models and advancing decentralized machine learning methodologies. He is also co-leading a research group at Chalmers University of Technology, focusing on the privacy-security-utility tension within federated learning. Additionally, he is the project initiator of a federated learning project together with Handelsbanken and Swedbank to combat money laundering. He also leads a larger initiative investigating the nuances of information leakage. Johan holds a Ph.D. in Information Theory and dual master’s degrees in Electrical Engineering and Industrial Economics.

Juulia Suvilehto

Senior data scientist @ Sahlgrenska University Hospital

Characterizing High-Needs High-Cost Patients with Segmentation

Healthcare utilization is extremely unevenly distributed in the population. Earlier research suggests that up to 50% of resources of a given healthcare system are used by only 5% of patients. This group is often called high-needs high-cost patients.  While the existence of this high-needs high-cost patient group is well established, the group has not been properly characterized. We know that these patients are likely to have multiple chronic health conditions and that they need a lot of care to manage their health, which cannot be properly modeled using industry standard methods of inspecting patient groups one diagnosis and one visit at a time. In this talk,  Juulia presents work done at Västra Götalandsregionen to detect and characterize this patient group using unsupervised learning. She will share some early findings and lessons learned from introducing advanced analytics to an organization more familiar with reporting and classical statistics.

Biography

Dr. Juulia Suvilehto is a senior data scientist at Sahlgrenska University Hospital, with over a decade of experience in data analytics for life sciences. Her work spans biotech startups, academia and now healthcare. In her current role she’s bridging the gap between clinical and technical expertise to advance AI in clinical settings. She is the founder of a network for Swedish healthcare data scientists, facilitating knowledge exchange and collaboration. Dr. Suvilehto is committed to making AI accessible to underrepresented groups and demystifying technology for non-technical individuals. She holds a PhD in systems neuroscience from Aalto University.

Fabian Peng Kärrholm

Senior Data Scientist @ Volvo Cars

Scenario Extraction in the Real World

To develop a self-driving vehicle, there’s a large need for real world scenarios to be used for validation and verification of the autonomous drive function. Real world scenarios need to be collected by vehicles driving in the intended operational domain of the function. This work focuses on evaluating the collected data in the form of time-series from those vehicles, and classifying it into different states, events and scenarios. The found items in the data can then be used to conduct simulations of the AD function, risk analysis of intended launch areas and traffic behavior assessment. The presentation will cover different methods to analyze and structure the data, and ways to draw conclusions about traffic behaviour based on it.

Biography

Fabian Peng Kärrholm has worked across a variety of fields and started his career in computational fluid dynamics and has a PhD in Combustion & Thermodynamics. He also worked with combustion engine control system for several years. For the past five years he has worked with data analysis at the department for Autonomous Drive at Volvo Cars. Currently, he’s working with scenario extraction, identification, and classification for traffic analysis.

 

Olof Harrysson

Machine Learning Engineer @ Modulai

GenAI: Summary of Breakthroughs in Image Generation

This presentation offers a comprehensive overview of the latest breakthroughs in generative AI, specifically within the realm of image generation. We highlight the capabilities of influential models such as Stable Diffusion and MidJourney, accompanied by a showcase of stunning and creative images they have produced.
We address the significant contributions of LORA in the development of custom models. These models are specifically designed to generate images with particular styles or concepts, marking a significant leap in personalized content creation.
Further, we explore innovative features like ControlNet, which gives users the ability to shape the structure of generated images, and IPAdapter, which introduces the concept of image-based prompts for generation.
Our focus then shifts to the dynamic open-source community, a treasure trove of diverse tools, techniques, and models that continuously fuel progress in this field. Among these innovations, we highlight ComfyUI, a tool that has been aptly likened to the “Photoshop” of image generation, due to its versatility and powerful features.
Our conversation extends beyond the technical intricacies to include the broader business implications, illustrating how these advancements are revolutionizing industries.
To conclude our presentation, we will peer into the future of image generation, anticipating further innovations and exploring the evolving landscape of this fascinating field.

Biography

Olof Harrysson, a Machine Learning Engineer at Modulai, specializes in AI/ML solutions. He serves on the board for ITHS’s “Utvecklare inom AI och Maskininlärning” program. With a Master’s in Computer Science from Lund’s Technical University, his passion for image generation was sparked by an intriguing YouTube video. Raised in Gothenburg, contact: olof@modulai.io.

Robin Johansson

COO @ Optima Planta

The Role of Data and AI in Farming

The farming industry is facing many challenges, both in terms of economics and in terms of sustainability. Financially, it is a low margin industry with rising costs. Environmentally, we see a growing population which is problematic when combined with limited resources, usage of 70% of the freshwater, and a third of the arable land having been destroyed in the last 40 years.

So what role does AI play in the future of farming?

Robin will talk about the problems in the farming industry, the potential applications and impact of data and AI, what is already done today, and what and what is needed to see the results of AI within farming.

Biography

Robin has an engineering background from Chalmers and a Master’s degree in entrepreneurship and business design. Since 2016 Robin has founded and worked with tech-startups. Encrypting memory cards for investigative journalists, IT-consulting, and since 2019 Robin has been the COO of Optima Planta. Optima Planta is a tech company developing the farming technology of the future, utilizing aeroponics and AI.

 

 

Pier Luigi Dovesi

Senior AI Engineer @ Silo AI

Adapting Like Humans: Embodied AI Beyond Datasets and Domains

The traditional approaches of training artificial intelligence on fixed datasets within defined domains have significantly advanced the field. However, the dynamic and evolving nature of real-world scenarios requires AI systems to break free from these boundaries, mirroring human adaptability.

This talk unveils the journey towards embodied AI systems, capable of continuously learning across diverse contexts, much like humans. Inspired by recent works on real-time adaptation, we outline how these methodologies serve as stepping stones towards embodied AI.

Embodiment in AI is important because it captures the agent’s ability to perceive, interact, and symbiotically evolve with the environment. We will show how transitioning from static datasets and domain-dependent AI to a more dynamic, embodied AI, can enhance the robustness and applicability of AI systems in the real world. This requires AI to have the ability to adapt in an online learning fashion, responding to the ever-changing conditions it encounters.

Our research is now going beyond academia becoming foundational tools for AI innovators. One example is Rain AI, startup backed by Sam Altman and closely tied to OpenAI, working on neuromorphic computing. Rain is using our work on real-time adaptation as a key component of their technology stack.

Biography

Pier Luigi Dovesi is a Senior AI Engineer at Silo AI, focusing on generative AI. His extensive work in perception and safety for autonomous driving and robotics originated the idea that AIs – just like humans – should continuously adapt.

Currently, Pier is directing projects including hyperrealistic portrait generation, adaptive interactive agents, and 3D scene reconstruction. Pier regularly contributes to AI research with publications in 3DV, ICRA, ECCV, and ICCV collaborating with top-tier institutions including KTH Royal Institute of Technology, the University of Bologna, ETH Zurich, and TU Munich.

Pier is an active member of the AI community. He engages in talks, organizes educational events and serves as a mentor for Women in AI.

Andrea Krogdal

Data Scientist @ Eghed

Key Insights (and Frustrations) Alongside the Technical Journey from a Fluffy AI Vision to a Complete ML-Product, Viewed from a Gritty Data Scientist Perspective

How do you go from a fluffy AI idea to a concrete ML product? The ambition is to clarify this fluffiness based on experiences from a completed project at Stena Stål with an end-to-end implementation predicting the volatile prices of steel. Focusing on key learnings from a data scientist’s perspective, we present an effective constellation of well-established technical solutions alongside emerging ML tools, such as MLflow, to produce a stable, repeatable and quality-assured machine learning workflow. 

 After covering the technical part, we will also try to give input on how to solve the really tricky part – how to intrigue stakeholders by mastering the art of explaining the concept of ML and how to stop wasting people’s skills and instead let them do the job they actually thrive in. Because then the ML-stars are aligned!

Biography

Andrea has been working in the data science field for over 4 years and has worked with everything from clustering driving behaviors, reliability tools, anomaly detection, prediction of future price trends to explaining the business value of ML-solutions to stakeholders. She has a master’s in mathematical statistics and has a passion for algorithms. Today, Andrea is a data scientist consultant helping clients build and understand intelligent solutions to enhance their products.

TICKETS

Tickets are now available for purchase. We are happy to introduce the early-bird tickets this year. Secure your spot before December 25, 2023, to get a discount!

As always, we offer half price for students, and now it is even half the early-bird price! All other participants are referred to the General Admission tickets. If you have any questions regarding your purchase, please send us an email using the contact information at the bottom of the page.

General Admission | Early-bird
By December 25, 2023
800 SEK
  • Breakfast
  • Lunch
  • Coffee, Snacks, Fika
General Admission | Regular
From December 26, 2023
1000 SEK
  • Breakfast
  • Lunch
  • Coffee, Snacks, Fika
Student Admission
Fixed price
400 SEK
  • Breakfast
  • Lunch
  • Coffee, Snacks, Fika

WORKSHOPS

This year, the GAIA Conference will be expanding into a new format. Alongside our main event on March 27th, we’re excited to bring you a special workshop day on March 26th, thanks to our collaboration with local experts.

We’re thrilled to introduce a workshop designed for those eager to explore how to harness the potential of artificial intelligence in their operations. Whether you’re taking your first steps into the world of AI or looking to deepen your existing knowledge, this session will empower you with practical insights and inspiration.

This workshop will show you that the power of AI isn’t confined to tech giants or early adopters. Whether you’re a local startup or a traditional business venturing into digital, AI can be your ally in innovation and growth regardless of your industry or company size.

Who is this for?

Directed specifically at business leaders and decision-makers, this workshop aims to equip you with the knowledge to make informed decisions about integrating AI into your business strategies. Take the first steps to identify opportunities, leverage data, and navigate the evolving landscape of AI technology.

What are the prerequisites?

Curiosity and a desire to learn are all you need to bring. Our session is designed to be accessible, requiring no prior technical knowledge. We’ll guide you through the essentials and explore the beyond together, ensuring you leave with inspiration and knowledge in AI applications for business.

What You Will Gain

  • Better Understanding
    Delve into the essentials of AI, demystifying the technology and its potential for your business. We’ll cover the latest advancements and practical applications, helping you to see beyond the hype.
  • Strategic Insights
    Discover how AI can solve real-world problems, enhance efficiency, and drive innovation in your business. We’ll explore ‘low-hanging fruit’—accessible AI opportunities you can quickly capitalize on—and longer-term strategic investments.
  • Risk Awareness
    Learn about the potential pitfalls and ethical considerations of AI implementation and how to navigate these responsibly in your business strategy.
  • Inspiration and Perspective
    Leave the workshop with knowledge and a fresh perspective on how AI can help shape the future of your business. Be inspired by success stories and envision how you can lead your company towards a smarter, AI-integrated future.

Limited spots available – Join us to explore the potential of AI.

Dive into the complexities of managing technical debt in large-scale software systems with our insightful, hands-on workshop on the 26th of March, presented in collaboration with CodeScene. This session is designed for technical leaders, software architects, engineering managers, and senior developers who face the challenging balance between innovation, feature delivery and maintenance in their daily work.

Understanding the Challenge

Prioritizing technical debt is a hard problem as modern systems might have millions of lines of code and multiple development teams — no one has a holistic overview. In addition, there’s always a trade-off between improving existing code versus adding new features so we need to use our time wisely.

What if we could mine the collective intelligence of all contributing programmers and start making decisions based on information from how the organization actually works with the code?

What This Workshop Offers

In this workshop, you’ll learn how easily obtained version-control data lets you uncover the behavior and patterns of the development organization. This language-neutral approach lets you prioritize the parts of your system that benefit the most from improvements so that you can balance short- and long-term goals guided by data. In this session, you’ll learn:

  • To prioritize technical debt in large-scale systems
  • Balance the trade-off between improving existing code versus adding new features
  • Visualize long time trends in technical debt
  • Take a data-driven approach to technical debt.

During the workshop, you get access to CodeScene – a behavioral code analysis tool that automates the analyses – which we use for practical exercises. We’ll do the exercises on real-world codebases in Java, C#, JavaScript and more to discover real issues.

Participants are also encouraged to take this opportunity to analyze their own codebase to get actionable take-away information.

Prerequisites
Bring your curiosity and your laptop. No prior preparation is required, though attendees will receive the workshop slides the day before for an optimal learning experience.

Spaces are limited – secure your spot to redefine how you manage technical debt with clarity and efficiency.

Are you eager to explore the transformative potential of Large Language Models (LLMs) in your projects? Join us for a hands-on workshop tailored specifically for those new to the world of LLMs. We will go through the fundamentals of LLMs and the tools and know-how to get the full potential out of pre-trained models. The workshop aims to cast a wide net rather than go in-depth on a small sub-problem. We offer a beginner-friendly overview of key concepts and practical applications.

Who is it for?

This workshop is perfect for people curious about integrating LLMs into their workflows but with limited or no prior experience with these models. It is designed to provide a gentle yet insightful introduction to the world of LLMs.

What This Workshop Offers

In this workshop, participants will:

  1. Gain an Introduction to LLMs: Explore the foundational concepts of LLMs, including embeddings, model variants, and considerations for selecting the suitable pre-trained model for your task.
  2. Learn Prompt Engineering Techniques: Discover how to craft effective prompts to guide LLMs in generating desired outputs, with practical examples showcasing successful and ineffective prompts.
  3. Explore the RAG Framework: Dive into the innovative Retrieval-Augmented Generation (RAG) framework, uncovering its potential to enhance text completion and semantic search tasks with minimal complexity.
  4. Introduction to Fine-Tuning: Get acquainted with basic LLM fine-tuning for custom tasks, including an overview of popular techniques.

Prerequisites

No prior experience with LLMs is required! However, participants should have:

– A basic understanding of machine learning concepts and algorithms.

– Proficiency in the Python programming language and familiarity with Jupyter notebooks.

– Curiosity and eagerness to learn about NLP concepts and applications.

– Bring a laptop for code-along steps

Spaces are limited!

Workshop#1: “AI For Decision Makers”
March 26, 2024
200 SEK
  • When: 8:00 – 10:00
  • Where: Vallgatan 3, Gothenburg
  • Hosted by: Smartr
Workshop #2: “Prioritize Technical Debt as if Time and Money matter”
March 26, 2024
200 SEK
  • When: 13:30 – 15:00
  • Where: Convendum, Kungsportsavenyn 21, Gothenburg
  • Hosted by: CodeScene
Workshop #3: “Getting full potential out of LLM projects”
March 26, 2024
200 SEK
  • When: 09:00 – 11:30
  • Where: Kits, Kungsportsavenyn 33, Gothenburg
  • Hosted by: Eghed (with Kits)

PARTNERSHIP

Join us as a partner for the 2024 GAIA Conference, connecting with the vibrant AI community in Gothenburg and beyond. Our flexible partnership tiers let you choose what suits your needs best. As a partner, you’ll connect with engaged AI professionals, showcase your offerings, and establish your leadership in the field.

In 2024, we’re moving to the main Congress Hall at Svenska Mässan to accommodate our growing audience. With an expected 800 attendees on March 27, 2024, we’re also offering innovative ways for partners to stand out beyond traditional booths.

The GAIA Conference relies on partners like you to make it a success. Don’t miss this chance to support and enhance our event. Partner with us today to connect with the incredible AI community!

Read more about the partnership tiers here: GAIA Partnership Information 2024.

CONTACT

Program and speakers

Are you wondering about the schedule or any of our sessions? Send an email to program@gaia.fish.

Questions or comments

Do you have any other questions or comments to us, send an email to conf@gaia.fish and we promise to help you!

Partnerships

Want to support the conference by becoming a partner? Reach out to us on partnership@gaia.fish for more information.

© 2024 GAIA

conf@gaia.fish