Contact

Technical Skills

Python
R
Bash
SQL
GCP
Tableau

Open Source Contributions

Hugo Fitipaldi

I’m a Data Scientist with a Ph.D. in Genetic Epidemiology and a strong background in statistical programming, data analysis, and machine learning, particularly within clinical research settings. My expertise lies in biostatistics and I excel at analyzing complex datasets to uncover significant trends and relationships, which I then translate into clear, practical insights that drive informed decision-making. I am particularly skilled in creating clear visualizations, dashboards, and presentations to communicate complex data effectively to diverse audiences.

Experience

Postdoctoral Fellow

Diabetic Complications unit

Lund University

Current - 2023

  • I am leading the data analysis for a multicenter project involving European universities and hospitals, using machine learning algorithms on clinical, biobank, and image data to identify and validate prognostic and predictive biomarkers for rapid, aggressive disease progression in diabetic kidney disease (DKD).
  • Responsible for the harmonization of biobank and clinical data across multiple European centers, ensuring data integrity and consistency in collaborative research environments.
  • Contributed to various systematic review projects, where I was responsible for implementing Natural Language Processing (NLP) techniques to extract and analyze key information from scientific papers, and for creating and maintaining data dashboards for exploring results.

PhD Candidate

GAME unit

Lund University

2023 - 2018

  • Led the implementation of the CDISC format for the “All New Diabetics In Scania” (ANDIS) cohort within the IMI-RHAPSODY project, integrating clinical and OMICs data. This standardized data was essential for performing cross-validation and replication studies aimed at identifying subtypes of type 2 diabetes.
  • Served as the primary analyst for the COVID Symptom Study Sweden, where I developed predictive models to estimate COVID-19 prevalence and hospital admissions using app-based data. I also created and maintained an online dashboard for real-time data dissemination during the SARS-CoV-2 pandemic, enhancing public health response strategies.
  • Employed data mining and Natural Language Processing (NLP) algorithms to analyze the characteristics of genomic research on non-communicable diseases (NCDs) from scientific literature and the GWAS Catalog. My work focused on identifying trends and disparities, contributing valuable insights into the genetic underpinnings of these conditions.

Research intern

GAME unit

Lund University

2018 - 2017

  • Collaborated on projects integrating genomic and clinical data to explore gene-environment interactions, leveraging statistical models and machine learning algorithms.
  • Developed an R package utilizing NLP’s Named Entity Recognition (NER) to automate the systematic extraction and analysis of key data from scientific documents.
  • Conducted a comprehensive review on precision medicine in type 2 diabetes, emphasizing the integration of genomics into clinical practice and highlighting the required technological and data infrastructure.

Freelance Data Scientist

N/A

Remote

2018 - 2018

  • During this brief period, I primarily focused on projects related to text mining, machine learning, predictive modeling, and academic research.

Education

PhD, Genetic Epidemiology

Lund University

Malmö, SE

2023 - 2018

  • Thesis: Use of data mining and artificial intelligence to derive public health evidence from large datasets
  • Coursework included: Applied Statistics, Applied Statistics in Clinical Research, Introduction to Deep Learning, Artificial Intelligence in Medicine and Life Sciences.

Master of Medical Science in Public Health and Epidemiology

Lund University

Lund, SE

2018 - 2016

  • Thesis: A global overview of Precision Medicine in type 2 diabetes: a systematic review.
  • Coursework included: Epidemiology and Public Health Research Methodology, Planning and Leadership, Applied Public Health Research Methods.

Bachelor's degree in Physiotherapy

Universidade Federal de Pernambuco

Recife, BR

2014 - 2008

  • Thesis: Effects of taping on the contractile activity and strengthening of skeletal muscles: a narrative review (free translation from Portuguese).

Exercise Sciences

Kent State University

Kent, US

2013 - 2012

  • Exchange Student - Science without Borders Program

Complementary Education

Introducing Generative AI with AWS

Udacity

MOOC

2024 - 2024

  • This course explored generative AI in depth, covering foundational concepts in machine learning and generative AI, real-world applications, and hands-on practice.

Machine Learning: Leveraging Data Insights

MIT Professional Education

MOOC

2022 - 2022

  • With this online program in Machine Learning, participants will learn the four-step process of machine learning that leads from analyzing data to evaluating the effectiveness of decisions made based on that data. At the end of the program participants will have a better understanding of how machine learning tools and techniques contribute to more efficient decision making in many different environments.

Artificial Neural Network and Deep Learning

Lund University

Lund, SE

2020 - 2020

  • A brief introduction to artificial neural networks - workshop.

Introduction to Natural Language Processing and Text Mining

Lund University

Lund, SE

2020 - 2020

  • This workshop aims to give an introduction to quantitative methods for analyzing text. We will illustrate a few tools, resources and workflows, including word embeddings, text clustering and binary text classification.

Introduction to Deep Learning

Lund University

Lund, SE

2020 - 2020

  • The aim of this course is to introduce students to common deep learning architectures such as multi-layer perceptrons, convolutional neural networks, and recurrent models such as the LSTM.

Deep Learning

University of Copenhagen

Copenhagen, DK

2019 - 2019

  • A thorough introduction to the foundations of machine learning, especially neural networks including their training; Introduction to convolutional neural networks and recurrent neural networks; Training and applying convolutional and recurrent neural networks for image analysis; Making use of data augmentation and other preprocessing steps to further improve the generalization performance.

Honors and Awards

Oxford Machine Learning Summer School 2024

Stiftelsen Landshövding Per Westlings minnesfond (95096)

Oxford, UK

2024

  • Travel grant
  • Support for scientific research at Lund University and for grants for study trips for both older and younger scientists at the university.

Next Generation Tech Booster Scholarship

Udacity & Bertelsmann

Remote

2023

  • Course cost
  • As a global leader in media, education, and services, Bertelsmann wants to empower people around the world to be successful in the tech and data sectors, especially those individuals who historically may not have access to such skill-building opportunities. This program aims to set up eager learners for exciting, high-paying careers in tech.

ACCESS Forum travel allowance and participation

ACCESS (Academic Collaboration Chile Sweden)

Punta Arenas, Chile

2022

  • Travel grant
  • The ACCESS forum “Reconnecting for a Sustainable Future” offered a unique opportunity for academic communities from Sweden and Chile to come together, network, and exchange experiences. Parallel discussions on various research themes, virtual sessions, and presentations on funding opportunities were the main activities during the forum. The forum took place in Punta Arenas, Chile from November 7-11, 2022.

Best Challenge Award (Data Science)

Danish Diabetes Academy

Vejle, Denmark

2022

  • Competition
  • This prize was presented as part of the Data Science Spring School, hosted by the Danish Diabetes Academy. The event featured a series of lectures and activities centered around data science and artificial intelligence, culminating in a hackathon-style challenge.

Microsoft Power BI Scholarship Program

Microsoft & Dataquest

Remote

2022

  • Course cost
  • Dataquest and Microsoft have partnered to offer the first interactive Power BI courses. These courses teach Power BI through a project-based, in-browser approach. This prepares learners for real-world skill application, increasing comprehension and confidence. After completing the 12-week program, learners will be well prepared to pass the PL-300 Microsoft Power BI Data Analyst Exam and continue advancing their data careers.

PyCon US 2022 Financial Aid

Python Software Foundation

Remote

2022

  • Conference cost
  • Financial aid recipients receive support for some or all of their expenses including transportation, hotel, and childcare. Because PyCon is the largest Python conference in the world, it’s a meeting place for Python developers from around the world. Therefore, the financial aid award process is designed to enhance PyCon, including speakers, tutorial presenters, and notable open source contributors.

Becas Santander Tech | Emerging Technologies Program by MIT Professional Education

Massachusetts Institute of Technology (MIT)

Remote

2022

  • Course cost
  • The “Becas Santander Tech - Emerging Technologies Program by MIT Professional Education” facilitates and promotes the knowledge and use of the emerging technological tools most in demand in companies, and runs during the 2022 academic year. The main objective of the Program is to offer a relevant and practical learning experience, providing knowledge and skills that can be learned asynchronously and put into practice immediately.

Bertelsmann Technology Scholarship

Udacity & Bertelsmann

Remote

2021

  • Course cost
  • Technology Scholarship Program powered by Bertelsmann for the Intro to ML with TensorFlow Challenge Course.

Nordic Probabilistic AI School 2021

Norwegian University of Science and Technology (NTNU)

Remote

2021

  • Course cost
  • The mission of the Nordic Probabilistic AI School (ProbAI) is to serve state-of-the-art expertise in machine learning and artificial intelligence to the public, students, academia, and industry. The selection of participants is based on multiple criteria such as experience, institution affiliation, geographical location, and gender. The selection process is based on the submitted material and will be executed by a diversified committee of trusted experts.

Virtual ODSC East 2020 Scholarship

Open Data Science Conference

Remote

2020

  • Conference cost
  • Each year, candidates who have distinguished themselves through outstanding academic achievement and personal excellence are chosen to attend the Open Data Science Conference.

Udacity Technology Scholarship powered by Bertelsmann (Deep Learning Nanodegree)

Udacity & Bertelsmann

Remote

2020

  • Course cost
  • Out of thousands of students in the Technology Scholarship Challenge Course, the progress in the course and dedication in the community stood out and the student was awarded a full Deep Learning Nanodegree.

PyCon US 2020 Financial Aid

Python Software Foundation

Pittsburgh, US

2020

  • Travel grant
  • Financial aid recipients receive support for some or all of their expenses including transportation, hotel, and childcare. Because PyCon is the largest Python conference in the world, it’s a meeting place for Python developers from around the world. Therefore, the financial aid award process is designed to enhance PyCon, including speakers, tutorial presenters, and notable open source contributors.

Bertelsmann Scholarship

Udacity & Bertelsmann

Remote

2019

  • Competition
  • Bertelsmann’s media, services and educational offerings make it a leader in many areas of the digital world. Accordingly, the company wants to empower as many people as possible to be successful in the digital world. The initial 2019–2020 phase of the program is a two-stage scholarship, open to any student, 18 years of age or older, interested in Cloud Computing, Data Science or Artificial Intelligence. Recipients will spend 3.5 months learning key components for Cloud Computing, Data Analysis, or Artificial Intelligence. Top students from this initial phase will earn a full Nanodegree program scholarship.

ODSC Europe 2019 Scholarship

Open Data Science Conference

London, UK

2019

  • Conference cost
  • Each year, candidates who have distinguished themselves through outstanding academic achievement and personal excellence are chosen to attend the Open Data Science Conference.

Medicinska fakulteten resebidrag för forskarstuderande

Lund University

Lund, SE

2019

  • Travel grant
  • This grant is to cover travel costs to other institutions in Sweden or abroad which are related to the PhD education. All PhD students that are enrolled in the postgraduate education programme with the Faculty of Medicine in Lund/Malmö are eligible for this grant.

Summer Research Scholarship

Lund University

Lund, SE

2018

  • Research grant
  • The summer scholarships aim to interest students in research and take the form of two-month research projects.

Swedish Institute Study Scholarship

Svenska Institutet

Stockholm, SE

2018 - 2016

  • Study grant
  • The Swedish Institute (SI), a government agency, offers scholarships each year for international students and researchers coming to Sweden. The programme offers a unique opportunity for future leaders to develop professionally and academically, to experience Swedish society and culture and to build a long-lasting relationship with Sweden and each other.

Scholarship - Science without Borders (Ciência sem Fronteiras)

Conselho Nacional de Desenvolvimento Científico e Tecnológico - “National Council for Scientific and Technological Development”

Kent, US

2013 - 2012

  • Research grant
  • Science without Borders is a large-scale nationwide scholarship program primarily funded by the Brazilian federal government. The program seeks to strengthen and expand the initiatives of science and technology, innovation and competitiveness through international mobility of undergraduate and graduate students and researchers.

Scholarship - Programa de Educação pelo Trabalho para a Saúde (PET-Saúde)

Ministry of Health, Brazil

Recife, BR

2012 - 2010

  • Research grant
  • As one of the intersectoral actions directed towards strengthening primary care and health surveillance, in accordance with the principles and requirements of the Sistema Único de Saúde (SUS), the program presupposes education through work and provides scholarships for tutors, preceptors, and healthcare undergraduate students. The program is part of the strategies of the Programa Nacional de Reorientação da Formação Profissional em Saúde (PRÓ-SAÚDE), implemented in the country since 2005.

Selected Publications

Machine Learning Models for Prediction of Diabetic Microvascular Complications

Kanbour S, Harris C, Lalani B, Wolf RM, Fitipaldi H, Gomez MF, Mathioudakis N. J Diabetes Sci Technol. 2024 Mar;18(2):273-286. doi: 10.1177/19322968231223726. Epub 2024 Jan 8. PMID: 38189280; PMCID: PMC10973856.

N/A

2024

Precision prognostics for cardiovascular disease in Type 2 diabetes: a systematic review and meta-analysis

Ahmad A, Lim LL, Morieri ML, Tam CH, Cheng F, Chikowore T, Dudenhöffer-Pfeifer M, Fitipaldi H, Huang C, Kanbour S, Sarkar S, Koivula RW, Motala AA, Tye SC, Yu G, Zhang Y, Provenzano M, Sherifali D, de Souza RJ, Tobias DK; ADA/EASD PMDI; Gomez MF, Ma RCW, Mathioudakis N. Commun Med (Lond). 2024 Jan 22;4(1):11. doi: 10.1038/s43856-023-00429-z. PMID: 38253823; PMCID: PMC10803333.

N/A

2024

Identification of biomarkers for glycaemic deterioration in type 2 diabetes

Roderick C. Slieker, Louise A. Donnelly, Elina Akalestou, Livia Lopez-Noriega, Rana Melhem, Ayşim Güneş, Frederic Abou Azar, Alexander Efanov, Eleni Georgiadou, Hermine Muniangi-Muhitu, Mahsa Sheikh, Giuseppe N. Giordano, Mikael Åkerlund, Emma Ahlqvist, Ashfaq Ali, Karina Banasik, Søren Brunak, Marko Barovic, Gerard A. Bouland, Frédéric Burdet, Mickaël Canouil, Iulian Dragan, Petra J. M. Elders, Celine Fernandez, Andreas Festa, Hugo Fitipaldi, …, Ewan R. Pearson & Guy A. Rutter. Nat Commun. 2023 May 3;14(1):2533. doi: 10.1038/s41467-023-38148-7. PMID: 37137910; PMCID: PMC10156700.

N/A

2023

Second international consensus report on gaps and opportunities for the clinical translation of precision diabetes medicine

Deirdre K. Tobias, Jordi Merino, Abrar Ahmad, Catherine Aiken, Jamie L. Benham, Dhanasekaran Bodhini, … Hugo Fitipaldi, … Maria F. Gomez, Peter A. Gottlieb, Siri Atma W. Greeley, Kurt Griffin, Andrew T. Hattersley, Irl B. Hirsch, Marie-France Hivert, Korey K. Hood, Jami L. Josefson, Soo Heon Kwak, Lori M. Laffel, Siew S. Lim, Ruth J. F. Loos, Ronald C. W. Ma, Chantal Mathieu, Nestoras Mathioudakis, James B. Meigs, Shivani Misra, Viswanathan Mohan, Rinki Murphy, Richard Oram, Katharine R. Owen, Susan E. Ozanne, Ewan R. Pearson, Wei Perng, Toni I. Pollin, Rodica Pop-Busui, Richard E. Pratley, Leanne M. Redman, Maria J. Redondo, Rebecca M. Reynolds, Robert K. Semple, Jennifer L. Sherr, Emily K. Sims, Arianne Sweeting, Tiinamaija Tuomi, Miriam S. Udler, Kimberly K. Vesco, Tina Vilsbøll, Robert Wagner, Stephen S. Rich & Paul W. Franks. Nat Med. 2023 Oct;29(10):2438-2457. doi: 10.1038/s41591-023-02502-5. Epub 2023 Oct 5. PMID: 37794253; PMCID: PMC10735053.

N/A

2023

Sociodemographic characteristics and COVID-19 testing rates: spatiotemporal patterns and impact of test accessibility in Sweden

Beatrice Kennedy, Georgios Varotsis, Ulf Hammar, Diem Nguyen, Germán D Carrasquilla, Vera van Zoest, Robert S Kristiansson, Hugo Fitipaldi, Koen F Dekkers, Meena Daivadanam, Mats Martinell, Jonas Björk, Tove Fall. Eur J Public Health. 2023 Nov 27:ckad209. doi: 10.1093/eurpub/ckad209. Epub ahead of print. PMID: 38011903.

N/A

2023

A phenome-wide comparative analysis of genetic discordance between obesity and type 2 diabetes

Coral, D. E., Fernandez-Tajes, J., Tsereteli, N., Pomares-Millan, H., Fitipaldi, H., Mutie, P. M., Atabaki-Pasdar, N., Kalamajski, S., Poveda, A., Miller-Fleming, T. W., Zhong, X., Giordano, G. N., Pearson, E. R., Cox, N. J. & Franks, P. W., 2023 Jan 26, (E-pub ahead of print) In: Nature Metabolism. 16 p.

N/A

2023

Discovery of drug–omics associations in type 2 diabetes with generative deep-learning models

Allesøe RL, Lundgaard AT, Hernández Medina R, Aguayo-Orozco A, Johansen J, Nissen JN, Brorsson C, Mazzoni G, Niu L, Biel JH, Brasas V, Webel H, Benros ME, Pedersen AG, Chmura PJ, Jacobsen UP, Mari A, Koivula R, Mahajan A, Vinuela A, Tajes JF, Sharma S, Haid M, Hong MG, Musholt PB, De Masi F, Vogt J, Pedersen HK, Gudmundsdottir V, Jones A, Kennedy G, Bell J, Thomas EL, Frost G, Thomsen H, Hansen E, Hansen TH, Vestergaard H, Muilwijk M, Blom MT, ’t Hart LM, Pattou F, Raverdy V, Brage S, Kokkola T, Heggie A, McEvoy D, Mourby M, Kaye J, Hattersley A, McDonald T, Ridderstråle M, Walker M, Forgie I, Giordano GN, Pavo I, Ruetten H, Pedersen O, Hansen T, Dermitzakis E, Franks PW, Schwenk JM, Adamski J, McCarthy MI, Pearson E, … Fitipaldi H, … Banasik K, Rasmussen S, Brunak S. In: Nature Biotechnology.

N/A

2023

Investigating the causal relationships between excess adiposity and cardiometabolic health in men and women

Mutie, P. M., Pomares-Millan, H., Atabaki-Pasdar, N., Coral, D., Fitipaldi, H., Tsereteli, N., Tajes, J. F., Franks, P. W. & Giordano, G. N., 2023, In: Diabetologia. 66, 2, p. 321-335

N/A

2023

Ethnic, gender and other sociodemographic biases in genome-wide association studies for the most burdensome non-communicable diseases: 2005-2022

Fitipaldi, H. & Franks, P. W. Hum Mol Genet. 2023 Jan 13;32(3):520-532. doi: 10.1093/hmg/ddac245. PMID: 36190496; PMCID: PMC9851743.

N/A

2023

App-based COVID-19 syndromic surveillance and prediction of hospital admissions in COVID Symptom Study Sweden

Kennedy, B., Fitipaldi, H., Hammar, U., Maziarz, M., Tsereteli, N., Oskolkov, N., Varotsis, G., Franks, C. A., Nguyen, D., Spiliopoulos, L., Adami, H-O., Björk, J., Engblom, S., Fall, K., Grimby-Ekman, A., Litton, J-E., Martinell, M., Oudin, A., Sjöström, T., Timpka, T., & 16 others, 2022 Apr 21, In: Nature Communications. 13, 1, 12 p., 2110.

N/A

2022

Distinct Molecular Signatures of Clinical Clusters in People with Type 2 Diabetes: an IMI-RHAPSODY Study

Slieker, R. C., Donnelly, L. A., Fitipaldi, H., Bouland, G. A., Giordano, G. N., Åkerlund, M., Gerl, M. J., Ahlqvist, E., Ali, A., Dragan, I., Elders, P., Festa, A., Hansen, M. K., van der Heijden, A. A., Aly, D. M., Kim, M., Kuznetsov, D., Mehl, F., Klose, C., Simons, K., & 15 others, 2021, In: Diabetes. 70, 11, p. 2683-2693

N/A

2021

Replication and cross-validation of T2D subtypes based on clinical variables: an IMI-RHAPSODY study

Slieker, R. C., Donnelly, L. A., Fitipaldi, H., Bouland, G. A., Giordano, G. N., Åkerlund, M., Gerl, M. J., Ahlqvist, E., Ali, A., Dragan, I., Festa, A., Hansen, M. K., Mansour Aly, D., Kim, M., Kuznetsov, D., Mehl, F., Klose, C., Simons, K., Pavo, I., Pullen, T. J., & 13 others, 2021, In: Diabetologia. 64, 9, p. 1982-1989 8 p.

N/A

2021

Predicting and elucidating the etiology of fatty liver disease: A machine learning modeling and validation study in the IMI DIRECT cohorts

Atabaki-Pasdar N, Ohlsson M, Viñuela A, Frau F, Pomares-Millan H, Haid M, Jones AG, Thomas EL, Koivula RW, Kurbasic A, Mutie PM, Fitipaldi H, Fernandez J, Dawed AY, Giordano GN, Forgie IM, McDonald TJ, Rutters F, Cederberg H, Chabanova E, Dale M, Masi F, Thomas CE, Allin KH, Hansen TH, Heggie A, Hong MG, Elders PJM, Kennedy G, Kokkola T, Pedersen HK, Mahajan A, McEvoy D, Pattou F, Raverdy V, Häussler RS, Sharma S, Thomsen HS, Vangipurapu J, Vestergaard H, ’t Hart LM, Adamski J, Musholt PB, Brage S, Brunak S, Dermitzakis E, Frost G, Hansen T, Laakso M, Pedersen O, Ridderstråle M, Ruetten H, Hattersley AT, Walker M, Beulens JWJ, Mari A, Schwenk JM, Gupta R, McCarthy MI, Pearson ER, Bell JD, Pavo I, Franks PW., 2020, In: PLoS Medicine. 17, 6, p. e1003149

N/A

2020

Genetic studies of abdominal MRI data identify genes regulating hepcidin as major determinants of liver iron concentration

Wilman HR, Parisinos CA, Atabaki-Pasdar N, Kelly M, Thomas EL, Neubauer S… Fitipaldi H, … Mahajan A, Hingorani AD, Patel RS, Hemingway H, Franks PW, Bell JD, Banerjee R, Yaghootkar H., 2019, In: Journal of Hepatology. 71, 3, p. 594-602

N/A

2019

A global overview of Precision Medicine in type 2 diabetes

Fitipaldi, H., McCarthy, M. I., Florez, J. C. & Franks, P. W., 2018, In: Diabetes. 67, 10, p. 1911-1922 12 p.

N/A

2018