Main
Hugo Fitipaldi
I’m a Data Scientist with a Ph.D. in Genetic Epidemiology and a strong background in statistical programming, data analysis, and machine learning, particularly within clinical research settings. My expertise lies in biostatistics and I excel at analyzing complex datasets to uncover significant trends and relationships, which I then translate into clear, practical insights that drive informed decision-making. I am particularly skilled in creating clear visualizations, dashboards, and presentations to communicate complex data effectively to diverse audiences.
Experience
Current2023
Postdoctoral Fellow
Diabetic Complications unit Lund University- I am leading the data analysis for a multicenter project involving European universities and hospitals, using machine learning algorithms on clinical, biobank, and image data to identify and validate prognostic and predictive biomarkers for rapid aggressive disease progression in diabetic kidney disease (DKD).
- Responsible for the harmonization of biobank and clinical data across multiple European centers, ensuring data integrity and consistency in collaborative research environments.
- Contributed to various systematic review projects, where I was responsible for implementing Natural Language Processing (NLP) techniques to extract and analyze key information from scientific papers, and for creating and maintaining data dashboards for exploring results.
20232018
PhD Candidate
GAME unit Lund University- Led the implementation of the CDISC format for the “All New Diabetics In Scania” (ANDIS) cohort within the IMI-RHAPSODY project, integrating clinical and OMICs data. This standardized data was essential for performing cross-validation and replication studies aimed at identifying subtypes of type 2 diabetes.
- Served as the primary analyst for the COVID Symptom Study Sweden, where I developed predictive models to estimate COVID-19 prevalence and hospital admissions using app-based data5. I also created and maintained an online dashboard for real-time data dissemination during the SARS-CoV-2 pandemic, enhancing public health response strategies.
- Employed data mining and Natural Language Processing (NLP) algorithms to analyze the characteristics of genomic research on non-communicable diseases (NCDs) from scientific literature and the GWAS Catalog. My work focused on identifying trends and disparities, contributing valuable insights into the genetic underpinnings of these conditions.
20182017
Research intern
GAME unit Lund University- Collaborated on projects integrating genomic and clinical data to explore gene-environment interactions, leveraging statistical models and machine learning algorithms.
- Developed an R package utilizing NLP’s Named Entity Recognition (NER) to automate the systematic extraction and analysis of key data from scientific documents.
- Conducted a comprehensive review on precision medicine in type 2 diabetes, emphasizing the integration of genomics into clinical practice and highlighting the required technological and data infrastructure.
20182018
Freelance Data Scientist
Remote- During this brief period, I primarily focused on projects related to text mining, machine learning, predictive modeling, and academic research.
Education
20232018
PhD, Genetic Epidemiology
Lund University Malmö, SE- Thesis: Use of data mining and artificial intelligence to derive public health evidence from large datasets
- Coursework included: Applied Statistics, Applied Statistics in Clinical Research, Introduction to Deep Learning, Artificial Intelligence in Medicine and Life Sciences.
20182016
Master of Medical Science in Public Health and Epidemiology
Lund University Lund, SE- Thesis: A global overview of Precision Medicine in type 2 diabetes: a systematic review.
- Coursework included: Epidemiology and Public Health Research Methodology, Planning and Leadership, Applied Public Health Research Methods.
20142008
Bachelors degree in Physiotherapy
Universidade Federal de Pernambuco Recife, BR- Thesis: Effects of taping on the contractile activity and strengthening of skeletal muscles: a narrative review (free translation from Portuguese).
20132012
Exercise Sciences
Kent State University Kent, US- Exchange Student - Science without Borders Program
Complementary Education
20242024
Introducing Generative AI with AWS
Udacity MOOC- This course offered an in-depth exploration of generative AI, emphasizing its foundational concepts and real-world applications. It covered foundational concepts in machine learning and generative AI with hands-on practice.
20222022
Machine Learning: Leveraging Data Insights
MIT Professional Education MOOC- With this online program in Machine Learning, participants will learn the four-step process of machine learning that leads from analyzing data to evaluating the effectiveness of decisions made based on that data. At the end of the program participants will have a better understanding of how machine learning tools and techniques contribute to more efficient decision making in many different environments.
20202020
Artificial Neural Network and Deep Learning
Lund University Lund, SE- A brief introduction to artificial neural networks - workshop.
20202020
Introduction to Natural Language Processing and Text Mining
Lund University Lund, SE- This workshop aims to give an introduction to quantitative methods for analyzing text. We will illustrate a few tools, resources and workflows, including word embeddings, text clustering and binary text classification.
20202020
Introduction to Deep Learning
Lund University Lund, SE- The aim of this course is to introduce students to common deep learning architectures such as multi-layer perceptrons, convolutional neural networks, and recurrent models such as the LSTM.
20192019
Deep Learning
University of Copenhagen Copenhagen, DN- A thorough introduction to the foundations of machine learning, especially neural networks including their training; Introduction to convolutional neural networks and recurrent neural networks; Training and applying convolutional and recurrent neural networks for image analysis; Making use of data augmentation and other preprocessing steps to further improve the generalization performance.
Honors and Awards
2024
Oxford Machine Learning Summer School 2024
Stiftelsen Landshövding Per Westlings minnesfond (95096) Oxford, UK- Travel grant
- Support for scientific research at Lund University and for grants for study trips for both older and younger scientists at the university.
2023
Next Generation Tech Booster Scholarship
Udacity & Bertelsmann Remote- Course cost
- As a global leader in media, education, and services, Bertelsmann wants to empower people around the world to be successful in the tech and data sectors, especially those individuals who historically may not have access to such skill-building opportunities. This program aims to set up eager learners for exciting, high-paying careers in tech.
2022
ACCESS Forum travel allowance and participation
ACCESS (Academic Collaboration Chile Sweden) Punta Arenas, Chile- Travel grant
- The ACCESS forum “Reconnecting for a Sustainable Future” offered a unique opportunity for academic communities from Sweden and Chile to come together, network, and exchange experiences. Parallel discussions on various research themes, virtual sessions, and presentations on funding opportunities were the main activities during the forum. The forum took place in Punta Arenas, Chile from November 7-11, 2022.
2022
Best Challenge Award (Data Science)
Danish Diabetes Academy Vejle, Denmark- Competition
- This prize was presented as part of the Data Science Spring School, hosted by the Danish Diabetes Academy. The event featured a series of lectures and activities centered around data science and artificial intelligence, culminating in a hackathon-style challenge.
2022
Microsoft Power BI Scholarship Program
Microsoft & Dataquest Remote- Course cost
- Dataquest and Microsoft have partnered to offer the first interactive Power BI courses. These courses teach Power BI through a project-based, in-browser approach. This prepares learners for real-world skill application, increasing comprehension and confidence. After completing the 12-week program, learners will be well prepared to pass the PL-300 Microsoft Power BI Data Analyst Exam and continue advancing their data careers.
2022
PyCon US 2022 Financial Aid
Python Software Foundation Remote- Conference cost
- Financial aid recipients receive support for some or all of their expenses including transportation, hotel, and childcare. Because PyCon is the largest Python conference in the world, it’s a meeting place for Python developers from around the world. Therefore, the financial aid award process is designed to enhance PyCon, including speakers, tutorial presenters, and notable open source contributors.
2022
Becas Santander Tech | Emerging Technologies Program by MIT Professional Education
Massachusetts Institute of Technology (MIT) Remote- Course cost
- The “Becas Santander Tech - Emerging Technologies Program by MIT Professional Education” facilitates and promotes the knowledge and use of the emerging innovative technological tools most in demand in companies and will be developed during the 2022 academic year. The main objective of the Program is to offer a relevant and practical learning experience, providing knowledge and skills to learn asynchronously and to be able to put into practice immediately.
2021
Bertelsmann Technology Scholarship
Udacity & Bertelsmann Remote- Course cost
- Technology Scholarship Program powered by Bertelsmann for the Intro to ML with Tensor Flow Challenge Course.
2021
Nordic Probabilistic AI School 2021
Norwegian University of Science and Technology (NTNU) Remote- Course cost
- The mission of the Nordic Probabilistic AI School (ProbAI) is to serve state-of-the-art expertise in machine learning and artificial intelligence to the public, students, academia, and industry. The selection of participants is based on multiple criteria such as experience, institution affiliation, geographical location, gender. The selection process is based on the submitted material and will be executed by a diversified committee of trusted experts.
2020
Virtual ODSC East 2020 Scholarship
Open Data Science Conference Remote- Conference cost
- Each year, candidates who have distinguished themselves through outstanding academic achievement and personal excellence are chosen to attend the Open Data Science Conference.
2020
Udacity Technology Scholarship powered by Bertelsmann (Deep Learning Nanodegree)
Udacity & Bertelsmann Remote- Course cost
- Out of thousands of students in the Technology Scholarship Challenge Course, the progress in the course and dedication in the community stood out and the student was awarded a full Deep Learning Nanodegree.
2020
PyCon US 2020 Financial Aid
Python Software Foundation Pittsburgh, US- Travel grant
- Financial aid recipients receive support for some or all of their expenses including transportation, hotel, and childcare. Because PyCon is the largest Python conference in the world, it’s a meeting place for Python developers from around the world. Therefore, the financial aid award process is designed to enhance PyCon, including speakers, tutorial presenters, and notable open source contributors.
2019
Bertelsmann Scholarship
Udacity & Bertelsmann Remote- Competition
- Bertelsmann’s media, services and educational offerings make it a leader in many areas of the digital world. Accordingly, the company wants to empower as many people as possible to be successful in the digital world. The initial 2019–2020 phase of the program is a two-stage scholarship, open to any student, 18 years of age or older, interested in Cloud Computing, Data Science or Artificial Intelligence. Recipients will spend 3.5 months learning key components for Cloud Computing, Data Analysis, or Artificial Intelligence. Top students from this initial phase will earn a full Nanodegree program scholarship.
2019
ODSC Europe 2019 Scholarship
Open Data Science Conference London, UK- Conference cost
- Each year, candidates who have distinguished themselves through outstanding academic achievement and personal excellence are chosen to attend the Open Data Science Conference.
2019
Medicinska fakulteten resebidrag för forskarstuderande
Lund University Lund, SE- Travel grant
- This grant is to cover travel costs to other institutions in Sweden or abroad which are related to the PhD education. All PhD students that are enrolled in the postgraduate education programme with the Faculty of Medicine in Lund/Malmö are eligible for this grant.
2018
Summer Research Scholarship
Lund University Lund, SE- Research grant
- The purpose of the summer scholarships is to make students interested in research and take the shape of two-month-long research projects.
20182016
Swedish Institute Study Scholarship
Svenska Institutet Stockholm, SE- Study grant
- The Swedish Institute (SI), a government agency, offers scholarships each year for international students and researchers coming to Sweden. The programme offers a unique opportunity for future leaders to develop professionally and academically, to experience Swedish society and culture and to build a long-lasting relationship with Sweden and each other.
20132012
Scholarship - Science without Borders (Ciência sem Fronteiras)
Conselho Nacional de Desenvolvimento Científico e Tecnológico - “National Counsel of Technological and Scientific Development” Kent, US- Research grant
- Science without Borders is a large-scale nationwide scholarship program primarily funded by the Brazilian federal government. The program seeks to strengthen and expand the initiatives of science and technology, innovation and competitiveness through international mobility of undergraduate and graduate students and researchers.
20122010
Scholarship - Programa de Educação pelo Trabalho para a Saúde (PET-Saúde)
Ministry of Health, Brazil Recife, BR- Research grant
- As one of the intersectoral actions directed towards strengthening the primary care and health surveillance, in accordance with the principles and requirements of the Sistema Único de Saúde - SUS, the program presupposes education through work and provides scholarships for tutors, preceptors, and healthcare undergraduate students. The program is part of the strategies of the Programa Nacional de Reorientação da Formação Profissional em Saúde (PRÓ-SAÚDE), implemented in the country since 2005.
Selected Publications
2024
Machine Learning Models for Prediction of Diabetic Microvascular Complications
Kanbour S, Harris C, Lalani B, Wolf RM, Fitipaldi H, Gomez MF, Mathioudakis N. J Diabetes Sci Technol. 2024 Mar;18(2):273-286. doi: 10.1177/19322968231223726. Epub 2024 Jan 8. PMID: 38189280; PMCID: PMC10973856.2023
Precision prognostics for cardiovascular disease in Type 2 diabetes: a systematic review and meta-analysis
Ahmad A, Lim LL, Morieri ML, Tam CH, Cheng F, Chikowore T, Dudenhöffer-Pfeifer M, Fitipaldi H, Huang C, Kanbour S, Sarkar S, Koivula RW, Motala AA, Tye SC, Yu G, Zhang Y, Provenzano M, Sherifali D, de Souza RJ, Tobias DK; ADA/EASD PMDI; Gomez MF, Ma RCW, Mathioudakis N. Commun Med (Lond). 2024 Jan 22;4(1):11. doi: 10.1038/s43856-023-00429-z. PMID: 38253823; PMCID: PMC10803333.2023
Identification of biomarkers for glycaemic deterioration in type 2 diabetes
Roderick C. Slieker, Louise A. Donnelly, Elina Akalestou, Livia Lopez-Noriega, Rana Melhem, Ayşim Güneş, Frederic Abou Azar, Alexander Efanov, Eleni Georgiadou, Hermine Muniangi-Muhitu, Mahsa Sheikh, Giuseppe N. Giordano, Mikael Åkerlund, Emma Ahlqvist, Ashfaq Ali, Karina Banasik, Søren Brunak, Marko Barovic, Gerard A. Bouland, Frédéric Burdet, Mickaël Canouil, Iulian Dragan, Petra J. M. Elders, Celine Fernandez, Andreas Festa, Hugo Fitipaldi, …, Ewan R. Pearson & Guy A. Rutter. Nat Commun. 2023 May 3;14(1):2533. doi: 10.1038/s41467-023-38148-7. PMID: 37137910; PMCID: PMC10156700.2023
Second international consensus report on gaps and opportunities for the clinical translation of precision diabetes medicine
Deirdre K. Tobias, Jordi Merino, Abrar Ahmad, Catherine Aiken, Jamie L. Benham, Dhanasekaran Bodhini, … Hugo Fitipaldi, … Maria F. Gomez, Peter A. Gottlieb, Siri Atma W. Greeley, Kurt Griffin, Andrew T. Hattersley, Irl B. Hirsch, Marie-France Hivert, Korey K. Hood, Jami L. Josefson, Soo Heon Kwak, Lori M. Laffel, Siew S. Lim, Ruth J. F. Loos, Ronald C. W. Ma, Chantal Mathieu, Nestoras Mathioudakis, James B. Meigs, Shivani Misra, Viswanathan Mohan, Rinki Murphy, Richard Oram, Katharine R. Owen, Susan E. Ozanne, Ewan R. Pearson, Wei Perng, Toni I. Pollin, Rodica Pop-Busui, Richard E. Pratley, Leanne M. Redman, Maria J. Redondo, Rebecca M. Reynolds, Robert K. Semple, Jennifer L. Sherr, Emily K. Sims, Arianne Sweeting, Tiinamaija Tuomi, Miriam S. Udler, Kimberly K. Vesco, Tina Vilsbøll, Robert Wagner, Stephen S. Rich & Paul W. Franks. Nat Med. 2023 Oct;29(10):2438-2457. doi: 10.1038/s41591-023-02502-5. Epub 2023 Oct 5. PMID: 37794253; PMCID: PMC10735053.2023
Sociodemographic characteristics and COVID-19 testing rates: spatiotemporal patterns and impact of test accessibility in Sweden
Beatrice Kennedy, Georgios Varotsis, Ulf Hammar, Diem Nguyen, Germán D Carrasquilla, Vera van Zoest, Robert S Kristiansson, Hugo Fitipaldi, Koen F Dekkers, Meena Daivadanam, Mats Martinell, Jonas Björk, Tove Fall. Eur J Public Health. 2023 Nov 27:ckad209. doi: 10.1093/eurpub/ckad209. Epub ahead of print. PMID: 38011903.2023
A phenome-wide comparative analysis of genetic discordance between obesity and type 2 diabetes
Coral, D. E., Fernandez-Tajes, J., Tsereteli, N., Pomares-Millan, H., Fitipaldi, H., Mutie, P. M., Atabaki-Pasdar, N., Kalamajski, S., Poveda, A., Miller-Fleming, T. W., Zhong, X., Giordano, G. N., Pearson, E. R., Cox, N. J. & Franks, P. W., 2023 Jan 26, (E-pub ahead of print) In: Nature Metabolism. 16 p.2023
Discovery of drug–omics associations in type 2 diabetes with generative deep-learning models
Allesøe RL, Lundgaard AT, Hernández Medina R, Aguayo-Orozco A, Johansen J, Nissen JN, Brorsson C, Mazzoni G, Niu L, Biel JH, Brasas V, Webel H, Benros ME, Pedersen AG, Chmura PJ, Jacobsen UP, Mari A, Koivula R, Mahajan A, Vinuela A, Tajes JF, Sharma S, Haid M, Hong MG, Musholt PB, De Masi F, Vogt J, Pedersen HK, Gudmundsdottir V, Jones A, Kennedy G, Bell J, Thomas EL, Frost G, Thomsen H, Hansen E, Hansen TH, Vestergaard H, Muilwijk M, Blom MT, ’t Hart LM, Pattou F, Raverdy V, Brage S, Kokkola T, Heggie A, McEvoy D, Mourby M, Kaye J, Hattersley A, McDonald T, Ridderstråle M, Walker M, Forgie I, Giordano GN, Pavo I, Ruetten H, Pedersen O, Hansen T, Dermitzakis E, Franks PW, Schwenk JM, Adamski J, McCarthy MI, Pearson E, …Fitipaldi H…., Banasik K, Rasmussen S, Brunak S;., In: Nature Biotechnology.2023
Investigating the causal relationships between excess adiposity and cardiometabolic health in men and women
Mutie, P. M., Pomares-Milan, H., Atabaki-Pasdar, N., Coral, D., Fitipaldi, H., Tsereteli, N., Tajes, J. F., Franks, P. W. & Giordano, G. N., 2023, In: Diabetologia. 66, 2 , p. 321-3352023
Ethnic, gender and other sociodemographic biases in genome-wide association studies for the most burdensome non-communicable diseases: 2005-2022
Fitipaldi, H. & Franks, P. W., 2005-2022. Hum Mol Genet. 2023 Jan 13;32(3):520-532. doi: 10.1093/hmg/ddac245. PMID: 36190496; PMCID: PMC9851743.2022
App-based COVID-19 syndromic surveillance and prediction of hospital admissions in COVID Symptom Study Sweden
Kennedy, B., Fitipaldi, H., Hammar, U., Maziarz, M., Tsereteli, N., Oskolkov, N., Varotsis, G., Franks, C. A., Nguyen, D., Spiliopoulos, L., Adami, H-O., Björk, J., Engblom, S., Fall, K., Grimby-Ekman, A., Litton, J-E., Martinell, M., Oudin, A., Sjöström, T., Timpka, T., & 16 others, 2022 Apr 21, In: Nature Communications. 13, 1, 12 p., 2110.2021
Distinct Molecular Signatures of Clinical Clusters in People with Type 2 Diabetes: an IMIRHAPSODY Study
Slieker, R. C., Donnelly, L. A., Fitipaldi, H., Bouland, G. A., Giordano, G. N., Åkerlund, M., Gerl, M. J., Ahlqvist, E., Ali, A., Dragan, I., Elders, P., Festa, A., Hansen, M. K., van der Heijden, A. A., Aly, D. M., Kim, M., Kuznetsov, D., Mehl, F., Klose, C., Simons, K., & 15 others, 2021, In: Diabetes. 70, 11, p. 2683-26932021
Replication and cross-validation of T2D subtypes based on clinical variables: an IMI-RHAPSODY study
Slieker, R. C., Donnelly, L. A., Fitipaldi, H., Bouland, G. A., Giordano, G. N., Åkerlund, M., Gerl, M. J., Ahlqvist, E., Ali, A., Dragan, I., Festa, A., Hansen, M. K., Mansour Aly, D., Kim, M., Kuznetsov, D., Mehl, F., Klose, C., Simons, K., Pavo, I., Pullen, T. J., & 13 others, 2021, In: Diabetologia. 64, 9, p. 1982-1989 8 p.2020
Predicting and elucidating the etiology of fatty liver disease: A machine learning modeling and validation study in the IMI DIRECT cohorts
Atabaki-Pasdar N, Ohlsson M, Viñuela A, Frau F, Pomares-Millan H, Haid M, Jones AG, Thomas EL, Koivula RW, Kurbasic A, Mutie PM, Fitipaldi H, Fernandez J, Dawed AY, Giordano GN, Forgie IM, McDonald TJ, Rutters F, Cederberg H, Chabanova E, Dale M, Masi F, Thomas CE, Allin KH, Hansen TH, Heggie A, Hong MG, Elders PJM, Kennedy G, Kokkola T, Pedersen HK, Mahajan A, McEvoy D, Pattou F, Raverdy V, Häussler RS, Sharma S, Thomsen HS, Vangipurapu J, Vestergaard H, ’t Hart LM, Adamski J, Musholt PB, Brage S, Brunak S, Dermitzakis E, Frost G, Hansen T, Laakso M, Pedersen O, Ridderstråle M, Ruetten H, Hattersley AT, Walker M, Beulens JWJ, Mari A, Schwenk JM, Gupta R, McCarthy MI, Pearson ER, Bell JD, Pavo I, Franks PW., 2020, In: PLoS Medicine. 17, 6, p. e10031492019
Genetic studies of abdominal MRI data identify genes regulating hepcidin as major determinants of liver iron concentration
Wilman HR, Parisinos CA, Atabaki-Pasdar N, Kelly M, Thomas EL, Neubauer S… Fitipaldi H, … Mahajan A, Hingorani AD, Patel RS, Hemingway H, Franks PW, Bell JD, Banerjee R, Yaghootkar H., 2019, In: Journal of Hepatology. 71, 3, p. 594-6022018