Talk Cleaning data to analyze it is a major roadblock to data science. I will discuss two specific problems, missing values and categories which variants and typos, in the context of machine learning. This talk will be on recent publications but give simple solutions in Python. Speaker I am a research director at Inria (French National Computer Science Research Institute), studying machine learning for health, as well as a visiting professor at McGill university. I have a strong academic track record in f
Hide player controls
Hide resume playing