Exploratory analysis of endangered languages with pandas

On the occasion of International Mother Language Day (21 February), I wrote an essay (in French) about the importance of preserving endangered languages, and I decided to pair it with a data science challenge. In this simple project, I analysed and visualised the global distribution of language families and dialects. Topic: Exploratory Data Analysis... Continue Reading →

Regression Challenge for Kaggle Playground

At the beginning of 2021, Kaggle introduced a new competition format aimed at beginners. On the 1st of each month, a month-long Playground competition is launched, where you can practise your ML skills on simple tabular datasets. Apart from the competitive experience, the top 3 teams win some Kaggle merchandise!... Continue Reading →

How to scrape IMDb with R

Web scraping is a method of automatically gathering data from websites in a structured manner and storing it in a local database or spreadsheet. Why would you do this? Because you're lazy. Or because it's practically impossible to copy-paste all the data you need from the website. Some popular use cases of web scraping are price... Continue Reading →

