On the occasion of the International Mother Language Day (21st February), I wrote an essay (in French) about the importance of preserving endangered languages and I thought of pairing that with a data science challenge. In this simple project, I analysed and visualised the global distribution of language families and dialects. Topic: Exploratory Data Analysis... Continue Reading →
Regression Challenge for Kaggle Playground
At the beginning of the new year 2021, Kaggle created a new format of competitions aimed at beginners. On the 1st of each month, a month-long Playground competition is launched, where you can practice your ML skills on simple tabular datasets. Apart from competitive experience, the top 3 teams get to win some Kaggle merchandise!... Continue Reading →
“Skills to last a decade” – an online symposium by Women in Data Berlin
Women in Data is an international non-profit organization dedicated to increasing diversity in data-related careers by organizing conferences, networking events, and mentorship programs around the world. The Berlin Chapter of WiD was just founded in May this year and I was very excited to join as a Marketing Coordinator. This week, 27-31 July 2020, we... Continue Reading →
How to scrape IMDb with R
Web scraping is a method of automatically gathering data from websites in a structured manner and storing it into a local database or spreadsheet. Why would you do this? Because you're lazy. Or because it's really impossible to copy-paste all the data you need from the website. Some popular use-cases of web scraping are price... Continue Reading →