cover

It is a truth universally acknowledged that a data scientist in possession of a good portfolio must be in want of a job. A curated selection of your projects is the best way to showcase your work, interests, and thinking to potential employers. From my experience and discussions with colleagues, I foundย four aspectsย that make a data science portfolio impressive:

โœ” ๐—ค๐˜‚๐—ฎ๐—น๐—ถ๐˜๐˜† > ๐—ค๐˜‚๐—ฎ๐—ป๐˜๐—ถ๐˜๐˜†: Itโ€™s better to have only two complex or specialized projects than tens of repos with incomplete ones and errors. Also, psychologically, people tend to get stressed and lose interest if they are given too many options (the paradox of choice). The point is to give employers an idea of your potential and that you can complete a project from idea to presentation. For example, I haveย 16 repos on GitHub, but I only showcase 4-6 of them, which are completed projects.

โœ” ๐—ข๐—ฟ๐—ถ๐—ด๐—ถ๐—ป๐—ฎ๐—น๐—ถ๐˜๐˜†: Wherever you study data science, youโ€™ve most probably learned to predict theย Boston house prices, classify theย Iris flowersย andย Titanic survivors. Though these projects are a good start for learning the basics, they donโ€™t impress anyone anymore, because they are so common. Instead, explore a new dataset of your own interest, apply different models and answer questions that you find insightful. I chose projects that reflect my interests in Linguistics (exploring a dataset on world languages), literature (exploring my Goodreads library), and NLP (doing sentiment analysis on product review).

โœ” ๐—ฅ๐—˜๐—”๐——๐— ๐—˜: You wouldnโ€™t buy a book or read a paper without checking out its summary or abstract first, to see if itโ€™s interesting and worth your time. Same with data science projects. Add a README including the table of contents, a short description of the projects and maybe key findings. For example, I added a README describing the theory, summary, and main tools used inย my psych-verb project.

โœ” ๐—ฃ๐—ฒ๐—ฟ๐˜€๐—ผ๐—ป๐—ฎ๐—น๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป: The more personal you make your projects and profile, the bigger their impact. It doesnโ€™t have to be anything elaborate or too informal, but express your personality. On GitHub itโ€™s really easy to do this with theย special profile READMEย and a custom status.ย I personally added a short descriptionย of my work interests and coding-related activities, and update my status depending on what Iโ€™m working on.

Creating a good data science portfolio takes time! Donโ€™t rush it, take your time to learn and polish both your coding skills and presentation skills.