Skip to main content

Data Science Studies // Python/R [2018 Study Plan]

Starting My Journey into Data Science

It's time to start studying again, and I've decided against pursuing .NET/Web Programming in favor of Data Science. I've only begun researching what I need to learn to get better acquainted with Data Science, and for now, I will focus on Python Programming.

I'm already pretty good with SQL and Relational Databases (SQL Server, Oracle), but there's much more to explore. Beyond Math and Statistics, I want to understand how to work with unstructured data.

Subject List

My initial study list (subject to change as I learn what I need) includes the following topics, presented in no particular order:

  • Python Language
  • R Language
  • MongoDB / NoSQL
  • Big Data (Hadoop, Hive)
  • Cloud Tools (Amazon S3)

Additionally, I will need to brush up on my Math and Statistics skills, as it has been a few years since university.

Reading Estimate

  • 4,000 Pages [5 to 6 books, each 500 to 800 pages]
  • 10 months [100 pages per week] // Estimated Completion Time: October 2018

Resources

Python

  • Intro to Python, 5th Edition - Mark Lutz
  • Programming in Python - Mark Lutz

R

  • The Art of R Programming - Matloff

Certifications

There are several certifications available for Python and R:

  • 70-773 - Analyzing Big Data with Microsoft R [for MCSE - Data Management & Analytics] — $165 USD
  • 98-381 - Introduction to Python [for MTA] — $127 USD
  • MongoDB DBA Associate — $150 USD

Certifications are not my primary goal; rather, they serve as a measuring stick and pace-setter.

Updates

Update (12/13/17) - Python

I am currently studying the 5th Edition of Learning Python by Mark Lutz. It is a larger volume with 40 chapters. I am taking my time and have read about 10 chapters over 17 days. I will try to pick up the pace as I delve deeper into this book and hope to finish it by mid-January 2018.

The content in this book is substantial; I've already filled out a 70-page notebook and had to re-ink three fountain pens. At this rate, I will need three more notebooks.

Writing out the code manually helps me grasp it better and allows me to see differences, such as lists versus dictionaries.


Update (5/30/18) - Art of R Programming

I purchased The Art of R by Matloff and am currently studying it. I have installed R Studio on my i7 Windows 10 and i3 Linux systems.


Update (11/11/18) - Study Statistics First

This process is taking longer than anticipated. I have completed textbooks for both Python and R. Now, I am focusing my efforts on studying statistics itself rather than just tools or platforms. I aim to improve and hone my analytical mindset first, then augment it with the necessary tools.

    Comments

    Popular posts from this blog

    Sony MDR-ZX100 vs ZX-110 vs ZX310 Series Headphones

    Sony ZX Series Headphones Review: A Budget-Friendly Sound Choice If you’re on the hunt for budget-friendly headphones with decent quality, the Sony ZX Series is definitely worth considering. I happen to own several models from the lineup: ZX-100 ZX-110 ZX-310 Let’s dive into how they compare in terms of build quality, cost, specs, sound, and overall value. Build Quality: ZX-310 Takes the Lead The Sony ZX series headphones primarily feature a durable plastic construction. My ZX-100 has lasted over 2½ years, enduring countless tosses into my backpack and car without any issues. However, the lower-end ZX-100 and ZX-110 models have a significant downside: poor-quality earpads. Over time, these earpads disintegrate, leaving vinyl flakes that stick to your hair and ears. The ZX-310, on the other hand, comes with upgraded earpads that don’t suffer from this problem, making them a clear winner in the build department. Cost Comparison: ZX-100/110 Wins for Affordability While the ZX-310 model co

    Casio G-Shock 5600 vs 6900 vs 9000

    G-Shock Preferences and Favorites After trying out several G-Shock models, I've developed a better sense of the specific features and design elements I appreciate most. While features are always a plus, my main priority is size . Here's how some of the models I've tried stack up. Size Preference: DW-5600 Series For overall size, the DW-5600 series stands out as a favorite due to its compact, comfortable form. It’s slim, lightweight, and fits well on my wrist without being too bulky. Although the 6900 series provides the benefit of a well-placed front illumination button, the 5600 remains the ideal size for everyday wear. Best Compromise: G9000 Mudman Series If I had to choose a balanced option between size, comfort, and functionality, the G9000 Mudman series would be it. The buttons are slightly tough to press, but the layout and form factor resonate with what I prefer in a G-Shock. Despite having different module versions (GLX, G, and DW), I find that these models offe

    Eton Microlink FR160 Radio -- Sticky Residue

    Eton Microlink FR160 Handcrank Radio Review I bought an Eton Microlink FR160 handcrank radio for my emergency kit a few years ago, and it’s been great overall. However, there’s one significant issue I've encountered. Sticky Residue Problem Over time, a sticky residue developed on the radio's external surface, which was driving me nuts. At first, I thought there was something wrong with the device. Solution Fortunately, I researched the problem online and discovered that Eton radios are coated with a substance designed to make them easier to grip. Unfortunately, this coating degrades over time and turns into a sticky mess. To resolve the issue, I used isopropyl alcohol and cotton balls to clean most of the gunk. While some paint may have been lost in the process, at least the radio is no longer sticky.