I get asked a lot which resources I would recommend for getting started with data science. Here are my tried and tested recommendations.
Data Science for Business
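Data Science for Business is a great overview book.
It’s not overly technical and makes the concepts very accessible. I found the section explaining how loss functions work particularly enlightening, especially compared with other explanations I’ve seen.
Most importantly, it’s about the process and business implications of data science. This book is good for managers, techies, and recent graduates alike.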
I read the book in the Kindle app on my phone, usually when I was travelling. It remained readable and intelligible in the early hours and late at night – a compelling endorsement in an area where books can be dense and make one’s brain bleed out of one’s ears.
R for Data Science
R for Data Science: Import, Tidy, Transform, Visualize, and Model Data is written by Hadley Wickham and Garrett Grolemund. You can buy it and you can also access it online.
If you’re interested in learning to actually start doing data science as a practitioner, this book is a very accessible introduction to programming.
Starting gently, this book doesn’t teach you much about the use of R from a general programming perspective. It takes a very task-oriented approach and teaches you R as you go along.
This book doesn’t cover the breadth and depth of data science in R, but it gives you a strong foundation in the coding skills you need and a sense of the process you’ll go through.
I really like this book, but it’s important to note you may have some gaps in your knowledge if this is your main introduction to R programming.
Introductions to R
There are many useful introductions to R. The difficulty is that not many of them focus on modern R.
I struggle with the ongoing debate about whether to teach people a strong understanding of base R (or vanilla R) or to teach them modern R (e.g. data.table and the tidyverse), which I perceive to be easier.
- Base R is quirky and quite difficult to learn. Modern R is much easier.
- Base R knowledge ensures you understand the core objects and programming constructs. Modern R focusses on tabular data, which is what most people need.
- Base R is stable. Modern R is bleeding edge.
There are very good base R introductory books but we’re still relatively lacking in ones that incorporate a lot of modern R.
- An Introduction to R
- R Cookbook: Proven Recipes for Data Analysis, Statistics, and Graphics
- Teach Yourself R in 24 Hours (modern R-ish)
User groups are a great way to meet people in the field and see some talks in the area. For the most part, I recommend you go to meetup.com and look for data science, Python, or R meetups near you.
I love user groups and I think they’re a great way to build a local network of people you can chat to about your start in data science.
There’s a reasonable number of people out there offering introductions to R and data science. I myself offer training, from bespoke to community workshops to training for BI people. You can often search Eventbrite or ask on Twitter to find an event happening near you. Your local user group is another great place to ask.
You can also check out a lot of conferences. KDnuggets keep a good list of conferences that you can attend.
As data science is becoming so popular, you can often see a lot of it appearing at non-data science conferences, especially data platform and analysis conferences. You might therefore be able to get started with data science at conferences you already know about.
I love reading blogs as a way of learning in an area – not only do you get technical knowledge but people are kind enough to share theory and current trends too. I read a variety that might not suit everyone but if you’re just starting out I recommend Becoming a Data Scientist and R-bloggers.
Getting hands-on is an important aspect of learning for me.
For a gentler way in, you can use Microsoft’s free notebook and machine learning platform to start coding things.
Alas, I’m not a podcast person so I’ve only one recommendation for you: Not So Standard Deviations.
By far my most recommended site for learning R and data science is DataCamp. DataCamp blends videos and online exercises, making it a great way to learn practically but still get the theory.
There are a number of free introductory courses, but DataCamp works on a subscription model costing $29 a month. For the monthly fee, you can consume any and all of their courses.
DataCamp is great value for money, especially if you want to do an intensive month of learning and then end your subscription.
Coursera is the original online course provider.
They’ve got some fantastic courses for getting started with data science.
With online courses like this, it’s easy to drop off and not finish a course. I recommend you start with just a course or two before you go for something like the $20k Masters in Data Science that you could work towards on there!
The post Getting started with data science – recommended resources appeared first on Locke Data. Locke Data are a data science consultancy aimed at helping organisations get ready and get started with data science.