R for Data Science | PDF下载|ePub下载
出版社: 人民邮电出版社
原作名: Cracking the coding interview:150 programming questions and solutions,fifth edition
译者:李琳骁/漆犇
出版年: 2013-11
页数: 372
定价: 59.00元
装帧: 平装
ISBN: 9787115332912
内容简介 · · · · · ·
Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible.
Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You’ll get a complete, big-picture understanding of the data science cycle, along with basic tools you need to manage the details. Each section of the book is paired with exercises to help you practice what you’ve learned along the way.
You’ll learn how to:
• Wrangle—transform your datasets into a form convenient for analysis
• Program—learn powerful R tools for solving data problems with greater clarity and ease
• Explore—examine your data, generate hypotheses, and quickly test them
• Model—provide a low-dimensional summary that captures true “signals” in your dataset
• Communicate—learn R Markdown for integrating prose, code, and results.
作者简介 · · · · · ·
Hadley Wickham is an Assistant Professor and the Dobelman FamilyJunior Chair in Statistics at Rice University. He is an active memberof the R community, has written and contributed to over 30 R packages, and won the John Chambers Award for Statistical Computing for his work developing tools for data reshaping and visualization. His research focuses on how to make data analysis better, faster and easier, with a particular emphasis on the use of visualization to better understand data and models.
Garrett Grolemund is a statistician, teacher and R developer who currently works for RStudio. He sees data analysis as a largely untapped fountain of value for both industry and science. Garrett received his Ph.D at Rice University in Hadley Wickham’s lab, where his research traced the origins of data analysis as a cognitive process and identified how attentional and epistemological concerns guide every data analysis.
Garrett is passionate about helping people avoid the frustration and unnecessary learning he went through while mastering data analysis. Even before he finished his dissertation, he started teaching corporate training in R and data analysis for Revolutions Analytics. He’s taught at Google, eBay, Axciom and many other companies, and is currently developing a training curriculum for RStudio that will make useful know-how even more accessible.
Outside of teaching, Garrett spends time doing clinical trials research, legal research, and financial analysis. He also develops R software, he’s co-authored the lubridate R package which provides methods to parse, manipulate, and do arithmetic with date-times and wrote the ggsubplot package, which extends the ggplot2 package.
目录 · · · · · ·
Chapter 2 Workflow: Basics
Chapter 3 Data Transformation with dplyr
Chapter 4 Workflow: Scripts
Chapter 5 Exploratory Data Analysis
Chapter 6 Workflow: Projects
Chapter 7 Tibbles with tibble
Chapter 8 Data Import with readr
Chapter 9 Tidy Data with tidyr
Chapter 10 Relational Data with dplyr
Chapter 11 Strings with stringr
Chapter 12 Factors with forcats
Chapter 13 Dates and Times with lubridate
Chapter 14 Pipes with magrittr
Chapter 15 Functions
Chapter 16 Vectors
Chapter 17 Iteration with purrr
Chapter 18 Model Basics with modelr
Chapter 19 Model Building
Chapter 20 Many Models with purrr and broom
Chapter 21 R Markdown
Chapter 22 Graphics for Communication with ggplot2
Chapter 23 R Markdown Formats
Chapter 24 R Markdown Workflow
· · · · · ·
发表回复
要发表评论,您必须先登录。