Ah, writing a blog post! This is a pleasure I had almost forgotten, as you can guess from the publication date of my last post: it was around January... You may be wondering: what have you been doing all this time? Well, quite a lot indeed:
- changed my job (I am now working at Intesa Sanpaolo Banking Group on Basel III statistical models)
- became a dad for the third time (and if you are wondering, it’s a boy!)
- fixed some issues with the updateR package
- and I wrote a book!
I hope this pretty long list will help you forgive me for my long silence. I am actually pretty proud of all of them, but let’s talk about the book now. I think it is a useful contribution to the R community. But first of all, the title:
RStudio for R Statistical Computing Cookbook
The book is a concentrate of recipes targeted at the average R user, to take them from being in the mean to becoming an outlier. Statistical analogies aside, this book provides recipes to perform relevant activities within all phases of a data analysis project, namely:
- data acquisition;
- data cleansing;
- data visualisation;
- specific domain application (not actually a phase, I know…);
- results sharing.
data acquisition
Here you find all you need to get data from around the world into R, using social network APIs but also developing custom functions to access the specific APIs you need. Data import from nearly every kind of file is also covered, with hints on how to streamline your analyses.
data cleansing
Within this section you discover how to give your data a tidy structure, that is, a structure where each row is an observation and every column hosts an attribute (yeah yeah, we are talking about the tidy data framework by Hadley Wickham).
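To make the idea of a tidy structure concrete, here is a minimal sketch in base R. The `wide` table, its city names, and its values are made up for illustration and are not taken from the book; the point is only to show a table with one column per year being reshaped so that each row is one observation and each column one attribute.

```r
# Hypothetical wide table: one row per city, one column per year.
wide <- data.frame(
  city  = c("Milan", "Turin"),
  y2014 = c(10, 20),
  y2015 = c(11, 21),
  stringsAsFactors = FALSE
)

# Columns that hold values of the same variable, measured in different years.
year_cols <- c("y2014", "y2015")

# Tidy form: one row per (city, year) observation, one column per attribute.
tidy <- data.frame(
  city  = rep(wide$city, times = length(year_cols)),
  year  = rep(as.integer(sub("^y", "", year_cols)), each = nrow(wide)),
  value = unlist(wide[year_cols], use.names = FALSE),
  stringsAsFactors = FALSE
)

print(tidy)
```

In everyday work you would more likely reach for a dedicated reshaping package than build the data frame by hand; the base R version is used here only so that the sketch runs without installing anything.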
data visualisation
No matter if you are in the exploratory data analysis phase or in the production stage, data visualisation is a core step of your work (I know you already know this, but let me make it a bit more commercial, come on…). That’s why a whole section is devoted to showing you basic and advanced visualisation techniques, like great Sankey diagrams and 3D animated plots.
domain specific application
Domain specific applications are where R really shines, and you find here nearly a dozen great R language applications, from text analytics on PDFs to Benford analysis on e-commerce data and recommendation engine modelling.
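If you have never met Benford analysis, the idea is to compare the distribution of leading digits in your data with the one predicted by Benford's law, P(d) = log10(1 + 1/d). The following is a rough base R sketch of that comparison, not the recipe from the book: the `first_digit` helper is a hypothetical name, and the `amounts` vector is simulated rather than real e-commerce data.

```r
# Expected first-digit probabilities under Benford's law: log10(1 + 1/d).
benford_expected <- log10(1 + 1 / (1:9))

# Hypothetical helper: leading significant digit of each non-zero number.
# Scientific notation puts the leading digit in the first character.
first_digit <- function(x) {
  x <- abs(x[is.finite(x) & x != 0])
  as.integer(substr(formatC(x, format = "e", digits = 15), 1, 1))
}

# Simulated amounts spread evenly over six orders of magnitude (an assumption
# made only so that the example has something Benford-like to analyse).
set.seed(42)
amounts <- exp(runif(5000, min = 0, max = log(1e6)))

# Observed share of each leading digit, side by side with the expectation.
observed <- tabulate(first_digit(amounts), nbins = 9) / length(amounts)
print(round(cbind(digit = 1:9, expected = benford_expected, observed = observed), 3))
```

Large gaps between the two columns are what an analyst would then look into; a formal test (for example a chi-squared test) could follow, but is left out of this sketch.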
results sharing
We could have named this section “R Markdown is everywhere”, since within these chapters the reader is introduced to all possible flavours of this powerful tool provided by the RStudio guys. One of my favourite recipes is the one which shows you how to publish and curate a blog directly from RStudio.
Packt Publishing provided, for your convenience, a free chapter which will give you a first sense of how the book is structured and conceived. You can easily find it on SlideShare.
Summing it all up: if you are looking for a way to get from good to great with R, this book is going to be a good companion on your journey, or at least this was my objective 😅. And if you are wondering whether it is worth spending a sum of money on me, well, don’t forget I am now a dad of three voracious babies :).