Tag: data analytics
Euro 2016 analytics: Who’s playing the toughest game?
I am really enjoying Uefa Euro 2016 Footbal Competition, even because our national team has done pretty well so far. That’s why after Ā browsing for a whileĀ statistics section of official EURO 2016 website I decided to do some analysis on the data they share ( as at the 21th of June).
Just to be clear from the beginning: we are not talking of anything too rigourus, but just about some interesting questions with related answers gathered mainly through data visualisation.
WeĀ can divide following analyses into two main parts: a first part were we analyse distribution of fouls and their incidence on matches outcome and a second part where ball possession in analysed, once again looking at relationship between this stat and matches outcome. Let’s start with fouls then.
which teamĀ committed the Ā greatest number of fouls?
Here we are with the first question. And here it is the answer:
Continue reading “Euro 2016 analytics: Who’s playing the toughest game?”
Over 50 practical recipes for data analysis with R in one book
- changed my job ( I am now working @ Intesa Sanpaolo Banking Group on Basel III statistical models)
- became dad for the third time (and if you are guessing, itās a boy!)
- fixed some issues with the updateR package
- and I wrote a book!
RStudio for R Statistical Computing Cookbook
Continue reading “Over 50 practical recipes for data analysis with R in one book”
how to list loaded packages in R: ramazon gets cleaver
It was around midnight here in Italy:
I shared the code on Github, published a post on G+,Ā Linkedin and Twitter and then went to bed.
In the next hoursĀ things got growing by themselves, with pleasant results like the following:
The R community found ramazon a really helpful package.
And I actually think it is: Amazon AWS is nowadays one of the most common tools for online web applications and websites hosting.
The possibility toĀ getĀ your Shiny App on it just running a function make it even more desirable for the amusingĀ R people.
Therefore, even if I developed ramazon for personal purposesĀ , I now feelĀ some kind of responsibility to further develop it and take it updated.
Continue reading “how to list loaded packages in R: ramazon gets cleaver”
Introducing Afraus: an Unsupervised Fraud Detection Algorithm
on average, fraud accounts for nearly the 5% of companies revenues
Continue reading “Introducing Afraus: an Unsupervised Fraud Detection Algorithm”
Catching Fraud with Benford’s law (and another Shiny App)
In the early ‘900 Frank Benford observed that ‘1’ was more frequent as first digit in his own logarithms manual.
More than one hundred years later, we can use this curious findingĀ to look for fraud on populations of data.
just give a try to the shiny app
What ‘Benford’s Law’ stands for?
Around 1938 Frank Benford, a physicist at the General Electrics research laboratories, observed that logarithmic tables were more wornĀ within first pages: was this casual or due to an actual prevalence of numbers near 1 as first digits?
Continue reading “Catching Fraud with Benford’s law (and another Shiny App)”
Network Visualisation With R
The main reason why
the solution: linker
Best Practices for Scientific Computing
I reproduce here below principles from the amazing paper Best Practices for Scientific Computing, published on 2012 by a group of US and UK professors. The main purpose of the paper is to “teach” Ā good programming habits shared from professional developers to people Ā that weren’t born developer, and became developers just for professional purposes.
Scientists spend an increasing amount of time building and using software. However, most scientists are never taught how to do this efficiently
Best Practices for Scientific Computing
-
Write programs for people, not computers.
- a program should not require its readers to hold more than a handful of facts in memory at once
- names should be consistent, distinctive and meaningful
- code style and formatting should be consistent
- all aspects of software development should be broken down into tasks roughly an hour long Continue reading “Best Practices for Scientific Computing”
excel right() function in R
as part of the excel functions in R, I have developed this custom function, reproducing the excel right() function in th R language. Feel free to copy and use it.
right = function (string, char){ substr(string,nchar(string)-(char-1),nchar(string))}
you can find other function in the Excel functions in R post.
excel left() function in R
as part of the excel functions in R, I haveĀ developedĀ this custom function, emulating the excel left() function in th R language. Feel free to copy and use it.
left = function (string,char){
Ā substr(string,1,char)}
you can find other function in the Excel functions in R post.