Data Analyst: il top player che tutti desiderano in azienda

51
Azzurra Ragone Data Driven Innovation Summit 2016 DATA SCIENTIST: il top player che tutti desiderano in azienda

Transcript of Data Analyst: il top player che tutti desiderano in azienda

Azzurra RagoneData Driven Innovation Summit 2016

DATA SCIENTIST: il top player che tutti desiderano in azienda

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra Ragone Università di Milano Bicocca

https://it.linkedin.com/in/azzurraragone@azzurraragone

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra RagoneData Driven Innovation Summit 2016

“Levare il superfluo dalla materia per ridurre a forma di corpo che nella idea dello artefice è disegnata".(Vasari)

Azzurra RagoneData Driven Innovation Summit 2016

The McKinsey Global Institute predicted that by 2018 the U.S. could face a shortage of 1.5 million peoplewho know how to leverage data analysis to make effective decisions

Azzurra RagoneData Driven Innovation Summit 2016

The first steps on your path to professional data viz

Azzurra RagoneData Driven Innovation Summit 2016

Data Career Paths

Degree coursesEngineeringStatisticsMathematics

...

Data Science MastersMix of computer science, statistics, sociology, economic

Online ResourcesMassive Open Online Courses (MOOC)Tutorials

Azzurra RagoneData Driven Innovation Summit 2016

Data Scientist key competences

Machine LearningProgramming languages Math, Stats

Azzurra RagoneData Driven Innovation Summit 2016

Technology

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra RagoneData Driven Innovation Summit 2016

Data Scientist Jobs - Google Trends

Azzurra RagoneData Driven Innovation Summit 2016

Data Analyst Jobs - Google Trends

Azzurra RagoneData Driven Innovation Summit 2016

Same mission in the company

To glean insight from the massive pool of data available

Azzurra RagoneData Driven Innovation Summit 2016

Where Data Analytics ends and Data Science begins...

Azzurra RagoneData Driven Innovation Summit 2016

A data analyst is a junior data scientist

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra RagoneData Driven Innovation Summit 2016

Data Analysts focus on visualizations describing and summarizing the past

Azzurra RagoneData Driven Innovation Summit 2016

Data Scientists manipulate data and create models to improve the future (predictive models)

Azzurra RagoneData Driven Innovation Summit 2016

Data scientist?

Thomson Nguyen - CEO and Co-founder of Framed Data

Azzurra RagoneData Driven Innovation Summit 2016

Academic Curiosity

Azzurra RagoneData Driven Innovation Summit 2016

“ Academic curiosity is a desire to go beneath the surface and distill a problem into a very clear set of hypotheses that can be tested [...] look at the available data sets and sources to figure out an experiment or a model that solves one of the company’s problems.Thomson Nguyen

Azzurra RagoneData Driven Innovation Summit 2016

Storytelling

Azzurra RagoneData Driven Innovation Summit 2016

Product Sense

Azzurra RagoneData Driven Innovation Summit 2016

Statistical and machine learning

knowledge

Azzurra RagoneData Driven Innovation Summit 2016

Engineering experience

Azzurra RagoneData Driven Innovation Summit 2016

Creativity

Azzurra RagoneData Driven Innovation Summit 2016

“Skills to work with Big Data” Source: “Creating Value through Open Data” - EU commission - 2015

Azzurra RagoneData Driven Innovation Summit 2016

Data Specialist team

Data Scientist

Data Analyst

Data Engineer

Azzurra RagoneData Driven Innovation Summit 2016

Data Science

Azzurra RagoneData Driven Innovation Summit 2016

Graph Analytics

Azzurra RagoneData Driven Innovation Summit 2016

The Indexable Web graph

Billions of nodes, trillions of edges

Source: Wired

Azzurra RagoneData Driven Innovation Summit 2016

The Facebook graph

1B of nodes (users), 200B of edges(friendships)

Azzurra RagoneData Driven Innovation Summit 2016

Recommender Systems

Matrix with:Millions of usersMillions of itemsBillions of ratings

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra RagoneData Driven Innovation Summit 2016

Personalized Information Access

▪ Help the user in finding the information they might be interested in

▪ Consider their preferences/past behaviour

▪ Filter irrelevant information

Azzurra RagoneData Driven Innovation Summit 2016

Recommender Systems

Matching

Information Overload

Azzurra RagoneData Driven Innovation Summit 2016

Azzurra RagoneData Driven Innovation Summit 2016

Explanation

Transparency Validity

TrustworthinessEffectiveness

Efficiency

SatisfactionRelevance

Persuasiveness

Comprehensibility

Education

Azzurra RagoneData Driven Innovation Summit 2016

Explanation in Netflix

Azzurra RagoneData Driven Innovation Summit 2016

Accuracy

Diversity

Novelty

Serendipity

Cross-domain

Azzurra RagoneData Driven Innovation Summit 2016

Semantic Recommender Systems

Azzurra RagoneData Driven Innovation Summit 2016

The Linked Open Data Cloud

Web 3.0

Source: (http://lod-cloud.net/)

Azzurra RagoneData Driven Innovation Summit 2016

LOD is the Web

Azzurra RagoneData Driven Innovation Summit 2016

RDF - Resource Description Framework

Azzurra RagoneData Driven Innovation Summit 2016

SPARQL

Azzurra RagoneData Driven Innovation Summit 2016

“ If I had to express my views about the digital future – that of Europe or indeed, of the whole world - I could do it with one word: data.

Andrus Ansip, Vice-President Digital Single Market

Azzurra RagoneData Driven Innovation Summit 2016

Thanks!!Any questions?You can find me at [email protected]