Skip to main content

Data Mining & Analytics with R : Introduction to R, RStudio and Rattle - Day 1

This is a blog post on a workshop I attended : a three-day hands-on Workshop on Data Mining & Analytics with R at Technopark, Trivandrum on 5th May 2015




Taught by : 
Graham Williams - Senior Director and Data Scientist , Australia
Graham.Williams@togaware.com

http://datamining.togaware.com

Read About 
- Literate Programming 
- Literate Data Mining

Philosophy
We are not writing program for the computer, it is written to share with other people.
Everything that we write should be written for others.
We should control the computer not vice versa. 

Introducing Data Science

Data Mining 
Started in around 1989.
Lego house - model - > not real but we get an idea. Data mining is all about building such models.

Analytics
Descriptive Analytics - what happens , suggestions in Amazon
Diagnostic Analytics - explain why the above happened
Correlation and Causation - Find up people who ended up in hospital after taking a particular drug
Predictive Analytics - machine learning and statistics models - predicts when it happen in the future.
Prescriptive Analytics - decide on what to do and how to decide the best interaction

Philosophy
Science brings knowledge, philosophy brings wisdom
Everything begins as science and ends as an ard. 

The Data Roles 
Data Technician -> Data Analyst (Add value to data) -> Data Miner (Computer Scientist/Statistician, Machine Learning etc) -> Data Scientist (Ability to follow one’s intuitions to draw it all together)

Continued.. Read Day 2 Here. 

Comments

Popular posts from this blog

Building Autonomous Drone with Raspberry Pi and APM 2.8

I am a total newbie to hardware and was pushing my limits to see how far I can reach on with hardware projects (which sparked my interest lately). I have set out on a very ambitions mission  to control a drone from raspberry pi .I began the research for this around 2 months ago and had brought a raspberry pi, drone body kit and apm flight controller. The key difference of this project from common drone projects is that I'm trying to avoid the use of and RC and instead use the raspberry pi to control it.  Hardware Ins tallation Setup: I am using APM 2.8 and Mission Planner. I am using RPi 3 to control the APM 2.8 via Telem port of APM I am planning to power the apm via the battery to ESC (Electronic Speed Controllers) Now, documenting my steps below: Day 1 Watch Tutorial To get started with APM flight controller, I watched this video tutorial [1] which gives a gentle introduction about APM board.  Setup APM board and Calibrate Sensors I downloaded the APM Missi

Hadoop The Definitive Guide [Book] - Study Notes

Chap-1- Meet Hadoop Requirement and adoption in yahoo. A framework that can scale to the web. Map and Reduce acitivity and features like data locality. Can be applied with a variety of algorithms Huge data processing can beat good algorithms Chap-2 - MapReduce The Map Java class and Reducer Java class The Job java class Jobtracker and tasktracker Hadoop reduces the input to input splits or just splits Map tasks write the intermediate output to local disks, so that they can be discarded after use. Outputs of Reduce tasks are stored in HDFS Combiner function can be run on map output, and the combiner functions output forms the input to the reduce function Hadoop streaming proivide hadoop apis in languages other than Java Chap-3 - The Hadoop Distributed Filesystem Fault tolerant solution. Same data written at multiple places. Filesystems that manage the storage across a network of machines are called distributed filesystems. Blocks - a block size is the minim

Adafruit GFX - How to change line spacing in text?

  You may want to update the line spacing to be a little lower than default due to small screen size on IoT devices. I faced this challenge while working on a Watchy hobby project. You may have used a font generator or just using the default fonts and got a *.h file that has the details of the font. In that case just change the last integer value in the PROGMEM variable.