Monthly Archives: February 2017

Learning curves (example)

 

Posted in Python for data analysis

One Hot Encoding using sklearn

The dataset is the famous Titanic dataset. "onehotlabels" is a <891x1726 sparse matrix of type '<type 'numpy.float64'>' with 4455 stored elements in Compressed Sparse Row format>. Part of it: (0, 1725) 1.0 (0, 1574) 1.0 (0, 1416) 1.0 (0, 892) … Continue reading
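
As a rough illustration of where a sparse matrix like this comes from, here is a minimal sketch using scikit-learn's OneHotEncoder. The rows are made-up stand-ins, not real Titanic records, and passing string categories directly assumes scikit-learn 0.20 or later; the original post may have prepared its columns differently.

```python
import numpy as np
from sklearn.preprocessing import OneHotEncoder

# Toy stand-in rows with two categorical columns (sex, embarked).
# These values are illustrative only, not taken from the Titanic data.
X = np.array(
    [
        ["male", "S"],
        ["female", "C"],
        ["female", "S"],
        ["male", "Q"],
    ],
    dtype=object,
)

# By default the encoder returns a SciPy sparse matrix.
encoder = OneHotEncoder()
onehotlabels = encoder.fit_transform(X)

# Each row stores exactly one 1.0 per encoded column.
print(onehotlabels.shape)  # (rows, total number of distinct categories)
print(onehotlabels.nnz)    # number of stored (non-zero) elements
print(onehotlabels)        # (row, column) value triples, as in the excerpt

# Dense view, only sensible for small matrices.
dense = onehotlabels.toarray()
```

Each input column contributes one stored element per row, so the number of stored elements is rows times encoded columns. In the output quoted above, 4455 stored elements over 891 rows works out to five per row, which would be consistent with five encoded columns.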

Posted in Python for data analysis

Exploratory Analysis of the MovieLens Dataset using Python

The MovieLens 20M dataset: GroupLens Research has collected and made available rating data sets from the MovieLens web site (http://movielens.org). The data sets were collected over various periods of time, depending on the size of the set.  20 million ratings … Continue reading

Posted in Uncategorized

SQL commands cheat sheet 1 (W3Schools)

1. SELECT
Format:
select column1, column2 from dataset
select * from dataset
select distinct column from dataset
select column1, column2 from table_name where (conditions)
Conditions: AND, OR, LIKE, BETWEEN, IN
For example, WHERE city … Continue reading
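
The SELECT forms listed above can be tried out with a small sketch using Python's built-in sqlite3 module and an in-memory database. The customers table, its columns, and its rows are hypothetical, invented only for this illustration.

```python
import sqlite3

# In-memory database with a hypothetical "customers" table (illustrative data).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (name TEXT, city TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [
        ("Ann", "Berlin", 31),
        ("Bob", "Paris", 45),
        ("Cid", "Berlin", 27),
        ("Dee", "Boston", 52),
    ],
)


def run(sql):
    """Execute a query and return all rows as a list of tuples."""
    return conn.execute(sql).fetchall()


# select column1, column2 from dataset
two_columns = run("SELECT name, city FROM customers")

# select * from dataset
everything = run("SELECT * FROM customers")

# select distinct column from dataset
distinct_cities = run("SELECT DISTINCT city FROM customers ORDER BY city")

# where (conditions) combining LIKE, AND, and BETWEEN
filtered = run(
    "SELECT name FROM customers "
    "WHERE city LIKE 'B%' AND age BETWEEN 25 AND 40 ORDER BY name"
)

# where (conditions) combining IN and OR
in_list = run(
    "SELECT name FROM customers "
    "WHERE city IN ('Paris', 'Boston') OR age < 30 ORDER BY name"
)

print(distinct_cities)
print(filtered)
print(in_list)
```

BETWEEN is inclusive at both ends, and SQLite's LIKE is case-insensitive for ASCII letters by default; other database engines may behave differently.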

Posted in Uncategorized