Monthly Archives: February 2017
One Hot Encoding using sklearn
The dataset is the famous Titanic dataset. "onehotlabels" is a <891x1726 sparse matrix of type '<type 'numpy.float64'>' with 4455 stored elements in Compressed Sparse Row format>. Part of it: (0, 1725) 1.0 (0, 1574) 1.0 (0, 1416) 1.0 (0, 892) … Continue reading
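As a rough illustration of what such a matrix looks like, here is a minimal sketch using sklearn's OneHotEncoder on a few made-up rows. The two columns (sex, embarked) and their values are assumptions chosen for illustration, not the columns or data used in the post. Each row stores exactly one 1.0 per encoded column, which is consistent with the figures quoted above: 4455 stored elements over 891 rows is 5 per row.

```python
from sklearn.preprocessing import OneHotEncoder

# Hypothetical toy rows standing in for two Titanic-style columns
# (sex, embarked). These values are illustrative, not from the post.
rows = [
    ["male", "S"],
    ["female", "C"],
    ["female", "S"],
]

# By default the encoder returns a SciPy sparse matrix of float64 values.
encoder = OneHotEncoder()
onehotlabels = encoder.fit_transform(rows)

# One output column per distinct category: female, male, C, S.
print(onehotlabels.shape)

# Number of stored (non-zero) elements: one per row per encoded column.
print(onehotlabels.nnz)

# Dense view, only sensible for small examples like this one.
dense = onehotlabels.toarray()
print(dense)
```

Printing the sparse matrix itself lists (row, column) value entries in the same style as the "(0, 1725) 1.0" lines in the excerpt.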
Posted in Python for data analysis
Exploratory Analysis of MovieLens Dataset using Python
The MovieLens 20M dataset: GroupLens Research has collected and made available rating data sets from the MovieLens web site (http://movielens.org). The data sets were collected over various periods of time, depending on the size of the set. 20 million ratings … Continue reading
Posted in Uncategorized
SQL commands cheat sheet 1 (W3Schools)
1. SELECT:
Format:
select column1, column2 from dataset
select * from dataset
select distinct column from dataset
select column1, column2 from table_name where (conditions)
Conditions: AND, OR, LIKE, BETWEEN, IN
For example, WHERE city … Continue reading
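The SELECT forms and WHERE conditions listed above can be tried out with a small sketch like the one below, which uses Python's built-in sqlite3 module and an in-memory database. The customers table, its columns, and its rows are hypothetical and exist only for this example. Other SQL dialects may differ in details.

```python
import sqlite3

# In-memory database with a hypothetical table, used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (name TEXT, city TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [
        ("Ann", "Berlin", 34),
        ("Bob", "Paris", 27),
        ("Cid", "Berlin", 45),
        ("Dee", "Porto", 19),
    ],
)


def run(sql):
    """Execute a query and return all result rows as a list of tuples."""
    return conn.execute(sql).fetchall()


# select * returns every column of every row.
all_rows = run("SELECT * FROM customers")

# select distinct drops duplicate values of the chosen column.
distinct_cities = run("SELECT DISTINCT city FROM customers ORDER BY city")

# AND binds more tightly than OR: (Berlin AND age > 40) OR Porto.
and_or = run(
    "SELECT name FROM customers "
    "WHERE city = 'Berlin' AND age > 40 OR city = 'Porto' ORDER BY name"
)

# LIKE with % matches any run of characters after the leading 'P'.
like_p = run("SELECT name FROM customers WHERE city LIKE 'P%' ORDER BY name")

# BETWEEN is inclusive at both ends.
between = run(
    "SELECT name FROM customers WHERE age BETWEEN 20 AND 40 ORDER BY name"
)

# IN tests membership in a list of values.
in_list = run(
    "SELECT name FROM customers WHERE city IN ('Paris', 'Porto') ORDER BY name"
)

print(distinct_cities, and_or, like_p, between, in_list)
```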
Posted in Uncategorized