Category Archives: Python for data analysis

Learning curves (example)

 

Posted in Python for data analysis | Leave a comment

One Hot Encoding using sklearn

The dataset is the famous Titanic dataset. “onehotlabels” is a <891×1726 sparse matrix of type ‘<type ‘numpy.float64′>’ with 4455 stored elements in Compressed Sparse Row format>. Part of it: (0, 1725) 1.0 (0, 1574) 1.0 (0, 1416) 1.0 (0, 892) … Continue reading

Posted in Python for data analysis | Leave a comment