¡@

Home 

python Programming Glossary: datasets

Gantt Chart online

http://stackoverflow.com/questions/1005587/gantt-chart-online

for jQuery. It produces graphical plots of arbitrary datasets on the fly client side. The focus is on simple usage all settings..

Python Numpy Very Large Matrices

http://stackoverflow.com/questions/1053928/python-numpy-very-large-matrices

data on disk in HDF format with optional compression. My datasets often get 10x compression which is handy when dealing with tens..

“Large data” work flows using pandas

http://stackoverflow.com/questions/14262433/large-data-work-flows-using-pandas

but I currently lack an out of core workflow for large datasets. I'm not talking about big data that requires a distributed.. drive. My first thought is to use HDFStore to hold large datasets on disk and pull only the pieces I need into dataframes for.. information like criminal records bankruptcies etc... The datasets I use every day have nearly 1 000 to 2 000 fields on average..

How to prevent numbers being changed to exponential form in Python matplotlib figure

http://stackoverflow.com/questions/14711655/how-to-prevent-numbers-being-changed-to-exponential-form-in-python-matplotlib-fi

figure I'm using Matplotlib in Python to plot simple x y datasets. This produces really nice looking graphs although when I zoom..

How do I resolve 'NoneType' object has no attribute 'write' error with scikit-learn digits dataset?

http://stackoverflow.com/questions/17139658/how-do-i-resolve-nonetype-object-has-no-attribute-write-error-with-scikit-le

of the ref `stat_learn_tut_index`. from sklearn import datasets neighbors linear_model digits datasets.load_digits X_digits.. from sklearn import datasets neighbors linear_model digits datasets.load_digits X_digits digits.data y_digits digits.target n_samples.. to work fine... import numpy as np from sklearn import datasets iris datasets.load_iris iris_X iris.data iris_y iris.target..

Image segmentation based on edge pixel map

http://stackoverflow.com/questions/18972932/image-segmentation-based-on-edge-pixel-map

edge or non edge. I've used it successfully on a few image datasets but am running into problems with this particular dataset which..

Classifying Documents into Categories

http://stackoverflow.com/questions/3113428/classifying-documents-into-categories

score for a given model and given dataset. For larger datasets you should try the vowpal wabbit which is probably the fastest..

Calculating Pearson correlation and significance in Python

http://stackoverflow.com/questions/3949226/calculating-pearson-correlation-and-significance-in-python

coefficient measures the linear relationship between two datasets. Strictly speaking Pearson's correlation requires that each.. the probability of an uncorrelated system producing datasets that have a Pearson correlation at least as extreme as the one.. at least as extreme as the one computed from these datasets. The p values are not entirely reliable but are probably reasonable..

How to get started with Big Data Analysis

http://stackoverflow.com/questions/4322559/how-to-get-started-with-big-data-analysis

for example. Using the RHIPE package and finding toy datasets and problem areas. Finding the right information to allow me.. Play with that. Using the RHIPE package and finding toy datasets and problem areas. Fine. Play with that too. Don't sweat finding.. areas. Fine. Play with that too. Don't sweat finding big datasets. Even small datasets present very interesting problems. Indeed..

Is it possible to specify your own distance function using Scikits.Learn K-Means Clustering?

http://stackoverflow.com/questions/5529625/is-it-possible-to-specify-your-own-distance-function-using-scikits-learn-k-means

X np.random.exponential size N dim # cf scikits learn datasets t0 time if kmsample 0 centres xtoc dist kmeanssample X ncluster..

matplotlib - extracting data from contour lines

http://stackoverflow.com/questions/5666056/matplotlib-extracting-data-from-contour-lines

which could be used for further analysis as in general my datasets are not a small or simple as the example above. python numpy..