Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix/iloc#19 #6

Open
wants to merge 40 commits into
base: master
Choose a base branch
from
Open

Conversation

kayibal
Copy link
Owner

@kayibal kayibal commented Nov 2, 2017

Fixes #19

kayibal and others added 30 commits April 17, 2017 16:34
distributed assign support
Add property columns and index to dask.SparseFrame and increase version to 0.8.0
…ules downgrade to 0.19 until dask releases patch
This is useful mainly to avoid dask processes sharing really big arrays in case the categories get really big
This is useful mainly to avoid dask processes sharing really big arrays in case the categories get really big
was broken if loc returned a single location or integers were used as indexers
kayibal and others added 10 commits July 11, 2017 11:37
if multiindex is contained it is restored when loading. This requires saving of metadata. In case metadata is not available because the file was saved with a previous version the index class is inferred by the array values.
* Fix empty attribute of sparsity.SparseFrame

__init__ method did not initialize empty attribute correctly leading to errors when using the dask datastructure. This commit fixes this and adds corresponding tests.

* Adjust _is_empty method.

if data.nnz == 0 doesn't mean the array is empty it could be an array of purely zeros. replaced this check with a check for zero dimension which clearly indicates an empty frame.
* Possibility to one-hot encode multiple columns.
`one_hot_encode` signature significantly changed!

* Bump version to 0.12.0

* Fix docstring.

* Small refactoring.

* Added backward compatibility.

* Small bugfix.

* Changed interfaces to be (ugly but) consistent with previous version,
so that backward compatibility is cleaner.

* Use warnings module not to depend on drtools.

* Adding prefixes to one-hot-encoded column names.

When one-hot-encoding columns that contain same categories, resulting
 columns must have different names. This feature automatically adds
 original column name to resulting column name. (See docstrings.)
* Use versioneer to create version strings
* Add support to save npz files on s3

* fix ci

* fix ci

* fix to_npz

* Add mock_s3_fs docstring
@michcio1234 michcio1234 deleted the fix/iloc#19 branch November 8, 2017 14:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants