picklable-itertools's Issues

Make sure every function has at least one verify_pickle

  • count
  • cycle
  • repeat
  • chain
  • compress
  • dropwhile
  • groupby
  • ifilter
  • ifilterfalse
  • islice
  • imap
  • starmap
  • tee
  • takewhile
  • izip
  • izip_longest
  • product
  • permutations
  • combinations
  • combinations_with_replacement
  • accumulate
  • file_iterator
  • range_iterator
  • dict_iterator
  • ordered_sequence_iterator
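
As a rough illustration of what one such check could look like, here is a minimal sketch. The helper name verify_pickle_sketch and its parameters are hypothetical and are not the project's actual verify_pickle signature; the example assumes Python 3, where built-in list iterators happen to be picklable and can stand in for a picklable iterator.

```python
import itertools
import pickle


def verify_pickle_sketch(make_iterator, n_total, n_before):
    """Consume n_before items, round-trip through pickle, compare the rest.

    make_iterator is a zero-argument callable returning a fresh iterator
    (a hypothetical convention chosen for this sketch).
    """
    # Reference run: what an uninterrupted iterator produces.
    reference = list(itertools.islice(make_iterator(), n_total))

    # Interrupted run: consume part, serialize, restore, consume the rest.
    it = make_iterator()
    head = [next(it) for _ in range(n_before)]
    restored = pickle.loads(pickle.dumps(it))
    tail = list(itertools.islice(restored, n_total - n_before))

    assert head + tail == reference
    return head + tail


# Stand-in iterator: list iterators are picklable in Python 3.
result = verify_pickle_sketch(lambda: iter([1, 2, 3, 4, 5]), n_total=5, n_before=2)
```

The point of pickling mid-iteration rather than at the start is that a freshly constructed iterator can round-trip correctly even when its position is not being saved.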

Badges?

I don't know how to get the badges in README.rst to render normally again.

Equizip

It'd be nice to have a picklable version of the equizip operator I wrote in mila-iqia/blocks#458. For example, the Merge transformer from mila-iqia/fuel#31 could use this (optionally, but by default) so that we can raise errors when a user is trying to zip two datastreams of different length.
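
A minimal sketch of what a picklable equizip could look like follows. It is not the implementation from mila-iqia/blocks#458; the class name EquizipSketch is hypothetical, the choice of ValueError is an assumption, and the example is written for Python 3, relying on the wrapped iterators being picklable themselves.

```python
import pickle


class EquizipSketch:
    """Zip iterables, raising ValueError if they differ in length."""

    def __init__(self, *iterables):
        self._iterators = [iter(iterable) for iterable in iterables]

    def __iter__(self):
        return self

    def __next__(self):
        values = []
        exhausted = 0
        # Advance every iterator so a length mismatch is always detected.
        for iterator in self._iterators:
            try:
                values.append(next(iterator))
            except StopIteration:
                exhausted += 1
        if exhausted == len(self._iterators):
            raise StopIteration  # all ended together (or none were given)
        if exhausted == 0:
            return tuple(values)
        raise ValueError("iterables have different lengths")


pairs = EquizipSketch([1, 2, 3], ["a", "b", "c"])
first = next(pairs)
# The instance state is just a list of iterators, so default pickling works
# as long as each wrapped iterator is picklable.
restored = pickle.loads(pickle.dumps(pairs))
rest = list(restored)

try:
    list(EquizipSketch([1, 2], [1]))
    mismatch_raised = False
except ValueError:
    mismatch_raised = True
```

Because all state lives in instance attributes rather than in a generator frame, the object can be pickled between calls to next.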

Set up buildbot

Travis, maybe Scrutinizer. Builds will run fairly infrequently, but this package is important enough for Blocks that we'd like it not to break.

PyPI checklist

  • Reorganize files to clean up namespace
  • Docstrings
  • Naive implementations of combinations and permutations (based on filtering product, permutations, etc. -- see itertools docs)
  • At least one serialization test per public function
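
The naive approach mentioned in the checklist can be sketched as below, following the equivalent-code recipes in the itertools documentation: permutations are the index tuples of product with no repeated index, and combinations are the permutations whose indices are in sorted order. The function names are hypothetical, and the generator form is used here only to show the filtering logic; generators are not picklable, so an actual implementation would presumably compose the package's own product and ifilter instead.

```python
import itertools


def naive_permutations(iterable, r=None):
    """Permutations as a filter over product(range(n), repeat=r)."""
    pool = tuple(iterable)
    n = len(pool)
    r = n if r is None else r
    for indices in itertools.product(range(n), repeat=r):
        # Keep only index tuples without repeated positions.
        if len(set(indices)) == r:
            yield tuple(pool[i] for i in indices)


def naive_combinations(iterable, r):
    """Combinations as a filter over permutations of indices."""
    pool = tuple(iterable)
    n = len(pool)
    for indices in naive_permutations(range(n), r):
        # Keep only index tuples that are already in sorted order.
        if sorted(indices) == list(indices):
            yield tuple(pool[i] for i in indices)


perms = list(naive_permutations("ABC", 2))
combs = list(naive_combinations("ABCD", 2))
```

These are far slower than the built-in versions, since most candidate tuples are generated only to be discarded, but they are simple to verify.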

itertoolz?

It would be great to extend this package to contain itertoolz implementations as well. The ones I can think of that are particularly handy are partition and partition_all, so that returning batches becomes: partition(batch_size, xrange(num_examples)).
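
A minimal sketch of a picklable partition_all-style iterator is shown below. The class name PartitionAllSketch is hypothetical, and the example assumes Python 3, where range iterators are picklable. In toolz, partition drops a final incomplete tuple while partition_all keeps it; this sketch follows the partition_all behaviour so that no examples are lost in the last batch.

```python
import itertools
import pickle


class PartitionAllSketch:
    """Yield tuples of up to n items; the last tuple may be shorter."""

    def __init__(self, n, iterable):
        self._n = n
        self._iterator = iter(iterable)

    def __iter__(self):
        return self

    def __next__(self):
        # Pull at most n items from the underlying iterator.
        batch = tuple(itertools.islice(self._iterator, self._n))
        if not batch:
            raise StopIteration
        return batch


batches = PartitionAllSketch(4, range(10))
first_batch = next(batches)
# Pickle between batches and continue from the restored copy.
restored = pickle.loads(pickle.dumps(batches))
remaining_batches = list(restored)
```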

xrange iterator

rangeiterator objects aren't picklable either, but a custom iterator should be simple enough.
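
A minimal sketch of such a custom iterator follows. The class name RangeIteratorSketch and its constructor arguments are hypothetical and may differ from what the package ends up providing. The example is written for Python 3 (where built-in range iterators are already picklable), but the same idea applies to Python 2's xrange: keep the position in plain integer attributes so the default pickling machinery can save it.

```python
import pickle


class RangeIteratorSketch:
    """Iterate over start, start + step, ... without reaching stop."""

    def __init__(self, start, stop, step=1):
        if step == 0:
            raise ValueError("step must not be zero")
        self._next = start
        self._stop = stop
        self._step = step

    def __iter__(self):
        return self

    def __next__(self):
        # Stop once the next value would reach or pass the bound.
        if self._step > 0 and self._next >= self._stop:
            raise StopIteration
        if self._step < 0 and self._next <= self._stop:
            raise StopIteration
        value = self._next
        self._next += self._step
        return value


it = RangeIteratorSketch(0, 10, 3)
first_value = next(it)
restored = pickle.loads(pickle.dumps(it))
remaining_values = list(restored)
descending = list(RangeIteratorSketch(5, 0, -2))
```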

Picklable file iterator?

@dwf Wondering if you consider this within the scope of the framework.

File handles are iterators, and they can't be pickled.

>>> import cPickle
>>> f = open('README.md')
>>> f
<open file 'README.md', mode 'r' at 0x7f13b1360d20>
>>> it = iter(f)
>>> it
<open file 'README.md', mode 'r' at 0x7f13b1360d20>
>>> next(it)
'picklable_itertools\n'
>>> cPickle.dumps(it)
TypeError: can't pickle file objects

Analogously to ordered_sequence_iterator, can I add a file_iterator? It will be a bit more involved, because it needs to handle a variety of things through custom __getstate__ and __setstate__ methods:

  • Upon pickling, close and remove file handle
  • When unpickling, re-open the file, re-iterate over the lines until we get back to where we were
  • Have sensible defaults for when the file is not available in the unpickling environment, is no longer writable, etc. Dill implements all of this, so we can borrow a lot of the behaviour from there.
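
The first two points above can be sketched roughly as follows. This is a hypothetical FileIteratorSketch, not the package's file_iterator and not Dill's behaviour. It assumes Python 3 and text-mode files, stores only the path and a line count when pickled, and deliberately leaves out the fallbacks from the third point: if the file is missing at unpickling time, open simply raises.

```python
import os
import pickle
import tempfile


class FileIteratorSketch:
    """Iterate over the lines of a file, remembering the position by line count."""

    def __init__(self, path):
        self._path = path
        self._lines_read = 0
        self._handle = open(path, "r")

    def __iter__(self):
        return self

    def __next__(self):
        line = next(self._handle)  # raises StopIteration at end of file
        self._lines_read += 1
        return line

    def close(self):
        self._handle.close()

    def __getstate__(self):
        # Leave the file handle out of the pickled state entirely.
        return {"path": self._path, "lines_read": self._lines_read}

    def __setstate__(self, state):
        self._path = state["path"]
        self._lines_read = state["lines_read"]
        # Re-open and skip the lines that were already consumed.
        self._handle = open(self._path, "r")
        for _ in range(self._lines_read):
            next(self._handle, None)


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "example.txt")
    with open(path, "w") as f:
        f.write("first\nsecond\nthird\n")
    it = FileIteratorSketch(path)
    first_line = next(it)
    restored = pickle.loads(pickle.dumps(it))
    remaining_lines = list(restored)
    it.close()
    restored.close()
```

Unlike the first point in the list, this sketch does not close the original handle when pickling, so the original iterator stays usable afterwards. Skipping by line count costs time proportional to the position in the file; recording a byte offset and seeking would be an alternative, at the price of more care around buffering and encodings.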
