Git Product home page Git Product logo

kaggle-avazu's People

Contributors

owenzhang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

kaggle-avazu's Issues

No such file or directory: '/dev/shm/_tmp_2way_v.txt.out' when run "python _2c_generate_fm_features.py" in "_0_run_me.sh"

while start _0_run_me.sh,
It report such error when run "python _2c_generate_fm_features.py"

t0tv_mx loaded with shape (10999998, 43)
t0 loaded with shape (10999998, 40)
22 all_withid
to write data files ...
/media/rzai/ai_data/prj/kaggle-2014-criteo/solvers/libffm-1.13/ffm-train -t 4 -s 8 -l 1e-5 /dev/shm/_tmp_2way_v.txt /dev/shm/_tmp_2way_t.txt
load results ...
Traceback (most recent call last):
File "_2c_generate_fm_features.py", line 78, in
fm_predv = pd.read_csv(open(path1 + '_tmp_2way_v.txt.out', 'r'), header=None).ix[:,0].values
IOError: [Errno 2] No such file or directory: '/dev/shm/_tmp_2way_v.txt.out'

out of mem when run 'python _1_encode_cat_features.py' on 64GB machine

site_model site_model 24 3335302 3335302
site_model site_model 25 3363122 3363122
site_model site_model 26 3835892 3835892
site_model site_model 27 3225010 3225010
site_model site_model 28 5287222 5287222
site_model site_model 29 3832608 3832608
site_model site_model 30 4218938 4218938
site_model site_model 31 4577464 4577464
app_model app_model 22 5337126 5337126
app_model app_model 23 3870752 3870752
app_model app_model 24 3335302 3335302
app_model app_model 25 3363122 3363122
app_model app_model 26 3835892 3835892
app_model app_model 27 3225010 3225010
app_model app_model 28 5287222 5287222
app_model app_model 29 3832608 3832608
app_model app_model 30 4218938 4218938
app_model app_model 31 4577464 4577464
dev_id_ip dev_id_ip 22 5337126 5337126
Traceback (most recent call last):
File "_1_encode_cat_features.py", line 56, in
calc_exptv(t0, exptv_vn_list)
File "/media/rzai/ai_data/prj/kaggle-avazu-2nd/utils.py", line 499, in calc_exptv
t0.loc[t0.day.values == day_v, vn_exp]=day_exps[day_v][vn_key]['exp']
File "/usr/local/lib/python2.7/dist-packages/pandas/core/indexing.py", line 118, in setitem
self._setitem_with_indexer(indexer, value)
File "/usr/local/lib/python2.7/dist-packages/pandas/core/indexing.py", line 210, in _setitem_with_indexer
take_split_path = self.obj._is_mixed_type
File "/usr/local/lib/python2.7/dist-packages/pandas/core/generic.py", line 2054, in _is_mixed_type
return self._protect_consolidate(f)
File "/usr/local/lib/python2.7/dist-packages/pandas/core/generic.py", line 2020, in _protect_consolidate
result = f()
File "/usr/local/lib/python2.7/dist-packages/pandas/core/generic.py", line 2053, in
f = lambda: self._data.is_mixed_type
File "/usr/local/lib/python2.7/dist-packages/pandas/core/internals.py", line 2568, in is_mixed_type
self._consolidate_inplace()
File "/usr/local/lib/python2.7/dist-packages/pandas/core/internals.py", line 2830, in _consolidate_inplace
self.blocks = tuple(_consolidate(self.blocks))
File "/usr/local/lib/python2.7/dist-packages/pandas/core/internals.py", line 3799, in _consolidate
_can_consolidate=_can_consolidate)
File "/usr/local/lib/python2.7/dist-packages/pandas/core/internals.py", line 3825, in _merge_blocks
new_values = new_values[argsort]
MemoryError

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.