Comments (2)
@hemant-git10 This is an OS-specific problem/doubt. You are better off asking this in StackOverflow, rather than the author :)
from brown-cluster.
I haven't tried this out with brown-cluster, but the usual way is either:
- Concatenate all the files into one single file (which might consume a lot of disk space):
cat file1 file2 file3 > bigfile
./wcluster --text bigfile --c 50
or
2. Concatenate all the files to stdin, then use /dev/stdin:
cat file1 file2 file3 | ./wcluster --text /dev/stdin --c 50 --output_dir combined-input
from brown-cluster.
Related Issues (15)
- A library for brown clustering? HOT 1
- Broken link to thesis HOT 2
- When size of data is large (over 100 MB), Brown-cluster program will be killed. How can I fix this error? HOT 2
- what are these results? HOT 1
- Is there any limit for the vocab size (#types)? HOT 5
- Is it possible to cluster new documents without relearning everything? HOT 2
- what happened if length of text is bigger than INT_MAX ?
- Clustering perplexity measure
- basic/prob-utils.cc:8:37: error: ‘M_PI’ was not declared in this scope HOT 1
- How to choose optimized number of cluster for specific input corpus ?
- Question
- Problem compiling on Windows 7 HOT 8
- how Paths2map is used
- Speed up with compiler optimization HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from brown-cluster.