movietrends's Issues
Complete Naver movie review crawler
- Implement Naver movie review crawler
- Crawl reviews for about 600 movies
Define project directory structure
ROOT
├── data: input data
├── resources: .rd, .rds, .pickle, etc.
├── src: source code
├── output: output data
└── (docs, refs, ...)
@lehup0803, when I split a project up fairly finely I usually end up with roughly this layout, so please feel free to change it however suits you.
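As a minimal sketch, the layout above could be created with a few lines of Python. The function name `create_layout` and the `extra` parameter are hypothetical and not part of the project; only the four directory names come from the tree above.

```python
from pathlib import Path
from typing import Iterable, List

# Directory names taken from the layout above.
PROJECT_DIRS = ["data", "resources", "src", "output"]


def create_layout(root: Path, extra: Iterable[str] = ()) -> List[Path]:
    """Create the project directories under `root` and return their paths.

    `extra` is a hypothetical hook for optional directories such as docs or refs.
    """
    created = []
    for name in [*PROJECT_DIRS, *extra]:
        path = root / name
        # exist_ok=True makes the call safe to repeat on an existing checkout.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```

Calling `create_layout(Path("."), extra=["docs", "refs"])` from the repository root would create any directories that are missing and leave existing ones untouched.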
Make datasets for analysis
- source code refactoring
- File list:
  - data/mv_std_info.dat
  - output/ts_reviews1.dat
  - output/ts_reviews2.dat
  - output/ts_reviews_cnt.dat
  - output/ts_scores.dat
  - output/ts_trend_index_aggr.dat
  - output/ts_trend_index.dat
Unify data file encoding type
Encoding Type: EUC-KR (default encoding type of NCMS)
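- Modify the source code so that new data files are created with EUC-KR encoding
- Convert the encoding of existing data files
- Modify the source code that currently reads files as UTF-8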
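The conversion step for existing files might look like the following sketch. It assumes the existing files are UTF-8 and uses Python's built-in `euc-kr` codec; the function name `convert_to_euc_kr` is hypothetical. It works on bytes so that line endings are preserved exactly.

```python
from pathlib import Path


def convert_to_euc_kr(src: Path, dst: Path, src_encoding: str = "utf-8") -> None:
    """Re-encode a text file to EUC-KR without touching its line endings.

    Assumes `src` is encoded as `src_encoding` (UTF-8 by default).
    Raises UnicodeEncodeError if a character has no EUC-KR representation.
    """
    text = src.read_bytes().decode(src_encoding)
    dst.write_bytes(text.encode("euc-kr"))
```

Code that previously read UTF-8 would then open the converted files with `open(path, encoding="euc-kr")`. Characters outside the EUC-KR repertoire (for example emoji in review text) raise an error here instead of being silently dropped, so such files would need a separate decision on how to handle them.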
Refactor source code
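- Define the layout of the initial input data and apply it
- Clean up the input and output types of methods
- Split the source code: trend lookup / analysis of lookup results / visualization / verification of how the API works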
Change data source from Wikipedia to NCMS
Note: need to check the file encoding of the data file. [#3]
Remove unnecessary files
Implement the method to request trends in keyword groups
- New feature: include the related search terms of each main keyword in the API request
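One way to picture this is a helper that turns each main keyword and its related search terms into one group of the request body. This is only a sketch: the issue does not show the request format, so the field names `groupName` and `keywords` and the function name `build_keyword_groups` are assumptions that would need to be checked against the trends API actually used.

```python
from typing import Dict, List


def build_keyword_groups(related: Dict[str, List[str]]) -> List[dict]:
    """Build one keyword group per main keyword.

    `related` maps a main keyword to its related search terms.
    The output field names are assumed, not taken from the project.
    """
    groups = []
    for main_keyword, related_terms in related.items():
        # Put the main keyword first and drop duplicates while keeping order.
        keywords = list(dict.fromkeys([main_keyword, *related_terms]))
        groups.append({"groupName": main_keyword, "keywords": keywords})
    return groups
```

The resulting list can be serialized with `json.dumps(..., ensure_ascii=False)` and placed in the request body, so that the trend for a movie reflects its related search terms as well as its title.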
Collect movie ratings
- collect netizen rating
- collect movie audience rating
- collect movie critic rating