demo-1's Introduction

DEMO-1

USED TO LEARN GIT (CLONE)

Scraping Numbers from HTML using BeautifulSoup In this assignment you will write a Python program similar to http://www.py4e.com/code3/urllink2.py. The program will use urllib to read the HTML from the data files below, and parse the data, extracting numbers and compute the sum of the numbers in the file.

#We provide two files for this assignment. One is a sample file where we give you the sum for your testing and the other is the actual data you need to process for the assignment.

Sample data: http://py4e-data.dr-chuck.net/comments_42.html (Sum=2553) Actual data: http://py4e-data.dr-chuck.net/comments_716253.html (Sum ends with 65) You do not need to save these files to your folder since your program will read the data directly from the URL. Note: Each student will have a distinct data url for the assignment - so only use your own data url for analysis. Data Format The file is a table of names and comment counts. You can ignore most of the data in the file except for lines like the following:

Modu90 Kenzie88 Hubert87 You are to find all the tags in the file and pull out the numbers from the tag and sum the numbers.

THIS IS A LOCAL MADE CHANGE (JUST AN EXAMPLE)

Recommend Projects

magicmike29 / demo-1 Goto Github PK

demo-1's Introduction

DEMO-1

USED TO LEARN GIT (CLONE)

THIS IS A LOCAL MADE CHANGE (JUST AN EXAMPLE)

demo-1's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent