Introduction:

This program crawls the Chinese-language New York Times website and aligns the Chinese sentences with their English counterparts.

How to run the crawler:

python3 scraper.py --num_pages [NUM]
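
For readers unfamiliar with how a flag like `--num_pages` is typically handled, here is a minimal sketch using `argparse`. Only the flag name comes from this README; the function name `parse_args`, the default of 1, and the reading of NUM as "number of pages to crawl" are assumptions, and the actual scraper.py may differ.

```python
import argparse


def parse_args(argv=None):
    """Parse the crawler's command line (hypothetical sketch, not scraper.py itself)."""
    parser = argparse.ArgumentParser(description="Crawl article pages.")
    # Assumed meaning: how many pages the crawler should fetch.
    # The default of 1 is an illustrative choice, not taken from the project.
    parser.add_argument(
        "--num_pages",
        type=int,
        default=1,
        help="number of pages to crawl (assumed meaning)",
    )
    args = parser.parse_args(argv)
    if args.num_pages < 1:
        # parser.error prints a usage message and exits with status 2.
        parser.error("--num_pages must be a positive integer")
    return args


if __name__ == "__main__":
    print(parse_args().num_pages)
```

With a parser like this, `python3 scraper.py --num_pages 5` would yield `args.num_pages == 5`.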

How to run the aligner:

python3 aligner.py
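
This README does not say how aligner.py matches sentences. As an illustration of one common approach, the sketch below aligns sentences by length with dynamic programming, in the spirit of Gale-Church alignment. It is not the project's method: the function names, the length ratio of 3.0 English characters per Chinese character, and both penalty values are assumptions chosen for the example.

```python
import re


def split_zh(text):
    # Split after Chinese sentence-final punctuation (zero-width split, Python 3.7+).
    return [s.strip() for s in re.split(r"(?<=[。！？])", text) if s.strip()]


def split_en(text):
    # Split on whitespace that follows English sentence-final punctuation.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def align(zh, en, ratio=3.0, skip_penalty=50.0, merge_penalty=5.0):
    """Align two sentence lists by length; returns (zh_text, en_text) pairs.

    ratio, skip_penalty and merge_penalty are illustrative assumptions.
    """
    inf = float("inf")
    n, m = len(zh), len(en)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    # Allowed moves: 1-1 match, skip one side, or merge two sentences into one.
    beads = [(1, 1), (1, 0), (0, 1), (2, 1), (1, 2)]
    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == inf:
                continue
            for di, dj in beads:
                ni, nj = i + di, j + dj
                if ni > n or nj > m:
                    continue
                if di == 0 or dj == 0:
                    step = skip_penalty  # one side has no counterpart
                else:
                    zh_len = sum(len(s) for s in zh[i:ni])
                    en_len = sum(len(s) for s in en[j:nj])
                    step = abs(zh_len * ratio - en_len)
                    if di + dj > 2:
                        step += merge_penalty  # mildly discourage merges
                if cost[i][j] + step < cost[ni][nj]:
                    cost[ni][nj] = cost[i][j] + step
                    back[ni][nj] = (i, j)
    # Walk the back-pointers from the end to recover the chosen alignment.
    pairs = []
    i, j = n, m
    while (i, j) != (0, 0):
        pi, pj = back[i][j]
        pairs.append(("".join(zh[pi:i]), " ".join(en[pj:j])))
        i, j = pi, pj
    pairs.reverse()
    return pairs


if __name__ == "__main__":
    zh_sents = split_zh("你好。我今天很高兴。")
    en_sents = split_en("Hello. I am very happy today.")
    for zh_text, en_text in align(zh_sents, en_sents):
        print(zh_text, "|", en_text)
```

Length alone is a weak signal, so a production aligner would usually combine it with a bilingual lexicon or sentence embeddings; the sketch only shows the dynamic-programming skeleton.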