Movie Review Scraper

movie review scraper to generate data for natural language processing

Description

Collecting data in the form of sentences can be useful for many applications including sentiment analysis using NLP or for a simple classification model. Gathering such data can often be a tedious and frustrating task. This project will help you generate the required data by performing a few simple steps.

Requirements

python
scrapy python package

How To Use

Installing scrapy

   pip install scrapy

This project was created on Scrapy 2.5.0, but any subsequent versions will also work.

Setting up the project

You will first need to go to the rotten tomatoes website and search for the movie whose reviews you want. Then proceed to the critic or audience reviews and click on view all. The link in your browser will act as the starting point for the crawler. Copy this link and navigate to Scrapy Web Crawler/crawler/crawler/spiders/review_spider.py. Paste the copied link in the start_urls list and also in the next_page variable below.

Running the project

To crawl the reviews and store them in a csv file, we simply have to navigate to the project in our command line terminal and write

   scrapy crawl reviews -o reviews.csv

Here, you can replace reviews.csv with any filename of your choice. The crawler will crawl and store the first 500 reviews of the movie. This can be changed by navigating to the reviews_spider.py file as shown above and changing the number of pages from 25 to the required amount.

Author Info

Instagram - @AnishMulay
Email - f20180907@goa.bits-pilani.ac.in

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Movie Review Scraper

Table of Contents

Description

Requirements

How To Use

Installing scrapy

Setting up the project

Running the project

Author Info

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Movie Review Scraper

Table of Contents

Description

Requirements

How To Use

Installing scrapy

Setting up the project

Running the project

Author Info

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages