mirror of
https://github.com/NohamR/phpBB-forum-scraper.git
synced 2026-02-22 02:25:43 +00:00
Updated README.md
10
README.md
@@ -3,9 +3,9 @@ Python-based scraper for phpBB forums.
 
 Code requires:
 
-1. Python scraping library, <a href="http://scrapy.org/" target="_blank">Scrapy</a>.
+1. Python scraping library, [Scrapy.]: http://scrapy.org/
 
-2. Python HTML parsing library, <a href="ttps://www.crummy.com/software/BeautifulSoup/bs4/doc/" target="_blank">BeautifulSoup</a>.
+2. Python HTML parsing library, [BeautifulSoup.]: https://www.crummy.com/software/BeautifulSoup/bs4/doc/
 
 
 ## Scraper Output
@@ -22,12 +22,6 @@ Scrapes the following information from forum posts:
 5. Quoted text
 
-
-allowed_domains = ['']
-start_urls = ['']
-username = ''
-password = ''
-form_login = False
 
 Edit `phpBB.py` and specify:
 
 1. `allowed_domains`
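The five settings this commit removes from the README (`allowed_domains`, `start_urls`, `username`, `password`, `form_login`) are the values the README asks you to specify in `phpBB.py`. As a minimal sketch of what filled-in values might look like, the snippet below uses a plain class rather than a real Scrapy spider, so it runs with the standard library alone. The class name `PhpBBSettings`, the helper `check_settings`, and the `forum.example.com` values are hypothetical and not taken from the repository. The rule that `form_login = True` requires credentials is an assumption about how the spider uses these fields.

```python
from urllib.parse import urlparse


class PhpBBSettings:
    """Hypothetical stand-in for the attributes edited in phpBB.py."""
    allowed_domains = ['forum.example.com']                        # placeholder domain
    start_urls = ['https://forum.example.com/viewforum.php?f=1']   # placeholder URL
    username = ''
    password = ''
    form_login = False  # assumed: True means "log in with username/password"


def check_settings(cfg):
    """Return a list of problems found in the settings (empty if none)."""
    problems = []
    if not any(cfg.allowed_domains):
        problems.append('allowed_domains is empty')
    if not any(cfg.start_urls):
        problems.append('start_urls is empty')
    domains = [d for d in cfg.allowed_domains if d]
    for url in cfg.start_urls:
        if not url:
            continue
        host = urlparse(url).hostname or ''
        # A start URL should sit on an allowed domain or one of its subdomains.
        if not any(host == d or host.endswith('.' + d) for d in domains):
            problems.append(f'{url} is outside allowed_domains')
    # Assumption: form login only makes sense when credentials are supplied.
    if cfg.form_login and not (cfg.username and cfg.password):
        problems.append('form_login is True but username/password are missing')
    return problems


if __name__ == '__main__':
    print(check_settings(PhpBBSettings))
```

Running a check like this before starting a crawl catches the unedited defaults (`['']` and `''`) shown in the removed README lines.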