Commit
Refactor crawler file name
shhs29 committed Mar 4, 2022
1 parent 495eca6 commit 6fc1268
Showing 2 changed files with 3 additions and 3 deletions.
2 changes: 1 addition & 1 deletion README.md
@@ -3,5 +3,5 @@ main.py contains code to crawl ontario tech websites and visualise it as a graph

To run crawler, execute command:
```
-python main.py
+python crawler.py
```
4 changes: 2 additions & 2 deletions main.py → crawler.py
@@ -6,7 +6,7 @@
from bs4 import BeautifulSoup


-def crawler(root_url: str):
+def crawl_engine(root_url: str):
     # list of extensions to ignore from crawler
     extensions = ['.jpg', '.jpeg', '.png', '.pdf', '.svg']
     # create a connection pool to handle requests
@@ -89,4 +89,4 @@ def crawler(root_url: str):

if __name__ == '__main__':
     # crawl all ontario tech webpages
-    crawler(root_url="https://ontariotechu.ca/")
+    crawl_engine(root_url="https://ontariotechu.ca/")
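
The diff shows the `extensions` ignore list but not the code that applies it. As a rough sketch only, one way such a list could be used to skip non-HTML links is shown below; `should_skip` and `IGNORED_EXTENSIONS` are hypothetical names for illustration and do not appear in the commit.

```python
from urllib.parse import urlparse

# Same values as the `extensions` list in crawler.py (constant name is an assumption)
IGNORED_EXTENSIONS = ['.jpg', '.jpeg', '.png', '.pdf', '.svg']


def should_skip(url: str) -> bool:
    """Hypothetical helper: True when the URL path ends in an ignored extension."""
    # Compare only the path, so query strings and fragments do not hide the extension
    path = urlparse(url).path.lower()
    return path.endswith(tuple(IGNORED_EXTENSIONS))
```

Checking the parsed path rather than the raw URL means a link such as `https://ontariotechu.ca/files/doc.pdf?x=1` would still be treated as a PDF.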
