~ $ python script/spider.py --config config/prod.cfg --notify ifttt --claimOnly
__ __ __ __
____ ____ ______/ /__/ /_____ __ __/ /_ ______________ __ __/ /__ _____
/ __ \/ __ `/ ___/ //_/ __/ __ \/ / / / __ \______/ ___/ ___/ __ `/ | /| / / / _ \/ ___/
/ /_/ / /_/ / /__/ ,< / /_/ /_/ / /_/ / /_/ /_____/ /__/ / / /_/ /| |/ |/ / / __/ /
/ .___/\__,_/\___/_/|_|\__/ .___/\__,_/_.___/ \___/_/ \__,_/ |__/|__/_/\___/_/
/_/ /_/
Download FREE eBook every day from www.packtpub.com
@see github.com/niqdev/packtpub-crawler
[*] 2017-01-31 10:30 - fetching today's eBooks
[*] configuration file: /app/config/prod.cfg
[*] getting daily free eBook
[*] fetching url... 200 | https://www.packtpub.com/packt/offers/free-learning
[*] fetching url... 200 | https://www.packtpub.com/packt/offers/free-learning
[*] fetching url... 200 | https://www.packtpub.com/account/my-ebooks
[+] book successfully claimed
[+] notification sent to IFTTT
[*] getting free eBook from newsletter
[*] fetching url... 200 | https://www.packtpub.com/packt/free-ebook/practical-data-analysis
[-] <type 'exceptions.IndexError'> list index out of range | spider.py@123
Traceback (most recent call last):
File "script/spider.py", line 123, in main
packtpub.runNewsletter(currentNewsletterUrl)
File "/app/script/packtpub.py", line 160, in runNewsletter
self.__parseNewsletterBookInfo(soup)
File "/app/script/packtpub.py", line 98, in __parseNewsletterBookInfo
title = urlWithTitle.split('/')[4].replace('-', ' ').title()
IndexError: list index out of range
[+] error notification sent to IFTTT
[*] done
~ $
It has successfully claimed the book from the newsletter already, but on subsequent days I'm getting the above error.
It has successfully claimed the book from the newsletter already, but on subsequent days I'm getting the above error.
And it sends an IFTTT notification for the second one :(