| File | Last commit | Date |
| --- | --- | --- |
| cleanit.sh | Initial commit | 2022-01-30 04:53:28 -05:00 |
| fetchmap.sh | Initial commit | 2022-01-30 04:53:28 -05:00 |
| gencaption.py | Initial commit | 2022-01-30 04:53:28 -05:00 |
| gencsv.py | update README | 2022-07-10 00:42:21 -04:00 |
| pullit_atoz.py | pull ids from collections instead of sitemap | 2022-07-09 23:00:04 -04:00 |
| pullit_series.sh | Initial commit | 2022-01-30 04:53:28 -05:00 |
| pullit.py | pullit: add Python version | 2022-06-27 01:06:26 -04:00 |
| pullit.sh | Initial commit | 2022-01-30 04:53:28 -05:00 |
| README.md | add scraped data for 20230524 and 20230527 | 2023-05-27 18:22:50 -04:00 |
| runscrape.sh | pull ids from collections instead of sitemap | 2022-07-09 23:00:04 -04:00 |

Scripts for scraping metadata from Disney+.

Usage:

./runscrape.sh scrape_outdir
./gencsv_all_lang.sh scrape_outdir
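
The two usage commands above can also be driven from one small Python wrapper, which may be handy for dated scrape runs. This is only a sketch: the script names are copied from the usage lines, while `build_commands`, `default_outdir`, `run_all`, and the `scrape_YYYYMMDD` directory naming are hypothetical and not part of this repository.

```python
import subprocess
from datetime import date


def build_commands(outdir):
    """Return the two usage commands as argument lists for one output directory."""
    return [
        ["./runscrape.sh", outdir],
        ["./gencsv_all_lang.sh", outdir],
    ]


def default_outdir(today=None):
    """Assumed naming scheme (not prescribed by the README): scrape_YYYYMMDD."""
    today = today or date.today()
    return f"scrape_{today:%Y%m%d}"


def run_all(outdir, repo_dir="."):
    """Run both steps in order from repo_dir; check=True stops at the first failure."""
    for cmd in build_commands(outdir):
        subprocess.run(cmd, cwd=repo_dir, check=True)


# Example (run from a checkout of the repository):
# run_all(default_outdir())
```

Building the commands as argument lists, rather than a single shell string, avoids quoting problems if the output directory name contains spaces.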

Scraped output (2023-05-27):

Scraped output (2023-05-24):

Scraped output (2022-10-14):

Scraped output (2022-09-10):

Scraped output (2022-08-07):

Scraped output (2022-07-09):

Scraped output (2022-06-27):

Scraped output (en-US, 2022-01-30):