Commit Graph

  • 2a4af674ef Merge pull request #30 from bellingcat/change-default-limit main Tristan Lee 2024-03-11 18:54:51 -05:00
  • 3f4fd2606b Bump version Galen Reich 2024-03-08 14:50:07 +00:00
  • d2fb0cc484 change video limit to avoid error Galen Reich 2024-03-08 14:44:55 +00:00
  • 18e2c4de3e Merge pull request #26 from bellingcat/error-module Tristan Lee 2023-12-08 05:19:17 -06:00
  • 6369e9579a used updated Playwright error module in import v2.0.3 Tristan Lee 2023-12-08 05:13:03 -06:00
  • efa3a47984 changed publishing workflow file to hopefully avoid making a new PyPi package every push to main, and incremented version patch number v2.0.2 Tristan Lee 2023-09-21 06:14:59 -05:00
  • 8a416c098d Merge pull request #23 from bellingcat/workflow-tests Tristan Lee 2023-09-21 06:00:11 -05:00
  • c7f2db1f9d no way to robustly set Windows files to readonly, so removed those tests Tristan Lee 2023-09-21 05:51:28 -05:00
  • 89d89521fa added headed argument to more robustly handle issues with scrapers headless mode Tristan Lee 2023-09-19 16:30:13 -05:00
  • 0bd87f944e removed tmate from windows workflow Tristan Lee 2023-09-19 00:52:12 -05:00
  • d2a0e3d5ad updated README from main Tristan Lee 2023-09-19 00:51:05 -05:00
  • fee300d4d7 reorganized output directory parsing tests Tristan Lee 2023-09-19 00:45:56 -05:00
  • e548b6fca9 Update README.md (fixed typo) Tristan Lee 2023-09-15 16:15:19 -05:00
  • 0b273cb7bd Updated README.md to include Playwright installation command Tristan Lee 2023-09-15 16:14:58 -05:00
  • 7603d9c769 attempting to debug Windows workflow Tristan Lee 2023-09-15 02:55:12 -05:00
  • 8e10c93e31 made the process_output_dir function more reliable on Windows Tristan Lee 2023-09-15 02:43:30 -05:00
  • fc61489def added tests for Windows environment Tristan Lee 2023-09-15 01:31:37 -05:00
  • 847fcb55cb Merge branch 'main' into workflow-tests Tristan Lee 2023-09-15 01:27:07 -05:00
  • 4836fd93aa updated README with playwright installation command, added pytest workflow Tristan Lee 2023-09-15 01:26:51 -05:00
  • ea4da1b700 Merge pull request #22 from bellingcat/adding-token-video Tristan Lee 2023-09-12 11:29:17 -05:00
  • 92ae29c722 updated version Tristan Lee 2023-09-12 11:26:07 -05:00
  • b916512bde removed auth module and authorization, since msToken isnt actually required to run scraper Tristan Lee 2023-09-11 21:43:33 -05:00
  • 92861e0e5d configured verbosity argument with logging level Tristan Lee 2023-09-11 21:29:37 -05:00
  • 6fa1e5026c made downloading more robust against transient and permanent errors, fixed issue where media file URLs weren't being updated after scraping Tristan Lee 2023-09-09 00:42:56 -05:00
  • 1f4b956ce9 made scraping more robust against transient playwright exceptions, set order of hashtags to scrape based on file modified time Tristan Lee 2023-09-07 11:18:22 -05:00
  • 91a8aaef38 added video link to msToken input, improved handling of output directories without write permission (and added relevant unit test), removed unused requirements.txt things Tristan Lee 2023-09-06 19:51:16 -05:00
  • 6a56c354e1 Update README.md Tristan Lee 2023-09-06 13:17:27 -05:00
  • 900d6adc69 Merge pull request #20 from bellingcat/refactor Tristan Lee 2023-09-06 09:53:57 -05:00
  • 10821e30f2 preparing for publishing (removed pipenv commands from workflow, added Contributing section on README, added functionality to pin dependency versions with requirements.txt) Tristan Lee 2023-09-06 09:51:31 -05:00
  • 8c32a3cf16 updated README, made yt-dlp downloading more robust against errors, changed name of videos folder to media (since images and audio files are also downloaded now) Tristan Lee 2023-09-04 13:51:28 -05:00
  • 5ae9624968 added tests, changed __main__ to cli Tristan Lee 2023-09-04 13:26:38 -05:00
  • 0f8e865bf3 added type hints for auth, incorporated auth into base module Tristan Lee 2023-09-04 10:40:30 -05:00
  • cf575e6cf6 updated README and added authorization Tristan Lee 2023-09-01 18:33:32 -05:00
  • a7bd023c21 simplified downloading logic (methods for keeping track of files less necessary since scraping can be done in Python), added functionality to use yt-dlp to download videos, added functionality to download TikTok image galleries Tristan Lee 2023-09-01 17:05:13 -05:00
  • 06b4a74c7d Update README.md Miguel Sozinho Ramalho 2023-03-16 09:48:50 +00:00
  • 7b63b9f349 Bump version to v1.0.4 for release v1.0.4 msramalho 2023-03-13 10:08:42 +00:00
  • e1ac3b5057 fixing not founds msramalho 2023-03-13 10:08:35 +00:00
  • f962878354 Bump version to v1.0.3 for release v1.0.3 msramalho 2023-03-13 09:54:14 +00:00
  • 6b4ceaae61 attempts at fixing CLI issues #18 msramalho 2023-03-13 09:54:06 +00:00
  • c4aa5a6cc5 Bump version to v1.0.2 for release v1.0.2 msramalho 2023-02-13 16:55:27 +00:00
  • ad9cac8cdd readme fix msramalho 2023-02-13 16:54:20 +00:00
  • 2968ada6c8 Bump version to v1.0.1 for release v1.0.1 msramalho 2023-02-13 16:52:23 +00:00
  • 4f81673f04 Bump version to v1.0.0 for release v1.0.0 msramalho 2023-02-13 16:49:54 +00:00
  • 14eaae0f20 Merge pull request #17 from rly0nheart/main Miguel Sozinho Ramalho 2023-02-13 16:49:02 +00:00
  • 980a27ff96 pypi fixes msramalho 2023-02-13 16:48:26 +00:00
  • 83fe050c15 Update run_downloader.py Richard Mwewa 2023-01-19 03:41:10 +02:00
  • f8c12a8d68 Update hashtag_frequencies.py Richard Mwewa 2023-01-19 03:40:37 +02:00
  • 99467f0e91 Update README.md Richard Mwewa 2023-01-19 03:35:34 +02:00
  • fb4755244f Update main.py Richard Mwewa 2023-01-19 03:32:04 +02:00
  • 5df653ccef Create Dockerfile Richard Mwewa 2023-01-19 03:29:10 +02:00
  • 4c69f616e6 Create setup.py Richard Mwewa 2023-01-19 03:26:38 +02:00
  • 9dd22c90c7 Create main.py Richard Mwewa 2023-01-19 03:17:11 +02:00
  • 5f4eb9f2c8 Refactored for PyPI Richard Mwewa 2023-01-19 03:15:28 +02:00
  • 1409c50034 Refactored for PyPI Richard Mwewa 2023-01-19 03:13:20 +02:00
  • a0f4320635 Update README.md johannawild 2022-05-16 13:14:40 +02:00
  • c3d9b415c6 Update README.md johannawild 2022-05-16 13:13:57 +02:00
  • db08aacab5 Update README.md johannawild 2022-05-16 13:12:54 +02:00
  • 99a0b16d66 Update README.md johannawild 2022-05-16 13:11:50 +02:00
  • 26b4bcc00d Update README.md johannawild 2022-05-10 16:34:24 +02:00
  • 5866763adc Update README.md johannawild 2022-05-10 16:34:05 +02:00
  • 2c345fa27a Update README.md johannawild 2022-05-10 16:33:38 +02:00
  • 41007a8fa6 changed filehandler to debug level to capture logged data X 2022-05-06 12:20:00 +02:00
  • 280303f461 changed filehandler level to INFO and changed Logger to files X 2022-05-06 11:54:00 +02:00
  • 161699d2b9 Update README.md johannawild 2022-05-06 11:24:42 +02:00
  • 474e39568b Merge pull request #5 from bellingcat/even_more_tristan_edits johannawild 2022-05-06 10:42:40 +02:00
  • e0f55145e1 fixed typo in error message Tristan Lee 2022-05-06 03:39:33 -05:00
  • 52338d47de Add total posts to the hashtag_frequencies console printing X 2022-05-06 10:25:10 +02:00
  • 21b404ff57 renamed source directory Tristan Lee 2022-05-06 03:13:40 -05:00
  • f377408960 updated README with new hashtag_frequencies table Tristan Lee 2022-05-06 02:57:56 -05:00
  • 6bddcfb238 modified formatting of print_occurrences function Tristan Lee 2022-05-06 02:56:38 -05:00
  • f77214c71f fixed typo in hashtag_frequencies.plot Tristan Lee 2022-05-06 02:49:46 -05:00
  • 595a6e6535 specified filepath argument in tiktok-scraper to avoid chdir commands Tristan Lee 2022-05-06 02:36:21 -05:00
  • 0cb9d4b1b9 made docstrings more consistent, changed argument of hashtag_frequencies script to use the hashtag rather than the post_id file for the hashtag, to make it easier to use Tristan Lee 2022-05-06 01:49:55 -05:00
  • be05ea0fe2 fixed problems with type hints, clarified documentation Tristan Lee 2022-05-05 20:50:54 -05:00
  • f918f06c28 check formatting using black and add type hinting to functions X 2022-05-05 19:10:37 +02:00
  • 4d18522a54 incorporate Tristan's suggestions and further fix logging X 2022-05-05 17:07:33 +02:00
  • bffc8f85a3 Merge pull request #4 from bellingcat/more_tristan_edits johannawild 2022-05-05 12:58:06 +02:00
  • 9d6b62f731 Update LICENSE johannawild 2022-05-05 12:48:54 +02:00
  • af5bcc9433 fixed typo in Windows venv activation command Tristan Lee 2022-05-05 02:58:42 -05:00
  • cd883eeeb1 minor fixes in the README and LICENSE Tristan Lee 2022-05-05 02:39:23 -05:00
  • 64354f6099 Updated plot figure in README Tristan Lee 2022-05-05 02:32:32 -05:00
  • 14c52e5d75 simplified logging, used warnings.warn and calling exceptions rather than logging them, various code cleanups and clarifications Tristan Lee 2022-05-05 02:23:50 -05:00
  • a54a233766 Update data_methods.py johannawild 2022-05-04 20:32:46 +02:00
  • feb827d317 Update file_methods.py johannawild 2022-05-04 17:11:01 +02:00
  • 1a96807d60 Merge pull request #3 from bellingcat/tristan_edits johannawild 2022-05-04 17:01:48 +02:00
  • 34a7c432a3 Merge branch 'main' into tristan_edits johannawild 2022-05-04 17:00:41 +02:00
  • 0126f36107 Update README.md johannawild 2022-05-04 16:55:37 +02:00
  • b3a8fd6a9e Update README.md johannawild 2022-05-04 16:53:48 +02:00
  • 858835c881 Update README.md johannawild 2022-05-04 16:52:22 +02:00
  • 52d37d9ff8 merged changes Tristan Lee 2022-05-04 01:31:39 -05:00
  • 234b763f49 Update README.md johannawild 2022-05-04 00:44:04 +02:00
  • ed15e3b6d7 Update README.md johannawild 2022-05-04 00:42:27 +02:00
  • fa2f113b42 Update README.md johannawild 2022-05-04 00:41:42 +02:00
  • 0b15617b5c Update README.md johannawild 2022-05-04 00:41:13 +02:00
  • 304293046a Update README.md johannawild 2022-05-04 00:40:47 +02:00
  • 24e5828ec9 Update README.md johannawild 2022-05-04 00:40:10 +02:00
  • 137be84305 Update README.md johannawild 2022-05-04 00:39:49 +02:00
  • 3e0dd154d8 Update README.md johannawild 2022-05-04 00:38:27 +02:00
  • ccd185aa87 Update README.md johannawild 2022-05-04 00:37:49 +02:00
  • dbf6eb595e Update README.md johannawild 2022-05-04 00:36:38 +02:00