-
-
Notifications
You must be signed in to change notification settings - Fork 110
Comparing changes
Open a pull request
base repository: webrecorder/browsertrix-crawler
base: v1.6.1
head repository: webrecorder/browsertrix-crawler
compare: v1.6.2
- 7 commits
- 12 files changed
- 2 contributors
Commits on May 12, 2025
-
- validate --lang values, fail immediately with invalid iso-639-1 country code - ignore --lang value when using profile, print warning that profile language takes precedence - fixes #833
Configuration menu - View commit details
-
Copy full SHA for 71de8d6 - Browse repository at this point
Copy the full SHA 71de8d6View commit details
Commits on May 20, 2025
-
Add WARC-Protocol header (#715)
- add WARC-Protocol repeated header(s) for HTTP, TLS as per iipc/warc-specifications#42 - also set HTTP/1.0 on WARC record if actually http/1.0, otherwise keep HTTP/1.1 --------- Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>
Configuration menu - View commit details
-
Copy full SHA for e72b343 - Browse repository at this point
Copy the full SHA e72b343View commit details
Commits on May 28, 2025
-
tmpdir: use os.tmpdir() instead of hardcoded '/tmp' (#842)
allows for customizing tmp directory with TMPDIR env var
Configuration menu - View commit details
-
Copy full SHA for 52235ab - Browse repository at this point
Copy the full SHA 52235abView commit details -
Remove hardcoded /tmp prefix from path (#843)
Fast-follow to #842 to fix a typo
Configuration menu - View commit details
-
Copy full SHA for 46a02d1 - Browse repository at this point
Copy the full SHA 46a02d1View commit details -
optimization: normalize dedup status: treat 0 (response code not yet …
…known) or 206 as 200… (#835) Avoids fetching duplicate content when fetched through different code path (eg. autoplay behavior calling fetch, vs video playing automatically)
Configuration menu - View commit details
-
Copy full SHA for 7bf10f7 - Browse repository at this point
Copy the full SHA 7bf10f7View commit details
Commits on May 29, 2025
-
remove early serialization which may result in missing WARC-Protocol …
…and security metadata (#844) - drop early serialization in handleFetchResponse(), can result in writing WARC record too early, before the WARC-Protocol and other data is available. (Added previously for requests loaded via browser context / service worker which did not get a 'loadingFinished' message, but now these will still be closed in awaitPageResources()) - don't log 'skipping URL from unknown frame' warning since it is often spurious, since frame can be added in subsequent message and response is *not* skipped.
Configuration menu - View commit details
-
Copy full SHA for 178b10a - Browse repository at this point
Copy the full SHA 178b10aView commit details
Commits on Jun 3, 2025
-
Configuration menu - View commit details
-
Copy full SHA for a5936b5 - Browse repository at this point
Copy the full SHA a5936b5View commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v1.6.1...v1.6.2