anonymous_github

mirror of https://github.com/tdurieux/anonymous_github.git synced 2026-05-15 22:48:00 +02:00

Author	SHA1	Message	Date
tdurieux	9adff11e74	fix(cache): atomic file writes and size-validated cache reads A failed/interrupted GitHub fetch could leave a 0-byte or truncated file in the local cache. Subsequent reads happily streamed the empty content as the file's body — visible to users as an "Empty file" with HTTP 200. Reproduced on artifact-70B6/Lethe/configs.py (#694). - FileSystem.write: stream into a sibling .tmp and rename into place only on finish. Stream errors discard the tmp and leave any prior cached file untouched. Drop the utf-8 encoding that was silently corrupting binary blobs. - GitHubStream.getFileContentCache: accept an expected size and treat cached.size < expected as a poisoned cache (truncated fetch) → rm and re-fetch. cached.size >= expected is accepted, which keeps Git LFS-resolved files (whose FileModel.size is the pointer size) working. - AnonymizedFile: expose size() and pass it through to the streamer alongside sha so the cache check has the upstream size. Existing poisoned entries self-heal on next access. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-05 08:47:41 +03:00
tdurieux	f0f6436370	feat: resolve Git LFS pointers via the raw URL endpoint Files tracked by Git LFS used to come out as the pointer text: version https://git-lfs.github.com/spec/v1 oid sha256:... size ... …because GitHub's blob API returns the pointer, not the resolved content. Detect that prefix on the first ~150 bytes of the blob stream and switch to a fresh fetch via the web raw URL (github.com/<owner>/<repo>/raw/<commit>/<path>), which auto-redirects to media.githubusercontent.com and resolves the LFS object — auth header carries through. Non-LFS files are forwarded through the existing pipeline unchanged. Fixes #95.	2026-05-04 12:18:55 +02:00
tdurieux	a5f66d6844	multiple fixes	2026-05-03 15:30:54 +02:00
Thomas Durieux	f4209110c7	Fix all 93 ESLint issues (3 errors, 90 warnings) (#666 )	2026-04-15 09:04:22 +02:00
Thomas Durieux	655ae92c4c	Remove OpenTelemetry tracing infrastructure (#662 )	2026-04-15 04:39:08 +02:00
Thomas Durieux	f3641c8ce3	Set up CI with ESLint linter and Mocha test runner (#661 )	2026-04-15 04:34:03 +02:00
tdurieux	dcf483ea03	feat: improve download anonymized repository	2024-05-06 11:52:32 +02:00
tdurieux	17abc47d08	fix: fix webview on root repo	2024-04-28 08:08:39 +01:00
tdurieux	a86e050f8b	fix: handle empty repository	2024-04-26 13:48:32 +01:00
tdurieux	710f7328e7	feat: flatten file tree for better performance	2024-04-26 10:32:09 +01:00
tdurieux	1d4bab7866	fix: fix webview & improve download progress	2024-04-03 18:25:33 +01:00
tdurieux	db67f53b2c	fix: fix GitHubDownload	2024-04-03 13:24:34 +01:00
tdurieux	4d12641c7e	feat: introduce streamers that handle the stream and anonymization from github	2024-04-03 11:13:01 +01:00

13 Commits