mirror of
https://github.com/thegeneralist01/archivr
synced 2026-05-30 08:36:47 +02:00
Revert "feat: add generic media source handling and local file archiving"
This commit is contained in:
parent
cd7dfd7c8a
commit
9e38c18bd4
8 changed files with 22 additions and 2252 deletions
|
|
@ -20,7 +20,7 @@ An open-source self-hosted archiving tool. Work in progress.
|
|||
- [ ] Dropbox
|
||||
- [ ] OneDrive
|
||||
- (Some of these could be postponed for later.)
|
||||
- [X] Archiving Twitter threads
|
||||
- [ ] Archiving Twitter threads
|
||||
- [ ] Archive web pages (HTML, CSS, JS, images)
|
||||
- [ ] Archiving emails (???)
|
||||
- [ ] Gmail
|
||||
|
|
@ -45,14 +45,5 @@ There are two driving factors behind this project:
|
|||
|
||||
This project aims to provide a reliable solution for archiving important data from various sources, ensuring that users can preserve their digital assets for the long term.
|
||||
|
||||
## Twitter/X Archive Inputs
|
||||
- Tweet content TOML: `tweet:ID`, `x:tweet:ID`, `x:x:ID`, `twitter:x:ID`, `twitter:tweet:ID`
|
||||
- Tweet media/video: `tweet:media:ID`
|
||||
- Thread TOML content: `x:thread:ID`, `twitter:thread:ID`
|
||||
|
||||
Tweet and thread TOMLs are stored directly in `raw_tweets/`. Downloaded tweet media and avatars are re-archived into the hashed `raw/` store, and the TOMLs point at those archived files using store-relative `raw/...` paths.
|
||||
|
||||
Twitter tweet/thread scraping requires `ARCHIVR_TWITTER_CREDENTIALS_FILE` to point to a cookies file for the vendored scraper.
|
||||
|
||||
## License
|
||||
This project is licensed under the MIT License. See the [LICENSE](LICENSE.md) file for details.
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue