homepages.news¶
An open-source archive that gathers, saves, shares and analyzes news homepages
Features¶
Directory¶
About¶
The archive at homepages.news is an open-source software project managed by Ben Welsh. Each day it gathers screenshots, accessibility trees, hyperlink lists, robots.txt and Lighthouse audits from hundreds of news homepages around the world. It also ensures that all sites are routinely saved by the Wayback Machine at archive.org.
The assets are archived in a permanent collection at the Internet Archive. The latest screenshots, analysis and data are published here, as well as on Mastodon.
The system supports the creation of bots to post a newsroom’s latest screenshots into a private Slack channel. The tool is used by organizations in the U.S. and abroad to save and share images each day.
Contributing¶
Links¶
Internet Archive: archive.org/details/news-homepages
Mastodon: @newshomepages
Task runner: github.com/palewire/news-homepages/actions
Packaging: pypi.org/project/newshomepages