Blog
Product-focused articles without fluff.
- Schedule settings: daily auto-run and background collection
README: Schedule settings turn on the daily auto-run and set the parameters for background collection.
- DOCX and app.py: command-line flags for one-off reports
README: app.py requires a query; examples include max age, max items, min text length, and DOCX output path.
- File exports: history, download links, and JSON preview
README: File exports lists JSON/DOCX history, download links, and JSON preview on the export record.
- JSON export: three modes on the news collection management page
README: export JSON from the collection management UI in one of three modes: last successful run, single date, or date range.
- News list in the admin: browse, search, and filter collected items
README: the News section supports viewing/searching/filtering collected news items.
- Collection logs: what stages NewsPulse records
README: detailed collection steps, filtering, and errors appear in Collection logs and in the run record.
- Collection runs: history, status, and manual runs from the admin
README: Collection runs shows the history of runs and lets you start a collection from the admin UI.
- Keywords and an empty main query in the schedule
README: keywords are added manually; the main schedule query may be left empty so that only enabled keywords drive collection.
- Yandex RSS returns a login page: how NewsPulse skips the source
README behaviour: when a challenge page comes back instead of XML, the source is skipped. Covers how to diagnose network or proxy issues.
- National wire RSS feeds: disable or remove a source in the admin
README: extra federal feeds appear after migration; in News sources you can disable or delete entries. Only the modes the admin actually provides are covered.
- Adding a Google News RSS source in NewsPulse
How Google News works as a news source type in the admin UI, including the proxy chain documented for Google News RSS in the README.
- Google News RSS: order of direct access and proxies in NewsPulse
How NewsPulse tries Google News RSS, per the project README: direct access first, then the source proxy, then NEWSPULSE_RSS_PROXY, then the fallback URLs.
- GOOGLE_TRY_DIRECT_FIRST and RSS proxy fallbacks for Google News
Environment variables that tune Google News RSS in NewsPulse: turn off direct-first and provide comma-separated fallback proxy URLs.
- Publishing queue: VK, Telegram, and Odnoklassniki
Queued posts, templates, and result logs: how NewsPulse connects to social networks.
- NEWSPULSE_RSS_PROXY: shared proxy when a source has no proxy URL
When to set NEWSPULSE_RSS_PROXY on the server: one outbound path for RSS where the source row leaves the proxy field empty.
- Per-source proxy for RSS and article pages in NewsPulse
Why NewsPulse lets you set a proxy on each news source: corporate networks, different rules per feed, and fetching article HTML without relying on one global proxy.
- socks5:// proxies and why PySocks is mentioned in the README
NewsPulse supports SOCKS5 proxy URLs for RSS and article fetching; the README asks you to install PySocks so requests can use socks5:// endpoints.
- RIA/TASS sources: query filter on RSS title and description
Per the README, for some additional sources such as RIA and TASS, NewsPulse keeps an RSS item only if the query appears in its title or description.
- RSS news collection: filters, Google News and Yandex.News feeds
How NewsPulse pulls RSS feeds and filters items by freshness and text length. Only features that exist in the product today are covered.
- Custom RSS feed URL: adding your own source
NewsPulse lets you attach RSS feeds by URL alongside built-in source types, using per-source proxy settings and NEWSPULSE_RSS_PROXY when the field is empty.
- Install NewsPulse on your server: archive, Linux, MySQL or SQLite
What is inside NewsPulse-web-archive.zip and how the Linux install script works, with no fake cloud SaaS promises.
- Yandex News RSS as a source: what NewsPulse documents
Using Yandex News RSS in NewsPulse, including the README note that some networks may receive a challenge page instead of XML.
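
Several of the articles above mention a feed URL returning a challenge or login page instead of XML. As a rough illustration of how to spot that case yourself when diagnosing a network, here is a minimal sketch. It is not NewsPulse's code: the function name `looks_like_feed` and the sample bodies are hypothetical, and the check only asks whether the response parses as XML with a feed-like root element.

```python
import xml.etree.ElementTree as ET

# Root element names used by RSS 2.0, Atom, and RSS 1.0 (RDF) feeds.
FEED_ROOT_TAGS = {"rss", "feed", "RDF"}


def looks_like_feed(body: bytes) -> bool:
    """Return True if body parses as XML and its root looks like a feed.

    Hypothetical helper for illustration; NewsPulse may detect this differently.
    """
    try:
        root = ET.fromstring(body)
    except ET.ParseError:
        # Typical HTML challenge pages are not well-formed XML.
        return False
    # Drop a namespace prefix such as "{http://www.w3.org/2005/Atom}feed".
    local_name = root.tag.rsplit("}", 1)[-1]
    return local_name in FEED_ROOT_TAGS


# Made-up sample bodies: a tiny RSS document and a well-formed HTML page.
rss_body = (
    b'<?xml version="1.0"?>'
    b'<rss version="2.0"><channel><title>t</title></channel></rss>'
)
challenge_body = (
    b"<!DOCTYPE html><html><head><title>Are you a robot?</title></head>"
    b"<body><form></form></body></html>"
)

print(looks_like_feed(rss_body))
print(looks_like_feed(challenge_body))
```

If the check fails for a URL that works from another network, the proxy settings described in the articles above are the first thing to look at.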