Projects with this topic
Postgres DB + Crawlers and Scrapers for Apache Hop + Img Proxy + FastAPI
Download high-quality media from Instagram.
Plain text boilerplate removal using character n-gram frequency across a corpus. Builds a template model from a sample, filters files in a single linear pass, and validates automatically. Includes an obfuscated mode where the model is a set of integers and output filenames are hashed: the operator never reads the content. AWK for character processing, Bash for orchestration, Lisp layer planned for positional classification.
Advanced cross-platform IP Rotation Tool using the Tor Network. Includes interactive UI, proxy scanning, and Docker support.
Export and generation of flashcard decks for studying for the driving licence exam
CLI to notify consumers of available Global Entry appointments
Python web scraping project that extracts book information from HTML pages using BeautifulSoup and stores the data in a SQLite database. An educational project focused on scraping, data structuring, and automation.
ModularGrid-based scraper for some Eurorack resellers
Part of a search engine: a crawler bot that wanders the web and downloads text
Bug fixes for the abandoned Python Wikipedia project to warn the user when the Wikipedia suggestion engine is corrupting the titles of valid Wikipedia articles. Required for the examples in Natural Language Processing in Action, 2nd Edition by Maria Dyshel and Hobson Lane (and a community of more than 30 contributing authors and editors).
NeuralNiche is a modern full-stack SaaS platform that delivers weekly validated AI and micro-SaaS ideas directly to builders who want to ship fast and build profitably. Built with Next.js 15, Flask, and PostgreSQL, it features a stunning responsive landing page with glassmorphism design, smooth Framer Motion animations, and a robust waitlist system with email/WhatsApp integration. The platform uses a 12-point validation framework to analyze engagement patterns across 50+ platforms like Reddit, Twitter, and ProductHunt, delivering only the top 3-5 ideas (validation score 70+) with detailed execution playbooks, revenue models, and source links. With comprehensive form validation, real-time error handling, SEO optimization, and professional UI/UX, NeuralNiche transforms the way indie makers and entrepreneurs discover and validate their next big idea, moving them from endless scrolling to profitable building.
Tool to fetch favorite publications from a rule34.xxx page and list them in a CSV file
Experimental WordPress plugin for scraping chess data from websites such as chess-results.com, fide.ratings.com and chess.sk. Serves as an example of using the Simple HTML DOM library for scraping in WordPress.
A RAG application that keeps you updated on AI news scraped from the MIT AI newsletter website
A bash script that automatically downloads the latest news videos from Nordic public broadcasters for convenient viewing on a kitchen TV or similar setup.
Since avherald.com does not offer a RESTful API, an RSS feed, or any other way to get data without visiting the website, this Python 3 script extracts the data for you using web scraping.
This project is an API built on Django and Django REST Framework that returns a list of news items. Each news item includes a title, text, tags, and a source. News can also be filtered by tags, by included keywords, and by excluded keywords. The project covers the design of the database models and unit tests to verify correct behavior.