dev-resources.site

The Tech News Scraper

Published at
12/29/2024
Categories
brightdatachallenge
devchallenge
webdev
api
Author
chethanyadav

This is a submission for the Bright Data Web Scraping Challenge: Scrape Data from Complex, Interactive Websites

What I Built

This project scrapes the latest technology news and updates from news websites. It is built with JavaScript and Node.js, and uses Puppeteer with the Bright Data Scraping Browser to handle dynamic content. It scrapes data from two major websites:

  1. Artificial Intelligence News
  2. The Hacker News
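
As a rough illustration of how Puppeteer can drive a remote browser such as the Scraping Browser, here is a minimal sketch. It is not the code from the repository: the function name `scrapeHeadlines`, the CSS selector, and the shape of the returned objects are assumptions made for this example. Puppeteer is passed in as an argument so the function can be tried with either `puppeteer` or `puppeteer-core`.

```javascript
// Minimal sketch (not the repository's code): connect Puppeteer to a remote
// browser over WebSocket and collect link text and URLs from a page.
// `scrapeHeadlines` and the selector passed to it are hypothetical.
async function scrapeHeadlines(puppeteer, wsEndpoint, url, selector) {
  // puppeteer.connect attaches to an already-running browser instead of
  // launching a local Chromium.
  const browser = await puppeteer.connect({ browserWSEndpoint: wsEndpoint });
  try {
    const page = await browser.newPage();
    await page.goto(url, { waitUntil: "domcontentloaded", timeout: 60000 });
    // The callback runs inside the page and maps each matching element
    // to a plain, serializable object.
    return await page.$$eval(selector, (nodes) =>
      nodes.map((node) => ({
        title: node.textContent.trim(),
        url: node.href,
      }))
    );
  } finally {
    // Always release the remote session, even if navigation fails.
    await browser.close();
  }
}

// Possible usage (assumes puppeteer-core is installed, BROWSER_WS is set,
// and that "a.story-link" matches article links, which is a guess):
//
//   const puppeteer = require("puppeteer-core");
//   scrapeHeadlines(puppeteer, process.env.BROWSER_WS,
//     "https://thehackernews.com/", "a.story-link").then(console.log);
```

Closing the browser in a `finally` block matters more with a remote browser than a local one, since an abandoned session may stay open on the provider's side.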

Demo

You can view the source code and instructions for running the project on GitHub.

Articles display webpage

How I Used Bright Data

I used Bright Data’s Scraping Browser to handle JavaScript-heavy, interactive websites that load their content dynamically. The project scrapes up-to-date data, including titles, descriptions, URLs, images, and published dates. The Scraping Browser kept the scraping process running smoothly without extra setup overhead on my side.
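
To make the scraped fields concrete, the sketch below normalizes one raw record into the title, description, URL, image, and published date listed above. The helper name `normalizeArticle`, the property names, and the cleanup rules are assumptions for illustration, not the project's actual data model.

```javascript
// Hypothetical normalizer for one scraped record. The field list follows the
// paragraph above; the property names and rules are assumptions.
function normalizeArticle(raw, baseUrl) {
  const clean = (value) => (typeof value === "string" ? value.trim() : "");

  // Resolve relative links against the page they were found on.
  const absolute = (value) => {
    const text = clean(value);
    if (!text) return null;
    try {
      return new URL(text, baseUrl).href;
    } catch {
      return null; // not a usable URL
    }
  };

  // ISO date strings parse reliably; other formats depend on the engine,
  // so anything that fails to parse is stored as null.
  const date = new Date(clean(raw.publishedAt));

  return {
    title: clean(raw.title),
    description: clean(raw.description),
    url: absolute(raw.url),
    image: absolute(raw.image),
    publishedAt: Number.isNaN(date.getTime()) ? null : date.toISOString(),
  };
}
```

Storing absolute URLs and ISO timestamps keeps records from different sites comparable, which helps if they are later displayed or stored together.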

Challenge Prompt: Bright Data Web Scraping Challenge

Installation

  1. Clone the repository and enter it
git clone https://github.com/chethanyadav456/Scraping_Master.git
cd Scraping_Master
  2. Install dependencies
npm install
  3. Create a .env file and add:
MONGO_URI=
BROWSER_WS=
  4. Run the project
node master.js
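
The post does not show how the two .env variables are read, so the following is only a sketch of one way to validate them at startup. `loadConfig` is a hypothetical helper. It assumes the values are already present in `process.env`, for example via the dotenv package or Node's `--env-file` flag (Node 20.6 or later), and that BROWSER_WS holds a WebSocket URL.

```javascript
// Sketch: fail fast when a required variable is missing or malformed.
// `loadConfig` is a hypothetical helper, not part of the repository.
const REQUIRED_VARS = ["MONGO_URI", "BROWSER_WS"];

function loadConfig(env = process.env) {
  const missing = REQUIRED_VARS.filter(
    (name) => typeof env[name] !== "string" || env[name].trim() === ""
  );
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }

  const mongoUri = env.MONGO_URI.trim();
  const browserWs = env.BROWSER_WS.trim();

  // Assumption: the browser endpoint is a ws:// or wss:// URL.
  let protocol = null;
  try {
    protocol = new URL(browserWs).protocol;
  } catch {
    protocol = null;
  }
  if (protocol !== "ws:" && protocol !== "wss:") {
    throw new Error("BROWSER_WS must be a ws:// or wss:// URL");
  }

  return { mongoUri, browserWs };
}
```

Calling something like `loadConfig()` at the top of the entry script would surface a missing value as a clear error message instead of a connection failure later on.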

License

This project is licensed under the MIT License; see the LICENSE file for details.
