Crawling dev.to with Colly while learning Golang

Published at

11/1/2022

Author

chieund

I would like to share a website I built while learning Golang: http://techdaily.info.
Besides dev.to, it also crawls other websites such as freecodecamp.com, medium.com, hashnode.com, logrocket.com, and infoq.com.
In other words, it is a website that specializes in collecting articles from other sites.
Here are the technologies I used:

Golang
Colly
Nginx
systemd service
Docker
MySQL
GitHub Actions to deploy to the server
Cron job for the daily crawl

Build and Run Locally

Copy app_example.yaml to app.yaml

cp app_example.yaml app.yaml

Build and start the Docker containers

docker-compose up --build

Install the Go packages

docker-compose exec crawl go mod tidy

Create the vendor folder

docker-compose exec crawl go mod vendor

Run the crawler

docker-compose exec crawl go run cmd/main.go

Use air for live reloading

docker-compose exec crawl air -c .air.conf

Deploy

Run the Makefile to build the project into the bin folder

make copy_template build_app_web build_app_crawl

Create a service to run the app in the background

Create the service file for the web app

sudo nano /lib/systemd/system/app_web.service

Paste the following content, then enable and start the service

[Unit]
Description=App Web

[Service]
Type=simple
Restart=always
RestartSec=5s
WorkingDirectory=/root/actions-runner/crawl/crawl/crawl/bin
ExecStart=/root/actions-runner/crawl/crawl/crawl/bin/app_web

[Install]
WantedBy=multi-user.target

sudo systemctl enable app_web
sudo systemctl start app_web
sudo systemctl status app_web

Run the crawl app

./app_crawl
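
The crontab entries in this post call app_crawl with a subcommand, either crawl-article or crawl-article-detail. One possible way to wire that up in Go is sketched below. Only the two subcommand names come from this post; the handler functions are placeholders, and printing a usage line when no subcommand is given is an assumption, since the post does not say what the binary does in that case.

```go
package main

import (
	"fmt"
	"os"
)

// commands maps each subcommand name to its handler.
// The names come from the crontab entries; the handlers are placeholders.
var commands = map[string]func() error{
	"crawl-article":        crawlArticles,
	"crawl-article-detail": crawlArticleDetails,
}

// crawlArticles stands in for the step that collects article links.
func crawlArticles() error {
	fmt.Println("crawling article lists")
	return nil
}

// crawlArticleDetails stands in for the step that fetches each article's content.
func crawlArticleDetails() error {
	fmt.Println("crawling article details")
	return nil
}

// dispatch runs the handler named by args[0].
// With no arguments it prints a usage line (an assumption, see above).
func dispatch(args []string) error {
	if len(args) == 0 {
		fmt.Println("usage: app_crawl <crawl-article|crawl-article-detail>")
		return nil
	}
	cmd, ok := commands[args[0]]
	if !ok {
		return fmt.Errorf("unknown command %q", args[0])
	}
	return cmd()
}

func main() {
	if err := dispatch(os.Args[1:]); err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1) // a non-zero exit code makes the failure visible to cron
	}
}
```

Keeping both crawl steps in one binary means each cron entry differs only in its last argument.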

Add the crontab entries

crontab -e

Add the cron schedule (article lists every hour, article details every 20 minutes)

0 * * * * /root/actions-runner/crawl/crawl/crawl/bin/app_crawl crawl-article
*/20 * * * * /root/actions-runner/crawl/crawl/crawl/bin/app_crawl crawl-article-detail

Reload cron

sudo service cron reload

Website

http://techdaily.info/

https://github.com/chieund/crawl
