GoXCrap

X (formerly Twitter) web scrapper, written in Go

GoXCrap

This application collects tweets based on a defined search criteria, and save them in a database.

Set up & run (locally)

Set up

First of all you need to download the Chrome Web Driver that matches with the installed version of Google Chrome (the browser used for testing this project).
You can download it from here, or you can use @puppeteer/browsers with this installation guide.
After that, copy it inside the internal/webdriver folder.
Create a .env file at the root of the project, and add the following environment variables:

EMAIL=<Twitter account email>
USERNAME=<Twitter username>
PASSWORD=<Twitter password>
AHBCC_DOMAIN=<Domain of the application with the endpoint /tweets/v1> --> In this case the app AHBCC

¹

Run

In the root folder, run:

go run cmd/api/main.go --local

Setting up & run (into a Docker container)

Setup

Create a .env file at the root of the project, and add the following environment variables:

EMAIL=<Twitter account email>
USERNAME=<Twitter username>
PASSWORD=<Twitter password>
DRIVER_PATH=<The path to the Chrome driver> --> Example: /usr/bin/chromedriver
BROWSER_PATH=<The path to the Chrome browser> --> Example: /usr/bin/chromium
RABBITMQ_USER=<The RabbitMQ user>
RABBITMQ_PASS=<The RabbitMQ password>
AHBCC_DOMAIN=<Domain of the application with the endpoint /tweets/v1> --> In this case the app AHBCC

¹

Build & Run

docker compose up --build

License

MIT

Logo License

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

The logo was obtained from https://github.com/ashleymcnamara/gophers, but it was slightly modified to be representative for this repository.

AHBCC: Adverse Human Behaviour Corpus Creator. More information here ↩ ↩²

Name		Name	Last commit message	Last commit date
Latest commit History 94 Commits
.github/workflows		.github/workflows
cmd/api		cmd/api
internal		internal
media		media
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
codecov.yml		codecov.yml
docker-compose.yml		docker-compose.yml
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GoXCrap

GoXCrap

Set up & run (locally)

Set up

Run

Setting up & run (into a Docker container)

Setup

Build & Run

License

Logo License

About

Releases

Packages

Languages

License

lhbelfanti/goxcrap

Folders and files

Latest commit

History

Repository files navigation

GoXCrap

GoXCrap

Set up & run (locally)

Set up

Run

Setting up & run (into a Docker container)

Setup

Build & Run

License

Logo License

Footnotes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages