After I deployed the project locally, he couldn't crawl the webpage and kept waiting #340

Open
ahasasjeb opened this issue Jul 1, 2024 · 5 comments

Comments

@ahasasjeb

I deployed the project with Docker and submitted two URLs through the Python SDK, but the queue UI keeps showing the jobs as waiting. Server log:

! Corepack is about to download https://registry.npmjs.org/pnpm/-/pnpm-9.4.0.tgz

> [email protected] start:production /app
> tsc && node dist/src/index.js

Authentication is disabled. Supabase client will not be initialized.
POSTHOG_API_KEY is not provided - your events will not be logged. Using MockPostHog as a fallback. See posthog.ts for more.
Number of CPUs: 2 available
Master 32 is running
Connected to Redis Session Store!
Authentication is disabled. Supabase client will not be initialized.
Authentication is disabled. Supabase client will not be initialized.
POSTHOG_API_KEY is not provided - your events will not be logged. Using MockPostHog as a fallback. See posthog.ts for more.
Number of CPUs: 2 available
Web scraper queue created
Worker 39 started
Worker 39 listening on port 3002
For the UI, open http://0.0.0.0:3002/admin//queues

1. Make sure Redis is running on port 6379 by default
2. If you want to run nango, make sure you do port forwarding in 3002 using ngrok http 3002 
Connected to Redis Session Store!
POSTHOG_API_KEY is not provided - your events will not be logged. Using MockPostHog as a fallback. See posthog.ts for more.
Number of CPUs: 2 available
Web scraper queue created
Worker 41 started
Worker 41 listening on port 3002
For the UI, open http://0.0.0.0:3002/admin//queues

1. Make sure Redis is running on port 6379 by default
2. If you want to run nango, make sure you do port forwarding in 3002 using ngrok http 3002 
Connected to Redis Session Store!
WARNING - You're bypassing authentication
WARNING - You're bypassing authentication
[Playwright] Error fetching url: https://mendable.ai -> AxiosError: Request failed with status code 404
Falling back to fetch
WARNING - You're bypassing authentication
WARNING - You're bypassing authentication
WARNING - You're bypassing authentication
Attempted to access Supabase client when it's not configured.
Error logging crawl job:
 Error: Supabase client is not configured.
    at Proxy.<anonymous> (/app/dist/src/services/supabase.js:38:23)
    at logCrawl (/app/dist/src/services/logging/crawl_log.js:9:14)
    at crawlController (/app/dist/src/controllers/crawl.js:86:40)
    at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
WARNING - You're bypassing authentication
WARNING - You're bypassing authentication
WARNING - You're bypassing authentication
WARNING - You're bypassing authentication
WARNING - You're bypassing authentication
[Playwright] Error fetching url: https://mendable.ai -> AxiosError: Request failed with status code 404
Falling back to fetch
[Playwright] Error fetching url: https://mendable.ai -> AxiosError: Request failed with status code 404
Falling back to fetch
Attempted to access Supabase client when it's not configured.
Error logging crawl job:
 Error: Supabase client is not configured.
    at Proxy.<anonymous> (/app/dist/src/services/supabase.js:38:23)
    at logCrawl (/app/dist/src/services/logging/crawl_log.js:9:14)
    at crawlController (/app/dist/src/controllers/crawl.js:86:40)
    at process.processTicksAndRejections (node:internal/process/task_queues:95:5)

ahasasjeb changed the title from "HELP! After I deployed the project locally, he couldn't crawl the webpage and kept waiting" to "After I deployed the project locally, he couldn't crawl the webpage and kept waiting" on Jul 1, 2024
@leo8198

leo8198 commented Jul 1, 2024

Setting the env variable PLAYWRIGHT_MICROSERVICE_URL=http://playwright-service:3000/html should solve the issue.
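
To rule out a malformed value before restarting the containers, the variable can be sanity-checked with a few lines of Python. This is only a sketch: `check_service_url` is a hypothetical helper, not part of the project, and the checks (scheme, host, non-empty path) are assumptions about what a usable value looks like, based on the URL suggested above. The 404 in the posted log would be consistent with the service being reachable at a wrong path, but that is a guess.

```python
from urllib.parse import urlparse


def check_service_url(value):
    """Return a list of problems found in a PLAYWRIGHT_MICROSERVICE_URL value.

    Hypothetical helper for illustration; an empty list means no problem was spotted.
    """
    if not value:
        return ["value is empty or unset"]

    problems = []
    # Pasting the whole "NAME=value" pair as the value is an easy mistake.
    prefix = "PLAYWRIGHT_MICROSERVICE_URL="
    if value.startswith(prefix):
        problems.append("value still contains the variable name")
        value = value[len(prefix):]

    parsed = urlparse(value)
    if parsed.scheme not in ("http", "https"):
        problems.append("scheme should be http or https")
    if not parsed.hostname:
        problems.append("no host name found")
    if parsed.path in ("", "/"):
        # Assumption: the endpoint path matters, since the suggested URL ends in /html.
        problems.append("no path given; the suggested URL ends in /html")
    return problems
```

Calling `check_service_url(os.environ.get("PLAYWRIGHT_MICROSERVICE_URL", ""))` inside the API container shows what that process actually sees, which may differ from what is written in the env file.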

@rafaelsideguide
Collaborator

@ahasasjeb did this solve the issue?

@ahasasjeb
Author

ahasasjeb commented Jul 2, 2024 via email

@notV3NOM

notV3NOM commented Jul 3, 2024

Facing the same issue.

> Set the env variable PLAYWRIGHT_MICROSERVICE_URL to PLAYWRIGHT_MICROSERVICE_URL=http://playwright-service:3000/html should solve the issue

This is already set.

@gubinjie

gubinjie commented Jul 5, 2024

set USE_DB_AUTHENTICATION=false
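
Both settings suggested in this thread can be checked in one pass over the env file. The sketch below assumes a plain KEY=VALUE file; `parse_env` and `review_settings` are hypothetical helpers written for illustration, and `SAMPLE` simply repeats the two values proposed above.

```python
def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    env = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        # Drop optional surrounding quotes around the value.
        env[key.strip()] = value.strip().strip('"').strip("'")
    return env


def review_settings(env):
    """Return notes about the two settings discussed in this thread."""
    notes = []
    if env.get("USE_DB_AUTHENTICATION", "").lower() != "false":
        notes.append("USE_DB_AUTHENTICATION is not 'false'")
    if not env.get("PLAYWRIGHT_MICROSERVICE_URL"):
        notes.append("PLAYWRIGHT_MICROSERVICE_URL is empty or missing")
    return notes


# Sample content: the two values suggested in the comments above.
SAMPLE = """\
# self-hosted setup without Supabase
USE_DB_AUTHENTICATION=false
PLAYWRIGHT_MICROSERVICE_URL=http://playwright-service:3000/html
"""

print(review_settings(parse_env(SAMPLE)))
```

An empty list only means both values are present in the file. As the earlier comment shows, the jobs can stay in waiting even with the URL set, so this narrows down the configuration rather than confirming a fix.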
