Skip to content

In standby mode, a new crawler is created even though it should not be necessary. #60

Open
@jirispilka

Description

@jirispilka
2025-03-18T15:50:33.520Z INFO  The Actor web server is listening for user requests at https://jiri-spilka--rag-web-browser-task.apify.actor:4321
2025-03-18T15:50:33.521Z INFO  Creating new cheerio crawler with key {"keepAlive":true,"maxRequestRetries":2,"proxyConfiguration":{"isManInTheMiddle":false,"nextCustomUrlIndex":0,"usedProxyUrls":{},"log":{"LEVELS":{"0":"OFF","1":"ERROR","2":"SOFT_FAIL","3":"WARNING","4":"INFO","5":"DEBUG","6":"PERF","OFF":0,"ERROR":1,"SOFT_FAIL":2,"WARNING":3,"INFO":4,"DEBUG":5,"PERF":6},"options":{"level":4,"maxDepth":4,"maxStringLength":2000,"prefix":"ProxyConfiguration","suffix":null,"logger":{"_events":{},"_eventsCount":0,"options":{"skipTime":true}},"data":{}},"warningsOnceLogged":{}},"domainTiers":{},"config":{"options":{},"services":{},"storageManagers":{}},"groups":["GOOGLE_SERP"],"password":"*********","hostname":"10.0.47.179","port":8011,"usesApifyProxy":true},"autoscaledPoolOptions":{"desiredConcurrency":1}}
2025-03-18T15:50:33.521Z INFO  Creating new playwright crawler with key {"headless":true,"keepAlive":true,"maxRequestRetries":1,"proxyConfiguration":{"isManInTheMiddle":false,"nextCustomUrlIndex":0,"usedProxyUrls":{},"log":{"LEVELS":{"0":"OFF","1":"ERROR","2":"SOFT_FAIL","3":"WARNING","4":"INFO","5":"DEBUG","6":"PERF","OFF":0,"ERROR":1,"SOFT_FAIL":2,"WARNING":3,"INFO":4,"DEBUG":5,"PERF":6},"options":{"level":4,"maxDepth":4,"maxStringLength":2000,"prefix":"ProxyConfiguration","suffix":null,"logger":{"_events":{},"_eventsCount":0,"options":{"skipTime":true}},"data":{}},"warningsOnceLogged":{}},"domainTiers":{},"config":{"options":{},"services":{},"storageManagers":{}},"groups":[],"password":"*********","hostname":"10.0.47.179","port":8011,"usesApifyProxy":true},"requestHandlerTimeoutSecs":40,"launchContext":{"launcher":{"_type":"BrowserType","_guid":"browser-type@b0609a9f3b47933ffb3abf0682593d2e"}},"browserPoolOptions":{"fingerprintOptions":{"fingerprintGene... [line-too-long]
2025-03-18T15:50:33.522Z INFO  Creating new cheerio crawler with key {"keepAlive":true,"maxRequestRetries":1,"proxyConfiguration":{"isManInTheMiddle":false,"nextCustomUrlIndex":0,"usedProxyUrls":{},"log":{"LEVELS":{"0":"OFF","1":"ERROR","2":"SOFT_FAIL","3":"WARNING","4":"INFO","5":"DEBUG","6":"PERF","OFF":0,"ERROR":1,"SOFT_FAIL":2,"WARNING":3,"INFO":4,"DEBUG":5,"PERF":6},"options":{"level":4,"maxDepth":4,"maxStringLength":2000,"prefix":"ProxyConfiguration","suffix":null,"logger":{"_events":{},"_eventsCount":0,"options":{"skipTime":true}},"data":{}},"warningsOnceLogged":{}},"domainTiers":{},"config":{"options":{},"services":{},"storageManagers":{}},"groups":[],"password":"*********","hostname":"10.0.47.179","port":8011,"usesApifyProxy":true},"requestHandlerTimeoutSecs":40,"autoscaledPoolOptions":{"desiredConcurrency":5,"maxConcurrency":50,"minConcurrency":3}}
2025-03-18T15:50:33.759Z INFO  Crawler playwright has started 💪🏼
2025-03-18T15:50:33.760Z INFO  Number of crawlers 1
2025-03-18T15:50:33.768Z INFO  Crawler cheerio has started 💪🏼
2025-03-18T15:50:33.769Z INFO  Number of crawlers 2
2025-03-18T15:50:33.771Z INFO  Google-search-crawler has started 🫡
2025-03-18T15:50:33.773Z INFO  Number of crawlers 3
2025-03-18T15:50:33.860Z INFO  CheerioCrawler: Starting the crawler.
2025-03-18T15:50:33.863Z INFO  CheerioCrawler: Starting the crawler.
2025-03-18T15:50:33.889Z INFO  Received GET message at: /
2025-03-18T15:50:33.922Z INFO  PlaywrightCrawler: Starting the crawler.
2025-03-18T15:50:34.590Z INFO  Received GET message at: /search?query=apify&scrapingTool=raw-http&blockMedia=true&debugMode=true&maxResults=1
2025-03-18T15:50:34.591Z INFO  Received query parameters: {"query":"apify","scrapingTool":"raw-http","blockMedia":"true","debugMode":"true","maxResults":"1"}
2025-03-18T15:50:34.945Z INFO  Creating new cheerio crawler with key {"keepAlive":true,"maxRequestRetries":2,"proxyConfiguration":{"isManInTheMiddle":false,"nextCustomUrlIndex":0,"usedProxyUrls":{},"log":{"LEVELS":{"0":"OFF","1":"ERROR","2":"SOFT_FAIL","3":"WARNING","4":"INFO","5":"DEBUG","6":"PERF","OFF":0,"ERROR":1,"SOFT_FAIL":2,"WARNING":3,"INFO":4,"DEBUG":5,"PERF":6},"options":{"level":5,"maxDepth":4,"maxStringLength":2000,"prefix":"ProxyConfiguration","suffix":null,"logger":{"_events":{},"_eventsCount":0,"options":{"skipTime":true}},"data":{}},"warningsOnceLogged":{}},"domainTiers":{},"config":{"options":{},"services":{},"storageManagers":{}},"groups":["GOOGLE_SERP"],"password":"*********","hostname":"10.0.47.179","port":8011,"usesApifyProxy":true},"autoscaledPoolOptions":{"desiredConcurrency":1}}
2025-03-18T15:50:34.947Z INFO  Google-search-crawler has started 🫡
2025-03-18T15:50:34.948Z INFO  Number of crawlers 4
2025-03-18T15:50:34.949Z INFO  Creating new cheerio crawler with key {"keepAlive":true,"maxRequestRetries":1,"proxyConfiguration":{"isManInTheMiddle":false,"nextCustomUrlIndex":0,"usedProxyUrls":{},"log":{"LEVELS":{"0":"OFF","1":"ERROR","2":"SOFT_FAIL","3":"WARNING","4":"INFO","5":"DEBUG","6":"PERF","OFF":0,"ERROR":1,"SOFT_FAIL":2,"WARNING":3,"INFO":4,"DEBUG":5,"PERF":6},"options":{"level":5,"maxDepth":4,"maxStringLength":2000,"prefix":"ProxyConfiguration","suffix":null,"logger":{"_events":{},"_eventsCount":0,"options":{"skipTime":true}},"data":{}},"warningsOnceLogged":{}},"domainTiers":{},"config":{"options":{},"services":{},"storageManagers":{}},"groups":[],"password":"*********","hostname":"10.0.47.179","port":8011,"usesApifyProxy":true},"requestHandlerTimeoutSecs":40,"autoscaledPoolOptions":{"desiredConcurrency":5,"maxConcurrency":50,"minConcurrency":3}}
2025-03-18T15:50:34.950Z DEBUG CheerioCrawler:SessionPool: No 'persistStateKeyValueStoreId' options specified, this session pool's data has been saved in the KeyValueStore with the id: PAth5tXJ7hgcWjTc6
2025-03-18T15:50:34.951Z INFO  Crawler cheerio has started 💪🏼
2025-03-18T15:50:34.952Z INFO  Number of crawlers 5

Metadata

Metadata

Assignees

No one assigned

    Labels

    t-aiIssues owned by the AI team.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions