Increasing ram usage and tool never finishes. #42
Comments
Do you have the same behavior when using chromium bundled with the puppeteer node package? |
How can I use the puppeteer node package? (Sorry, I have little experience with Node.js.) I installed the latest stable version (mentioned in your reply to my previous issue), but the issue is the same. |
I have the same problem in Docker, which uses the puppeteer node package. I removed the pinned versions in the Dockerfile to get it working:
RUN apk add --no-cache \
    chromium \
    nss \
    freetype \
    freetype-dev \
    harfbuzz \
    ca-certificates \
    ttf-freefont \
    nodejs \
    yarn \ |
Does it work for you now? Could you prepare a pull request then to help others? Could you find out which versions you are using instead? I think I decided to pin the version numbers to have a more reproducible setup, which is important for auditing. |
Sorry, my previous answer was not clear. I am trying things out, but they all break when I test them on my website (the same happens for a client's site, but I don't want to share that one, and my website is a good "test" example). What I meant was that I tried Docker (using a modified version to get it working), but got the same bug. |
In your example you have included |
It's unfortunately the same (when using the installed version in my initial post but with --max 1). I gave up on docker because I get this error: error An unexpected error occurred: "EACCES: permission denied, scandir '/opt/website-evidence-collector/output/browser-profile'". |
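The EACCES error quoted above says the process could not list the browser-profile directory, which in Docker often points to an ownership mismatch on a mounted output folder. As a rough diagnostic sketch only (the helper name `describe_dir_access` is hypothetical and not part of website-evidence-collector, and it assumes a POSIX system with Python available in the container), one could check who owns the directory and whether the current user may scan it:

```python
import os
import stat


def describe_dir_access(path):
    """Report ownership and scan permission for a directory (POSIX only).

    Hypothetical helper for illustration; not part of the tool itself.
    """
    st = os.stat(path)
    return {
        "path": path,
        "owner_uid": st.st_uid,        # who owns the directory
        "current_uid": os.getuid(),    # who is running this process
        "mode": stat.filemode(st.st_mode),
        # Listing a directory needs read (list entries) and execute (traverse).
        "can_scandir": os.access(path, os.R_OK | os.X_OK),
    }


# Example call with the path taken from the error message:
# print(describe_dir_access("/opt/website-evidence-collector/output/browser-profile"))
```

If `owner_uid` and `current_uid` differ and `can_scandir` is false, the permissions of the output directory are the likely culprit, independent of the hang discussed in this issue.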
@rriemann-eu if you need more info to debug let me know! |
So when I execute the following two commands, I do not get any error:
website-evidence-collector --output output_dir/vincentcox.com --json --max 1 https://vincentcox.com
website-evidence-collector --output output_dir/vincentcox.com2 --json --max 1 https://vincentcox.com -- --no-sandbox
I am using the latest version from master on openSUSE. From the script:
host: mars.fritz.box
version:
  npm: 0.4.0
  commit: v0.4.0-70-ga956e2d
cmd_args: '--output output_dir/vincentcox.com --json --max 1 https://vincentcox.com'
environment: {}
node_version: v10.22.1
browser:
  name: Chromium
  version: HeadlessChrome/80.0.3987.0
  user_agent: >-
    Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko)
    Chrome/72.0.3617.0 Safari/537.36
  platform:
    name: Linux
    version: 5.8.14-1-default
  extra_headers: {}
  preset_cookies: {}
start_time: 2020-11-24T11:30:47.957Z
end_time: 2020-11-24T11:31:00.650Z
Does your problem occur with all websites? |
Hmmm, it might be something with my installation then. I'll go with Docker to avoid further mistakes and debugging time on your side. However, the Dockerfile in the repo doesn't work anymore. When I try to build it, I get this error:
I think this error is caused by this https://superuser.com/a/1486407/1039133
|
OK, so I will close this one until we know how to reproduce your problem on other systems. I will open a new issue on the docker problem, which deserves a solution. |
Good idea, feel free to tag me in this! |
I can confirm this on docker:
As I do not have this problem on my local computer without docker, I can imagine that it somehow depends on the Chromium version that is used. Maybe newer Chromium versions behave differently than the version |
Yeah, the thing is: if it happened only on my machine and not in Docker, it would be something on my side. But even in Docker it gives me the same issue. With Chromium 77.0.3865 (as used in this working Dockerfile), it works for me. Maybe this issue is not even in the scope of this project, but a Chromium issue itself. For me it's okay if you guys close it, but keep in mind that other people might face the same issue (in Docker or with a system installation). Maybe my website is quite heavy to parse, but it's a standard WordPress website, so I think chances are high that people will face the same situation. |
Steps to reproduce:
Spawn a fresh Ubuntu 20.04 server (no GUI) VPS, install all the tools:
It keeps running and it keeps eating resources:
(rip memory)
Note that I am using the latest version from GitHub and that something might have broken it in the GitHub version. But as explained in this issue (#41), I cannot access the official download link of the stable version.
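To keep a run like the one described above from exhausting memory while reproducing the hang, the command can be wrapped with a hard time limit. The following is a minimal Python sketch under stated assumptions: `run_bounded` is a hypothetical helper (not part of website-evidence-collector), the 300-second limit in the example is arbitrary, and it relies on POSIX process groups so that Chromium child processes are killed together with the Node process.

```python
import os
import signal
import subprocess


def run_bounded(cmd, timeout_s):
    """Run cmd (a list of arguments) and kill it after timeout_s seconds.

    Returns (finished, returncode); returncode is None if the run was killed.
    Hypothetical helper for illustration, POSIX only.
    """
    # start_new_session=True makes the child the leader of its own process
    # group, so its process group id equals its pid.
    proc = subprocess.Popen(cmd, start_new_session=True)
    try:
        return True, proc.wait(timeout=timeout_s)
    except subprocess.TimeoutExpired:
        # Kill the whole group, including any browser processes it spawned.
        os.killpg(proc.pid, signal.SIGKILL)
        proc.wait()
        return False, None


# Example call, reusing a command line from this thread:
# run_bounded(["website-evidence-collector", "--output", "output_dir/vincentcox.com",
#              "--json", "--max", "1", "https://vincentcox.com"], 300)
```

A result of `(False, None)` indicates the tool did not finish within the limit, which gives a reproducible pass/fail signal when comparing Chromium versions.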