- Run
yarn
to install libs - Input
files
in thefiles.js
- Change
text
to detect in thestart.js
// Detect text in the pdf file
if (text.indexOf("Tên") >= 0) {
console.log(url);
fs.appendFileSync("result.txt", url + "\n");
invalidUrls.push(url);
}
- Run
yarn src/start.js
the result will be written in theresult.txt