-
Notifications
You must be signed in to change notification settings - Fork 16
Setup and Usage
Make sure you have installed this script as written in the installation guide
Follow theses steps to generate your sitemap:
- Setup Sitepod
- Run Sitepod
- Edit the result
- Store sitemap
- Inform Google
- Watch your website within Google
Execute the script with your browser: browse to http://podaac.jpl.nasa.gov/admin/sitemap/. You’ll get a screen like this:
Set the settings according to the following cases
- Check the settings in Page root (this is the path on the web server where your files are stored) and the Website (the URL of your website) and maybe correct them if necessary.
- Exclude directories – this directories will not be scanned for URLs; Example: to exclude all files in the directory img but not images, add /img/ to the list of directories; there is only a sub-string comparison performed.
- Exclude files – URLs containing strings that are in this list, will neither be added to the sitemap nor crawled for URLs
Choose the item that matches your website and set the settings accordingly:
I have a small website with less than 500 files
Make the following settings:
Check Ping google
Uncheck compress sitemap
Check Scan website
Check Display edit screen after scan
Check Store filelist
Adapt Lastmod, Priority and Changefreq to your needs
With this settings, Sitepod will crawl your website and finally give you an edit screen where you can adapt the values.
I have a bigger website with more than 500 files
Make the following settings:
Check Ping google
Check compress sitemap
Check Scan website
Uncheck Display edit screen after scan
Uncheck Store filelist
Adapt Lastmod, Priority and Changefreq to your needs
With this settings, Sitepod will crawl your website and will write the result direct into the sitemap file.
There will be no edit screen since this can slow down or kill your browser. The created sitemap file will be compressed (if available). Check the expert settings to get some background information about the timeout functionality and the crawler if you’re running into problems when saving the settings and executing the scan.
- Ping Google – inform Google when a sitemap has been created
- Timeout functionality – Sitepod allows you to set the time how long it will perform the actions. This is necessary if you’re running a big website with many URLs, but your web host allows you to execute php scripts only for a short amount of time.
There are two different timeouts that might occur:
- PHP timeout: The PHP engine stops the execution and prints an error at the end of the script. Solution: Just press the Setup link, enable the timeout and type in the average time to the timeout (normally 30 seconds) minus 5 seconds (backup) time. Now the browser is forced to reload the page with JavaScript (if not enabled, please do it here!) when a timeout occurs.
- Web server timeout: The web server stops the output thread and does not print an error at the end of the script. Solution: Just press the Setup link, enable the timeout and type in the average time to the timeout (normally 300 seconds) minus 5 seconds (backup) time.
Press the Submit Settings button to store the settings and start the scan of your website.
If you checked the “Display edit screen after scan” checkbox, you’ll see a page like this:
You can now adopt the settings to your needs. Press the “Create file” button to use this information for generation of the sitemap file.
When you’ve pressed the button “Create file” in the result overview page the sitemap will be generated and written to the sitemap file specified at the settings page.
When the sitemap has been successfully written to the filesystem you’ll get the option to inform Google. Just press the “Submit to google” button. Is this your first sitemap Google suggests to submit the sitemap within a Google Sitemaps account. You can get one at http://www.google.com/webmasters/sitemaps/. This is the only possibility to track the status of the Google sitemap at the moment – so please do so. The message given by Sitepod only means that Google received the Podaac Sitemap URL, not that it successfully downloaded and computed the sitemap.
That’s it, you’ve successfully created your Google sitemap. So track the impact of Google Sitemaps you can query Google about the number of indexed pages: Type site:podaac.jpl.nasa.gov (modify the website after site: to match your website). Then you’ll see the number of indexed pages: Results 1-… of about x from … Where x is the number of pages of your website that is integrated in the Google Search index at the moment. This number can differ depending on your location because of the not synchronized Google index. Maybe you have some webpages in the result without description, only the URL of the webpage. This means that Google got these URLs (from the sitemap file) but does not have crawled, computed or added of this webpage to its index. It normally takes 1-5 days then there will be a description.