
GoogleAPI startRow/paging support and Data capture loop to import multiple dates #54

Open
pfauenauge wants to merge 1 commit into PromInc:master from pfauenauge:googleAPI-startRow


@pfauenauge

GoogleAPI startRow/paging support:
The Google API supports a startRow parameter for paging through more than 5,000 results. This is now implemented and can be configured in config/config.php, in the Config class: const downloadGoogleMaxPages = 99;
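
To illustrate the paging idea, here is a minimal sketch in Python (the project itself is PHP, so this is not the code from this pull request). It assumes each request returns at most 5,000 rows and that paging stops at the first short page or once the page cap is reached. `fetch_page` and `fake_fetch_page` are hypothetical stand-ins for the real API call; only `startRow` and `downloadGoogleMaxPages` come from the description above.

```python
from typing import Callable, Dict, List

ROW_LIMIT = 5000                 # rows per request (the 5,000-row cap mentioned above)
DOWNLOAD_GOOGLE_MAX_PAGES = 99   # mirrors Config::downloadGoogleMaxPages


def fetch_all_rows(fetch_page: Callable[[int, int], List[Dict]],
                   max_pages: int = DOWNLOAD_GOOGLE_MAX_PAGES,
                   row_limit: int = ROW_LIMIT) -> List[Dict]:
    """Collect rows page by page until a short page or max_pages is reached."""
    rows: List[Dict] = []
    for page in range(max_pages):
        start_row = page * row_limit          # value sent as startRow
        batch = fetch_page(start_row, row_limit)
        rows.extend(batch)
        if len(batch) < row_limit:            # short page: nothing left to fetch
            break
    return rows


def fake_fetch_page(start_row: int, row_limit: int, total: int = 12000) -> List[Dict]:
    """Hypothetical stand-in for the API: serves slices of `total` dummy rows."""
    end = min(start_row + row_limit, total)
    return [{"row": i} for i in range(start_row, end)]
```

With the stand-in above, `fetch_all_rows(fake_fetch_page)` issues three requests (startRow 0, 5000, 10000), while passing `max_pages=2` stops after the second page.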

Data capture via CLI/cron job was a little difficult because the dates were not clear.
It is now possible to loop the import over multiple dates from the CLI.
Use the Config constant const maxDaysBatchImport = 99; to allow or limit that.
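
As a rough illustration of the date loop, the Python sketch below expands an inclusive date range into the individual days to import and refuses ranges longer than the configured cap. Rejecting an over-long range is an assumption made here; the pull request may handle that case differently (for example by truncating). `dates_to_import` is a hypothetical helper name; only `maxDaysBatchImport` comes from the description above.

```python
from datetime import date, timedelta
from typing import List

MAX_DAYS_BATCH_IMPORT = 99  # mirrors Config::maxDaysBatchImport


def dates_to_import(start: date, end: date,
                    max_days: int = MAX_DAYS_BATCH_IMPORT) -> List[date]:
    """Return every date from start to end (inclusive), capped at max_days."""
    if end < start:
        raise ValueError("end date is before start date")
    days = (end - start).days + 1
    if days > max_days:
        # Assumption: an over-long range is rejected rather than truncated.
        raise ValueError(f"range spans {days} days, limit is {max_days}")
    return [start + timedelta(days=offset) for offset in range(days)]
```

Each returned date would then be passed to the single-day import in turn, for example `for day in dates_to_import(date(2017, 5, 1), date(2017, 5, 3)): ...`.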

…xPages = 10;) and Data capture loop import multiple dates from CLI (use Config: const maxDaysBatchImport = 99;)
@PromInc
Owner

PromInc commented May 19, 2017

Thank you @pfauenauge for this commit. More than that, thank you for bringing to light the fact that Google has extended their API with the startRow feature. I was unaware of this.

I want to review your changes and how they will impact this tool overall before merging this pull request. This is a significant change (for the better) and thus should be reviewed properly.

For reference - here is the Google API documentation regarding this feature.
https://developers.google.com/webmaster-tools/search-console-api-original/v3/how-tos/search_analytics#getting-more-than-5000-rows

@pfauenauge
Author

pfauenauge commented May 20, 2017 via email

@PromInc
Owner

PromInc commented May 21, 2017

Yes, you are right that updating the Google API class would overwrite your change.

The version of the Google API in this project (1.1.5, versus the current version 2.1.3) is quite out of date and thus may be worth upgrading anyway. The newest version includes startRow support.

I'm exploring what updating the Google API will look like right now. That would provide a solid foundation on which to layer your solution.

@PromInc
Owner

PromInc commented May 22, 2017 via email

@pfauenauge
Author

pfauenauge commented May 23, 2017 via email

@PromInc
Owner

PromInc commented May 25, 2017

Thank you for the explanation.

The limit of 50,000 is just a personal limit you set then, is that correct? I can't find any documentation that says this is a hard limit imposed by the API (only the 5,000 rows per request).

Google Webmaster Tools API Quotas & Limits

Per-site limit (calls querying the same site): 5 queries per second, 200 queries per minute
Per-user limit (calls made by the same user): 5 queries per second, 200 queries per minute
Per-project limit (calls made using the same Developer Console key): 100,000,000 queries per day
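
For a sense of what these quotas mean for a paging loop, the Python sketch below works out the smallest gap between requests that satisfies both a per-second and a per-minute limit. It assumes requests are spaced evenly; the function name is hypothetical and nothing here is part of the pull request.

```python
def min_request_interval(per_second: int, per_minute: int) -> float:
    """Smallest evenly spaced gap (in seconds) that respects both limits."""
    # 5 per second allows a gap of 0.2 s; 200 per minute needs 0.3 s.
    # The larger of the two gaps is the one that must be honoured.
    return max(1.0 / per_second, 60.0 / per_minute)


# With the quotas quoted above, the per-minute limit is the binding one.
interval = min_request_interval(per_second=5, per_minute=200)
```

With 5 queries per second and 200 queries per minute, the gap comes out at roughly 0.3 seconds, so the per-minute quota rather than the per-second quota would pace a long paging run.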
