Exetools  

  #1  
05-29-2024, 18:21
wx69wx2023
google api document leak

https://sparktoro.com/blog/an-anonymous-source-shared-thousands-of-leaked-google-search-api-documents-with-me-everyone-in-seo-should-see-them/



The documents leaked on HexDocs:

https://hexdocs.pm/google_api_content_warehouse/0.4.0/api-reference.html
Attached Files
File Type: 7z google_api_content_warehouse.7z (3.85 MB, 33 views)

Last edited by wx69wx2023; 05-29-2024 at 22:04.
  #2  
05-30-2024, 07:17
chants
You literally just beat me to it. I think they are still on GitHub, where they originally leaked:
Quote:
https://github.com/yoshi-code-bot/elixir-google-api/commit/d7a637f4391b2174a2cf43ee11e6577a204a161e
Noteworthy:
Quote:
Ranking features: 2,596 modules are represented in the API documentation with 14,014 attributes.
Weighting: The documents did not specify how any of the ranking features are weighted – just that they exist.
Twiddlers: These are re-ranking functions that “can adjust the information retrieval score of a document or change the ranking of a document,” according to King.
Demotions: Content can be demoted for a variety of reasons, such as:
A link doesn’t match the target site.
SERP signals indicate user dissatisfaction.
Product reviews.
Location.
Exact match domains.
Porn.
Change history: Google apparently keeps a copy of every version of every page it has ever indexed. Meaning, Google can “remember” every change ever made to a page. However, Google only uses the last 20 changes of a URL when analyzing links.

...

Links matter. Shocking, I know. Link diversity and relevance remain key, the documents show. And PageRank is still very much alive within Google’s ranking features. PageRank for a website’s homepage is considered for every document.

This doesn’t prove Google spokespeople have lied about links not being a “top 3 ranking factor” or links mattering less for ranking. Two things can be true at once. Again, we don’t know how any of these features are weighted.
Successful clicks matter. This should not be a shocker, but if you want to rank well, you need to keep creating great content and user experiences, based on the documents. Google uses a variety of measurements, including badClicks, goodClicks, lastLongestClicks and unsquashedClicks.

Also, longer documents may get truncated, while shorter content gets a score (from 0-512) based on originality. Scores are also given to Your Money Your Life content, like health and news.

...

Brand matters. Fishkin’s big takeaway? Brand matters more than anything else:

“If there was one universal piece of advice I had for marketers seeking to broadly improve their organic search rankings and traffic, it would be: ‘Build a notable, popular, well-recognized brand in your space, outside of Google search.'”
Entities matter. Authorship lives. Google stores author information associated with content and tries to determine whether an entity is the author of the document.

SiteAuthority: Google uses something called “siteAuthority”.

Google told us something like this existed in 2011, after the Panda update launched, stating publicly that “low quality content on part of a site can impact a site’s ranking as a whole.”
However, Google has denied having a website authority score in the years since then.
Chrome data. A module called ChromeInTotal indicates that Google uses data from its Chrome browser for ranking.

Whitelists. A couple of modules indicate Google whitelists certain domains related to elections and COVID – isElectionAuthority and isCovidLocalAuthority. Though we’ve long known Google (and Bing) have “exception lists” when “specific algorithms inadvertently impact websites.”

Small sites. Another feature is smallPersonalSite – for a small personal site or blog. King speculated that Google could boost or demote such sites via a Twiddler. However, that remains an open question. Again, we don’t know for certain how much these features are weighted.

Other interesting findings. According to Google’s internal documents:

Freshness matters – Google looks at dates in the byline (bylineDate), URL (syntacticDate) and on-page content (semanticDate).
To determine whether a document is or isn’t a core topic of the website, Google vectorizes pages and sites, then compares the page embeddings (siteRadius) to the site embeddings (siteFocusScore).
Google stores domain registration information (RegistrationInfo).
Page titles still matter. Google has a feature called titlematchScore that is believed to measure how well a page title matches a query.
Google measures the average weighted font size of terms in documents (avgTermWeight) and anchor text.
Analysis in full:
Quote:
https://searchengineland.com/google-search-document-leak-ranking-442617
  #3  
05-30-2024, 14:13
Sh4DoVV
Quote:
Originally Posted by wx69wx2023
https://sparktoro.com/blog/an-anonymous-source-shared-thousands-of-leaked-google-search-api-documents-with-me-everyone-in-seo-should-see-them/



The documents leaked on HexDocs:

https://hexdocs.pm/google_api_content_warehouse/0.4.0/api-reference.html
Hi, please upload to Mega or MediaFire.
  #4  
05-30-2024, 18:59
wx69wx2023
Quote:
Originally Posted by Sh4DoVV
Hi, please upload to Mega or MediaFire.
https://hexdocs.pm/google_api_content_warehouse/0.4.0/google_api_content_warehouse.epub
All times are GMT +8.