Google News URL Format
-
Hi,
We are currently redesigning our gaming website (www.totallygn.com) and one of our main goals is to get listed by Google News in future.
Looking at the Google News URL requirements "The URL for each article must contain a unique number consisting of at least three digits."
How does the above affect SEO structure? I was planning on using a format such as
www.totallygn.com/xbox-360/360-reviews/fifa-12-review
how would this compare to something like?
www.totallygn.com/xbox-360/360-reviews/fifa-12-review234
Thanks in advance for your help
-
Hi all,
Is it still the case that you can submit EITHER with 3 digits in the URL OR via a news sitemap? I can't see anything in the official instructions about the sitemap route... they seem pretty insistent on the 3 digit rule though.
-
Can we do it just by submitting a news sitemap via GWT?
-
Do you still have to go through the inclusion process here: http://support.google.com/news/publisher/bin/bin/static.py?hl=en&ts=2394225&page=ts.cs&from=191208
Thanks guys... MB.
-
-
My site was just accepted in to Google News yesturday and when I went to check the sitemap for the news, Google Webmaster showed errors for the news sitemap.
So I have tried every wordpress plugin I could find, and submitted the news sitempa.
Each one had errors, the only one that worked for me and my site is now showing in Google News is this plugin BWP Google XML Sitemaps
Hope that helps
-
Hi WalesDragon,
Did these answers solve your question, or are you looking for some more advice still?
-
No worries!
I am pretty sure that plugin is the one which allows the WP admin to select JUST posts, and leave out pages... but I am not 100%.
The reason I recommended that particular plugin though, is that from experience, many of the other Google news sitemap plugins seem to cause some sort of XML error when submitting the sitemap to Google news, but this one doesn't, so using it should save a few headaches, and having to 'shop around', so to speak!
Another thing to bear in mind, is that if you have 1 section of your site (say, domain.com/news) and you have an RSS feed on there, showing a feed of a different section of your website (say, domain.com/self-promotional-company-blog), and the second blog for any reason ends up with 3 unique digital in the URL of a post, then Google news can find the link in the RSS feed of your news section, and index the page on the (self promotional blog) in error -
Sounds harmless, but if the news team then decided that you were actually TRYING to get self promotional stuff (even company news) into Google news, you could loose your news approved status... short solution is just to be careful when putting any RSS feeds (of other parts of your site/domain) on your news section!!! (Hope that makes sense?!) - I learned this the hard way (didn't get dropped or anything, as I acted swiftly to sort the issue!).
Hope that helps!
Mike.
-
Mike,
Thanks for this, I personally found it helpful. I like the idea of the Google News Plugin and will test it out on a small site.
Good info,Robert
-
In addition to the excellent response by Robert Fisher, below, you do not actually NEED to do this, but you CAN do it automatically if you choose to.
Google News needs...
EITHER a unique 3 digit code in the URL...
OR
A Google news specific sitemap.
So, your options are to either change your WP (I checked, your site is Wordpress based, yes?) Permalinks settings, to include post id, OR use a google news sitemap plugin.
You can always put a number in front of the post id, so use something like:
/%postname%/1%post_id%
So, adding a numerical '1' befor %post_id% in your permalinks.
If you are worried about lots of 404 errors due to changing your URL structure, then how about using deans permalinks migration (install it BEFORE changing your permalink settings!) - http://wordpress.org/extend/plugins/permalinks-migration-plugin-for-wordpress/
As for a Google News sitemap... For wordpress, I recommend this one: http://wordpress.org/extend/plugins/gn-xml-sitemap/
If you go down the sitemap route, do be sure that ONLY news posts are included... E.G. NOT your static, non-news content pages!
IN TERMS OF SEO -
I don't feel it will effect things too much, so long as everything else is good as regards your on-page SEO etc.
Hope that helps!
-
If you understand that the requirement for the three or more digits is around insuring that there is a unique page for each individual article. So if you look at: www.totallygn.com/xbox-360/360-reviews/fifa-12-review, It appears to me that the second 360 is still associated with reviews of games associated with XBox 360. The fifa-12-review appears to be a soccer game (I have never played on one of those things I am an intelligent worker and not involved in any type of warfare even modern).
So, the second where you have review 234 does work because the three digit number appears to give a unique numeric identifier to that article. (Note if a 4 digit number it cannot start with 199 or 200).
In the event there is something that would prevent you from using this convention, you can always create a news Sitemap. Google Support News Sitemap.
Hope this helps, best,
Edit: missed seo question: It has a positive effect on SEO as it is following Google's convention. (One question is whether or not having a news sitemap would give more credence/weight as a news site versus the unique identifier???) My guess is it would.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Metadata configured, but Google only shows URL with sitelinks. Something wrong with my code?
Hi guys! I have a metadata problem with my home page. If I look for the brand's keyword, the SERPs don´t show the metadata I configured, instead it shows the URL with sitelinks. If I use the "site:" command, it doesn't appear at all. This happens only on the home page, not the rest, which are roughly 700 pages. Those appear fine. I already have a meta title and meta description configured, which include the mentioned KW. It used to appear correctly before. GSC shows it indexed. Most audit tools (configured to crawl JS) detect the metadata. Moz's On Page tool doesn't. Could it be because of the JS configuration? Or am I missing something else? Here´s the meta description code:What do you think? I'd appreciate your input. Thanks!
Technical SEO | | Reprise0 -
My SEO friend says my website is not being indexed by Google considering the keywords he has placed in the page and URL what does that mean?
My SEO friend says my website is not being indexed by Google considering the keywords he has placed in the page and URL what does that mean? We have added some text in the pages with keywords thats related the page
Technical SEO | | AlexisWithers0 -
How to delete specific url?
I just ran drawl diagnostics and trying to delete pages such as "oops that page can't be found" or "404 (not found_ error response pages. Can anyone help?
Technical SEO | | sawedding0 -
Duplicate pages in Google index despite canonical tag and URL Parameter in GWMT
Good morning Moz... This is a weird one. It seems to be a "bug" with Google, honest... We migrated our site www.three-clearance.co.uk to a Drupal platform over the new year. The old site used URL-based tracking for heat map purposes, so for instance www.three-clearance.co.uk/apple-phones.html ..could be reached via www.three-clearance.co.uk/apple-phones.html?ref=menu or www.three-clearance.co.uk/apple-phones.html?ref=sidebar and so on. GWMT was told of the ref parameter and the canonical meta tag used to indicate our preference. As expected we encountered no duplicate content issues and everything was good. This is the chain of events: Site migrated to new platform following best practice, as far as I can attest to. Only known issue was that the verification for both google analytics (meta tag) and GWMT (HTML file) didn't transfer as expected so between relaunch on the 22nd Dec and the fix on 2nd Jan we have no GA data, and presumably there was a period where GWMT became unverified. URL structure and URIs were maintained 100% (which may be a problem, now) Yesterday I discovered 200-ish 'duplicate meta titles' and 'duplicate meta descriptions' in GWMT. Uh oh, thought I. Expand the report out and the duplicates are in fact ?ref= versions of the same root URL. Double uh oh, thought I. Run, not walk, to google and do some Fu: http://is.gd/yJ3U24 (9 versions of the same page, in the index, the only variation being the ?ref= URI) Checked BING and it has indexed each root URL once, as it should. Situation now: Site no longer uses ?ref= parameter, although of course there still exists some external backlinks that use it. This was intentional and happened when we migrated. I 'reset' the URL parameter in GWMT yesterday, given that there's no "delete" option. The "URLs monitored" count went from 900 to 0, but today is at over 1,000 (another wtf moment) I also resubmitted the XML sitemap and fetched 5 'hub' pages as Google, including the homepage and HTML site-map page. The ?ref= URls in the index have the disadvantage of actually working, given that we transferred the URL structure and of course the webserver just ignores the nonsense arguments and serves the page. So I assume Google assumes the pages still exist, and won't drop them from the index but will instead apply a dupe content penalty. Or maybe call us a spam farm. Who knows. Options that occurred to me (other than maybe making our canonical tags bold or locating a Google bug submission form 😄 ) include A) robots.txt-ing .?ref=. but to me this says "you can't see these pages", not "these pages don't exist", so isn't correct B) Hand-removing the URLs from the index through a page removal request per indexed URL C) Apply 301 to each indexed URL (hello BING dirty sitemap penalty) D) Post on SEOMoz because I genuinely can't understand this. Even if the gap in verification caused GWMT to forget that we had set ?ref= as a URL parameter, the parameter was no longer in use because the verification only went missing when we relaunched the site without this tracking. Google is seemingly 100% ignoring our canonical tags as well as the GWMT URL setting - I have no idea why and can't think of the best way to correct the situation. Do you? 🙂 Edited To Add: As of this morning the "edit/reset" buttons have disappeared from GWMT URL Parameters page, along with the option to add a new one. There's no messages explaining why and of course the Google help page doesn't mention disappearing buttons (it doesn't even explain what 'reset' does, or why there's no 'remove' option).
Technical SEO | | Tinhat0 -
Google Sitelinks
Is there anyway to control the sitelinks under a listing in Google? I have a group of lawyers where 1 of the them is showing up in the sitelinks. They want all of the lawyers to show up. Right now it is showing 1 lawyer, about page, contact us page, etc. Thanks!!!!
Technical SEO | | SixTwoInteractive0 -
Google is indexing my directories
I'm sure this has been asked before, but I was looking at all of Google's results for my site and I found dozens of results for directories such as: Index of /scouting/blog/wp-includes/js/swfupload/plugins Obviously I don't want those indexed. How do I prevent Google from indexing those? Also, it only seems to be doing it with Wordpress, not any of the directories on my main site. (We have a wordpress blog, which is only a portion of the site)
Technical SEO | | UnderRugSwept0 -
Will google let me do this
Hi i am working on my site at the moment www.in2town.co.uk and i am adding new sections and was thinking about buying domain names that best describe that section and which people would remember. so for example i am looking at adding a tenerife magazine to my site and would like to know if it would be wise to buy a domain name for example tenerife magazine and then have it directed to the section of my site. would this benefit my site in any way and would google allow this. instead of having in2town.co.uk and then tenerife magazine after it, sorry cannot find the slash as i am on a spanish keyborad at the moment, i would like to have something like tenerifemagazine.co..uk etc If anyone can give me advice on this then that would be great. also can anyone let me know if this is a wise idea or not, to have sub domain names on my main site. i would like to know if i had tenerifemagazine under the in2town domain name would it slow the site down or should i consider building a brand new site just for that and then making people aware that it comes under the in2town umbrella many thanks
Technical SEO | | ClaireH-1848861 -
Google Sandboxing
I have a new site with a new domain that ranked well the 1st week or so after it was indexed then it totally dropped off the SERP. My question is, does Google Sandboxing affect new sites on new domains that don't have any incoming links? The site dropped off before I began link building - from what I've read unnatural link build is often the cause. Can you still be sandboxed without any link building? If this is the case, are there things I can do to get out of the sandbox? Thanks folks, Jason
Technical SEO | | OptioPublishing0