Help with site structure needed - any assistance welcomed!
-
Hi all,
I am currently tasked with finding a better way to optimise our website ukdocumentstorage dot com.
For starters, I would like to know what our site structure actually is at present: which pages are linking to which at the moment, and which pages contain broken links that I need to remove from the content. Hopefully I'd then be able to tidy up any errors that the site already has in its internal linking.
Is there a way to do this easily? Or to have a graphical representation of the site's structure?
I have just signed into our Webmaster Tools account and I am faced with a list of 10 'Crawl Errors' which are all 404 errors. Some of them do not actually exist anymore, but are still being linked to from a few pages according to WMT.
For example, /industries_served_legal.htm is still being linked to from 5 of our pages (including /industries_served_local_authority.htm)
However, this doesn't seem to be the case at all, as I can't find a link to /industries_served_legal.htm on /industries_served_local_authority.htm. Any advice as to why this is happening? Is there a way to find out easily where these broken links are situated on the page? And if I do actually manage to find our broken links, how would I go about removing them?
The page /document_security.htm doesn't exist in our Sitewizard list of pages anymore, yet still exists online. How do I go about deleting this unnecessary page properly? And does this harm our rankings?
The document_security page also has an extra link on the top toolbar to a Document Management page, an addition which is no longer present on our up to date pages. Now this page (and the extra dropdown page when you hover over it) still exist on our list of Sitewizard pages at the moment, but we obviously no longer want to have these online anymore. How should I remove these?
I understand that this is a lot of information, and so I would appreciate any help that can be given on these!
Many thanks
-
Perfect sense, thank you! I'll now research how to actually set up this redirect.
-
If this is an internal link on your website, you would want to change the actual path to point to the newer secure-document-storage page.
If this is an external link from another website, you'd create a redirect that will take the incoming request for the old document-security page and push the visitor to the new secure-document-storage page.
Make sense?
Mike
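
For the internal-link case above, here is a minimal Python sketch of repointing links in a page's HTML. It is an illustration only: the function name `repoint_links` and both paths are hypothetical, and it assumes you can edit the page source directly (a site builder may only expose this through its own link editor).

```python
import re

def repoint_links(html: str, old_path: str, new_path: str) -> str:
    """Rewrite href attributes that point exactly at old_path so they point at new_path."""
    pattern = re.compile(
        r'(href\s*=\s*["\'])' + re.escape(old_path) + r'(["\'])',
        re.IGNORECASE,
    )
    # A function replacement avoids any backslash handling in new_path.
    return pattern.sub(lambda m: m.group(1) + new_path + m.group(2), html)

# Hypothetical page fragment and paths, for illustration only.
page = '<a href="/document_security.htm">Security</a> <a href="/contact.htm">Contact</a>'
updated = repoint_links(page, "/document_security.htm", "/secure-document-storage.htm")
print(updated)
```

This only matches links written exactly as `old_path`; absolute URLs or relative variants of the same page would need their own pass.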
-
So even though the text is different, I should redirect people clicking on the link to the old document-security page to the newer secure-document-storage page?
-
Here is an example that may help:
You have the following pages on your site - /product1.html, /product2.html, and /product3.html.
An external site (externalsite.com) links to the product 2 page on your site (yoursite.com/product2.html).
You decide to no longer sell product 2, so you remove /product2.html from your website; however, externalsite.com is still linking to yoursite.com/product2.html. You see a 404 warning in Google Webmaster Tools referencing this error.
You then have two options:
-
You recently started selling product 4, which is not the same product, but still offers the same solution to a potential customer. You create a /product4.html page and set up a 301 redirect from yoursite.com/product2.html to yoursite.com/product4.html, so visitors following the link on externalsite.com land on the new page.
-
You no longer sell this product or solutions like it, because it was not needed by visitors. The link from externalsite.com is no longer applicable to your site; therefore, you disregard the warning in Google Webmaster Tools and the link will eventually not be followed by Google.
Now, if the /product2.html page was still accessible online, but you no longer linked to it via yoursite.com, that is kind of a problem, because if externalsite.com is still linking there, visitors could stumble upon your old/outdated/not-used page. You do not need to actively worry about removing the link, but you should work on removing the page if it is no longer used.
Does that help and did I understand your question correctly?
Mike
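
The "still online but no longer linked" situation described above can be checked with a small Python sketch. It assumes you already have a list of live pages and a map of which page links to which (for example, exported from a crawler); the function name `find_orphans` and the sample data are hypothetical.

```python
def find_orphans(live_pages, links, entry_points=("/",)):
    """Return live pages that no other live page links to.

    live_pages:   set of paths that still respond on the site
    links:        dict mapping a source path to the set of paths it links to
    entry_points: pages that are expected to have no internal inlinks
    """
    linked = set()
    for source, targets in links.items():
        if source in live_pages:
            # Ignore self-links; they do not make a page reachable.
            linked.update(t for t in targets if t != source)
    return sorted(p for p in live_pages if p not in linked and p not in entry_points)

# Sample data mirroring the product example; /product2.html is live but unlinked.
live = {"/", "/product1.html", "/product2.html", "/product3.html"}
link_map = {
    "/": {"/product1.html", "/product3.html"},
    "/product1.html": {"/"},
}
orphans = find_orphans(live, link_map)
print(orphans)
```

Pages reported here are candidates for removal or redirection, since visitors can still reach them through external links.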
-
-
Apologies for the overload!
So my takeaway from this is that any pages that I have deleted but that can still be found on the internet (e.g. /document_security) I don't need to worry about actively trying to remove, as they will be removed by Google automatically in the future? And having these pages still existing on the internet (despite not having any current links going to them from pages I haven't deleted) will not harm my site?
Thank you for all of your help so far!
-
To add to Mike's answer
2: If the page is deleted and isn't coming back, you may want to 301 it to its new equivalent or possibly even return a 410 status code to tell search engines the page has been permanently removed.
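
As a rough Python sketch of that choice between status codes (the function name `choose_response` and its parameters are hypothetical, and how the response is actually sent depends on your server or site builder, which is not shown here):

```python
from http import HTTPStatus

def choose_response(replacement=None, permanently_removed=False):
    """Pick a status code and redirect target for a page that no longer exists."""
    if replacement:
        # A close equivalent exists: redirect permanently (301).
        return HTTPStatus.MOVED_PERMANENTLY, replacement
    if permanently_removed:
        # No equivalent and the page is never coming back: 410 Gone.
        return HTTPStatus.GONE, None
    # Otherwise fall back to a plain 404 Not Found.
    return HTTPStatus.NOT_FOUND, None

status, target = choose_response("/secure-document-storage.htm")
print(int(status), target)
```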
For more info on Status codes see the following article
http://www.seomoz.org/learn-seo/http-status-codes
-
Whoa! Information overload!!!
-
I don't know of anything that shows you a graphical representation of your site's linking structure; however, I do know of a program that will list out all of the linking pages on your site and the number of in and out links, including anchor text, etc. The number of in links can be an indicator of how your site is structurally organized.
-
404 errors are not bad as long as they are known. If you no longer have a page and you decide not to redirect from the old page to a new one, that is fine. Google is just giving you a heads up that your site or someone else's is linking to a non-existent page. If you do nothing to fix these 404 errors, the page will eventually be removed from Google's index and not be a problem.
-
/document_security.htm looks like it is being linked to from /services_storage_fast_retrieval.htm and /services_archive_storage.htm
I would recommend downloading and installing Screaming Frog. That is the program I was referencing in my response to #1, and it is how I found the issue in #3.
Seer Interactive also wrote a great blog on all of the things this tool can do.
Hope this helps.
Mike
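
To show the idea behind locating broken internal links, here is a minimal Python sketch using only the standard library. It is not how Screaming Frog works internally; it assumes the page HTML has already been fetched into a dictionary, and the names `LinkCollector`, `broken_internal_links` and the sample pages are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""
    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)

def broken_internal_links(pages):
    """pages maps path -> HTML. Return {missing_path: [pages that link to it]}."""
    broken = {}
    for source, html in pages.items():
        collector = LinkCollector()
        collector.feed(html)
        for href in collector.hrefs:
            parsed = urlparse(urljoin(source, href))
            if parsed.scheme or parsed.netloc:
                continue  # skip external links, mailto:, etc.
            if parsed.path not in pages:
                broken.setdefault(parsed.path, []).append(source)
    return broken

# Tiny in-memory "site" for illustration.
site = {
    "/index.htm": '<a href="/services.htm">S</a> <a href="industries_served_legal.htm">L</a>',
    "/services.htm": '<a href="/index.htm">Home</a>'
                     '<a href="/industries_served_legal.htm">Legal</a>'
                     '<a href="http://example.com/">Ext</a>',
}
report = broken_internal_links(site)
print(report)
```

Because every `<a>` tag in the source is scanned, this also surfaces links that are not obvious when viewing the page, such as entries inside a hover dropdown menu.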
-