How do I diagnose a site that has had a corrupted database restored?
-
Here's the situation:
Downloaded a backup of a full database from CPanel, because we needed to perform some queries on it.
Found out that after restoring it, CPanel had not been able to zip up the full database because the server memory was set so low (some posts weren't showing up after the restore).
SO, how would I go about determining exactly what content is missing from the site? What search engine queries would you perform? Is there a plugin I could use to establish the inconsistencies between the database content and the search results?
Your help is appreciated in advance!
-
Thanks a lot, Ryan. This response was really helpful.
-
Prior to SEO, my time was spent as a Microsoft Database Administrator. Your site almost certainly is using a MySQL database which is a different format then T-SQL, but the comparison likely holds.
Whenever I set up a MS SQL database, a task would be scheduled to automatically shrink the database. There are temp tables which are added and removed, files within the database which can have unused pages removed, etc. A database size reduction of 5% does not indicate to me any data loss.
At a high level, you need to compare the "corrupted" database with the one you restored on a file or table level to determine any differences. That is the only reasonable way to achieve your goal. The work should ideally be performed by a professional who is highly experienced in MySQL.
If you desire further assistance, I recommend pursuing the topic on a MySQL forum as this does not really pertain to SEO nor even WordPress.
-
Hi Ryan,
Apologies for not mentioning the software. The website is built on Wordpress.
Here's a bit extra information for you regarding the issue: upon inspection after seeing that the database was corrupted, comparing file sizes of the recently backed-up database, with a database that was backed-up three days beforehand, we found that the most recently backed-up database was around 5% smaller in file size than the earlier one—if anything it should be larger.
Regarding timestamps, the latest posts are there, and the comments seem to be there, so determining what exactly IS missing is something I'm going to need help with.
I can't restore the earlier version of the database either, because important data has been added since then.
The site works fine, for now. I'm just worried somewhere down the road we're going to find that there are 100 posts missing that are now turning up as 404 pages and lost links.
Does that make sense? Thanks for your help.
-
Based on your inquiry, it seems likely you are using specific software to run your site. It is unclear what type of software is being used, which is a critical factor. It could be a CMS such as WordPress. a shopping cart such as ZenCart, a forum such as vBulletin, etc.
You would likely receive the fastest and most accurate response by using the support site of the specific software in use.
Based on your questions, you are in far over your head and should ideally step aside and find a programmer who can resolve the issue. With that said, I'll try to answer your questions.
"how would I go about determining exactly what content is missing from the site?"
In order to determine what is missing, you need a baseline. You need to understand the site's function and activity. For example, if you are running an ecommerce site, what is the timestamp of the last order placed on the site?
"What search engine queries would you perform?"
None. You have an onsite issue. That is where your attention needs to be focused.
** "Is there a plugin I could use to establish the inconsistencies between the database content and the search results?"**
No. The search results should not even be a consideration. Search engines may choose to index or not index your content based on numerous factors including the robots.txt file, the meta tags on each page, the content on each page and so forth. Asking this question indicates you are grasping at straws. If your site is important to you, hire a professional developer to fix the problem. If the site is not of great importance (i.e. it does not generate revenue) then you can visit the site of the software in use and spend a day or two reading various articles, forums and such, then making various setting changes in an attempt to restore the site.
Another option....contact the web host and request them to restore a full backup of the entire site. This option would likely be best, but you would lose all data from after the time the backup was taken.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Mobile site domain authority
Hello, I think this may be a coding issue, but hoping someone can help me. I am still having issues with our mobile site ranking, even though we created redirects/canonical to identify similar content between desktop version and mobile. I did notice through MOZ analysis of backlinks that we have no domain authority. If the mobile site is automatically detected dependent on the user, shouldn't we also have the same domain authority? How does that work exactly? How can we build up the domain authority for our mobile site? Any help would be greatly appreciated! Thanks
Technical SEO | | lfrazer0 -
301 Multiple Sites to Main Site
Over the past couple years I had 3 sites that sold basically the same products and content. I later realized this had no value to my customers or Google so I 301 redirected Site 2 and Site 3 to my main site (Site 1). Of course this pushed a lot of page rank over to Site 1 and the site has been ranking great. About a week ago I moved my main site to a new eCommerce platform which required me to 301 redirect all the url's to the new platform url's which I did for all the main site links (Site 1). During this time I decided it was probably better off if I DID NOT 301 redirect all the links from the other 2 sites as well. I just didn't see the need as I figured Google realized at this point those sites were gone and I started fearing Google would get me for Page Rank munipulation for 301 redirecting 2 whole sites to my main site. Now I am getting over 1,000 404 crawl errors in GWT as Google can no longer find the URL's for Site 2 and Site 3. Plus my rankings have dropped substantially over the past week, part of which I know is from switching platforms. Question, did I make a mistake not 301 redirecting the url's from the old sites (Site 2 and Site 3) to my new ecommerce url's at Site 1?
Technical SEO | | SLINC0 -
Googlebot cannot access your site
"At the end of July I received a message in my Google webmaster tools saying "Googlebot can't access your site" We checked our robots.txt file and removed a line break in it, and then I had Google Fetch the file again. I have not received any more messages since then. When we created the website I wrote all of the content and optimized each page for about 1 local keyword. A few weeks after I checked my keywords and did have a few on the first page of google. Since then almost all of them have completely disappeared. Because we had not link building effort I would not expect to still be on the first page, but I should definitely be seeing them before the 5th or even 10th page of Google. The address is http://www.tile-pompanobeach.com I'm not sure if these horrible results have something to do with the message from Google or something else. The problem is this client now wants to sign a contract with us for SEO and I really have no Idea what happened and if I will be able to figure it out. The main keyword for my home page is tile pompano beach and I aslo was using Pompano Beach Tile store for the About page which was previously on the first page of Google. Does anyone have some input?
Technical SEO | | DTOSI0 -
Will training videos available on the "members only" section of a site contribute to the sites ranking?
Hello, I got asked a question recently as to whether training videos on the deeper pages of a website (that you can only access if you are a member and log in) will help with the sites ranking. On the SEOMoz software these deeper pages have been crawled as far as I can tell with errors reported on pages from the "members only" section of the site, leading me to believe the members only pages and their content will contribute to the sites overall ranking profile. I have suggested uploading the informational videos on the main pages of the site for now, making them accessible to all visitors and putting them in a more obvious place to encourage more sharing and views, however I've also said I would check it out with some experts so any information will be greatly appreciated! Many thanks 🙂 Charlotte
Technical SEO | | CharlotteWaller0 -
Adding no follow links on my site
I am getting a warning about having too many links on my page www.accessoriesonline.co.uk (152) but I don't want to remove any links from the site. Its an ecommerce site with categories across the top, featured products and then a further category navigation in the footer. Would it be beneficial if I added a rel="nofollow" to the links in the footer as these are duplicates of the one's in the header or would this harm the links in the header and the destination URL's which I definitely want to be crawled? Also, does anyone know if SEOMOZ considers links with a rel=nofollow as an actually link when they calculate their overview? Thanks in advance
Technical SEO | | gavinhoman0 -
Best SEO strategy for a site that has been down
Because of hosting problems we're trying to work out, our domain was down all weekend, and we have lost all of our rankings. Doe anyone have any experience with this kind of thing in terms of how long it takes to figure out where you stand once you have the site back up? what the best SEO strategy is for immediately addressing this problem? Besides just plugging away at getting links like normal, is there anything specific we should do right away when the site goes back up? Resubmit a site map, etc? Thanks!
Technical SEO | | OneClickVentures0