TD*IDF analysis Tools
-
Hi guys,
I was wondering if anyone knew of free TD*IDF analysis tools on the market?
I know about onpage.org and Text-tools.net both paid.
I was wondering if anyone knows of other tools?
Cheers,
Chris
-
Hi Chris,
I don't know of any free tools that do this unless you want to write some code yourself. If you go that route we have some open source libraries that you might find useful, especially qdr that implements the TF-IDF scoring and dragnet for parsing/cleaning the HTML. Good luck in your search!
-
Hi Chris,
It's not the TD-IDF solution you're after but may help? SEO Quake (available as a free Chrome plug-in: https://chrome.google.com/webstore/detail/seoquake/akdgnmcogleenhbclghghlkkdndkjdjc) approximates some of this data for you.
It will show the most commonly recurring 1, 2, 3 and 4 word phrases appearing on a web page. It won't compare this to a corpus (e.g. your whole site). It then gives a Density % (broadly, how often this word/phrase appears) and a Prominence % (based around density but also where it appears: title, description, keywords etc.).
Hope that helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
SEO-optimized Data Visualizations (e.g. Charts) Tools
Hi there! We are currently evaluating data visualization / charting tools for rich content. Are there any open source solutions that work best in your opinion? Why? Some specific questions: Are static image / svg rendered images better than a javascript dynamic chart (canvas/HTML5)? Which gets indexed better? Is there any proven or perceived benefit to using Google Charts API that gives you an SEO boost? Are there tools for progressively enhancing HTML raw data tables to generate charts? Looking at a couple of solutions: Google Charts API C3.js Chartjs Thanks for your feedback!
Intermediate & Advanced SEO | | insurifyusa0 -
Link Type Analysis
Howdy Moz Fans, Just wondering if anyone knows any tools to which can identify link types. E.g. is the link - navigational, in the footer or in the body text. Specifically for internal links. Any suggestions? Cheers, RM
Intermediate & Advanced SEO | | MBASydney0 -
Custom sitemap or sitemap generator tool
I have recently launched a website which is using a free sitemap generator (http://web-site-map.com/). It's a large travel agency site (www.yougoadventure.com) with predominantly dynamically generated content - users can add their products as and when and be listed automatically. The guy doing the programming for the site says the sitemap generator is not up to the job and that I should be ranking far better for certain search terms than the site is now. He reckons it doesn't provide lastmod info and the sitemap should be submitted every time a new directory is added or change made. He seems to think that I need to spend £400-£500 for him to custom build a site map. Surely there's a cheaper option out there for a sitemap that can be generated daily or 'ping' google every-time an addition to the site is made or product added? Sorry for the non tech speak - Ive got my web designer telling one thing and the programmer another so im just left trawling through Q&As. Thanks
Intermediate & Advanced SEO | | Curran0 -
Google Webmaster Tools > HTML Improvements > 301 Moved Permanently pages - how did they even get there?
Hello experts! I'm going through my Google Webmaster Tools > HTML Improvements looking for pages with duplicate meta descriptions/titles that I can fix. And I noticed there are about 60 pages odd looking page titles that have duplicate meta descriptions, which are also noted as: 301 Moved Permanently Moved Permanently The document has moved here. Apache Server at sports When I click on the link to see the page names, all of them are pages we never created. The pages are all sports blog related. Here are few examples: http://www.titanium-jewelry.com/justin-tuck-blog.html http://www.titanium-jewelry.com/unlimited-potential-project-blog.html http://www.titanium-jewelry.com/left-handed-baseball-glove-blog.html http://www.titanium-jewelry.com/adjustable-basketball-hoops-blog.html how did they get on our site? Is this some sort of malicious attack? Most of them are sports related blog looking names. I just don't know how these pages could have been created. 2) is this hurting us with Google?3) Can you tell when the page was created?Thanks ron xEtX3op.jpg
Intermediate & Advanced SEO | | yatesandcojewelers0 -
Can anyone explain sudden drop in impressions. Webmaster tools screenshot attached.
I posted here recently concerned about a drop in rankings. I'm yet to figure out what happened and thought I'd post a screenshot of the loss of impressions to see if anyone can help? jeZO1r1.png
Intermediate & Advanced SEO | | SamCUK0 -
Matt Cutts Announces Disavow Google Webmasters Tool
Today at Pubcon Cutts announced this tool similar to Bing's - http://searchengineland.com/google-launches-disavow-links-tool-136826. My question is, has anybody used Bing's? Do you foresee any problems or issues to consider? Just checking before going ahead with using it 🙂 Thanks
Intermediate & Advanced SEO | | bradkrussell0 -
Magic keywords in Google Webmaster Tools
Hi All, Recently moved a friend to a new WP back-end website as they were on Flash which is pretty, but not necessarily the best for SEO. http://francesphotography.com My question is that once Google finally indexed the site, I noticed in Google Webmaster tools that it found the most significant keyword to be: automatically On the following top pages: | tag/snow-boarding-photography/ |
Intermediate & Advanced SEO | | BoulderJoe
|tag/style-photography/ |
|tag/underwater-photography/ |
|tag/vacation-photography/ |
|tag/wedding-photography-beaver-creek/ |
|tag/wedding-photography-copper-mountain/ |
|tag/wedding-photography-denver/ |
|tag/wedding-photography/ |
|underwater-photography-scuba-diving-cozumel-mexico/ |
|wedding-photography/ | The goofy thing is I can find anywhere that "automatically" is used - perhaps it is coming from a plug-in or magically keyword beans that Google found? Any guidance is appreciated.
0 -
Tool to calculate the number of pages in Google's index?
When working with a very large site, are there any tools that will help you calculate the number of links in the Google index? I know you can use site:www.domain.com to see all the links indexed for a particular url. But what if you want to see the number of pages indexed for 100 different subdirectories (i.e. www.domain.com/a, www.domain.com/b)? is there a tool to help automate the process of finding the number of pages from each subdirectory in Google's index?
Intermediate & Advanced SEO | | nicole.healthline0