Showing posts with label index. Show all posts
Showing posts with label index. Show all posts

Friday, November 4, 2011

Get Indexed by Google's Googlebot Right Away, the Right Way

Everyone in the online world knows extremely well that the most sought after traffic to one's site comes from a Google search. Folks, 80% of searches on the internet are done in Google.

In theory, it is simple - if you have something interesting to someone else, if you build a website with the honest to goodness goal to provide something useful for someone else, that someone else will find you. That is also how the creators of Google describe their main goal, to more or less have a great repository of information, and help people of our planet find useful stuff.

In practice, it is not that simple. It is not that simple because there are thousands, possibly even millions of sites like yours, because you might be running a very honest online business, selling some very useful product, but do not have unheard of, exceptionally grand 'content'. If your site is listed on page 265 of a search results set, be sure you will never get any visitors that way.

Unlike Yahoo and others, who rely on human involvement, Google does everything through automation. Websites are indexed (or crawled, or spidered - all terms refer to the same process) by their indexing software called Googlebot. Googlebot looks at websites daily, and rules programmed into the software decide which of your pages make it into the main Google index and which don't. After your site was indexed, whether it was submitted for indexing by a human or the robot just stumbled upon it, your pages are ranked, so Google knows on which page of a search to put your site on, and on what search phrases should your site even be part of the result search.

The Googlebot is very smart and works really well. Keep in mind however, that is just a piece of software, a very sophisticated one, but it's just a computer program. Consequently, it has a set of algorithms (rules) it uses to index web site content (information), a set of capabilities (as I said before, Googlebot is really intelligent) and a set of limitations. As such, there is an impressive number of ways in which one can trip up the Googlebot and make it impossible for it to index your content. Alternately, the Googlebot can index your site well, and then people will find it when searching for words it contains.

This article will try to teach you all the basics necessary to achieve consistency and persistency in Google, starting with the very basic step: getting indexed by Googlebot, Google's indexing robot.

1. Read Google's own Webmaster Guidelines

The people behind Google seem to have two main things down to a science: One, most of their algorithms (rules) are so secret, that all us non-Google employees do is speculate. Two, their guidelines are very simple, direct and precise. Following their guidelines will never hurt your site's ranking. Disregarding their guidelines can and probably will hurt you in the long run. So go to http://www.google.com/webmasters/guidelines.html and read what Google has to say about itself.

2. Have text links.

Make every single page on your site accessible via a text-based link, as opposed to Javascript, Flash, DHTML (Dynamic HTML), etc. Googlebot's native language is text.
Google says: "Make a site with a clear hierarchy and text links. Every page should be reachable from at least one static text link."

This is probably the number one key to your site's existence in Google. Googlebot is actually a robotic, browser-like software, based on the venerable Lynx browser. The reasoning behind this approach is that the creators are trying to get as close as possible to emulating human browsing, making sure your website is actually human friendly. Consequently, by downloading Lynx on your computer and looking at your site through Lynx (http://lynx.isc.org), you will see more or less exactly the information Googlebot can read and index and the links Googlebot can follow. You will also see HTML errors on your pages and places where a robot would be stuck and could not reach the rest of your site.

I know it is very unfair to those of us who understand and love the potential of websites built completely in Flash, or other engines. However, until the nice folks who run Google figure out a good way to crawl inside a Flash file and extract the appropriate information, we are stuck with standard HTML.

This is not to say that you cannot make your site really pretty and fill it with Java Script and Flash eye candy. But you must have regular text and standard text links. Usually you can achieve the desired effect by having extra navigation menus based on standard text links.

3. Avoid frames.

Avoid frames at all cost. If you must use them (for example to make someone else's page look like it's part of your site), do not use them on your front page.

Frames are like the plague, they sneak up on you. It is incredibly easy to lose Googlebot's tracks inside a badly formatted frameset. You might hear that some of the robots, including Google's Googlebot and Yahoo's Slurp are quickly gaining capabilities to go inside frames properly. My philosophy is, until a feature becomes ubiquitous, if you're uncertain, leave it in the closet.

4. Keep the number of links on a given page less than 100.

This comes straight from Google's Webmaster Guidelines: "Keep the links on a given page to a reasonable number (fewer than 100)."

This looks more like a suggestion and I am not 100% sure if you get penalized in any way or if Googlebot just stops reading your links after 100. I can however tell you from personal experience that I tried a page with 700 links and it seemed fine. Then one day I tried to view the page from my Blackberry PDA and I got this strange error message saying my page is illegally formatted. After I split the page into several ones with 80 links each, the pages worked on the PDA also.

Who cares about the Blackberry? Well, if you're reading this and your goal is to get visitors, then your main concern should be not to alienate anyone. Remember, today more than ever, people use different devices and different software to access the web. Every visitor is a potential customer. Every employee at a major US lawfirm and many other corporate people use a Blackberry.

Lastly, why would you need that many links on one page anyway? Let's say, for example, that you specialize in promotional products - corporate branded gifts, such as pens, caps, mints and other products (called sometimes 'premiums') imprinted with one's logo. Your name is John Doe, and you decided to name your company JDPromos (not very imaginative, but will do for our examples). You would want to have every item in your catalog as a text link, so every item gets indexed as a link and as a keyword. Also, those who run forums, ezines, blogs, might want to have standard links to their articles, as the software they use might create dynamic links, invisible to certain robots.

5. Give every page a meaningful title.

Give every single page on the site a complete and meaningful title. This is also directly from Google's Webmaster Guidelines. See Rule #1.

Incidentally, for those who are fascinated by the debates on the death of the Meta Tags, the

<title></title>
tag is not a Meta Tag, but a required element for every page.

The "title" tag is supported by every web creation tool out there, and goes in the header of a web page (between the "head" and the "/head" tags).

Google offers the 'allintitle' syntax, which lets users search only text that appears in a page title. A lot of people who integrate a Google bar into their websites allow users to get results only by title. There are over 29 million results returned for Untitled Document.

Most of us - myself included - copy and paste template pages, out of the convenience of not having to recreate all design elements from scratch. If you do so, do not forget to change the title.

Make sure your title is not just a list of keywords and that it is related to the actual content of the page. Google can and will check that, before deciding on your page's 'relevance'.

6. Do not place important text inside images.

Google says: "Try to use text instead of images to display important names, content, or links. The Google crawler doesn't recognize text contained in images."

It is very tempting to create images with text inside them, for the very simple reason that as designers, we are not limited to the very few font (type) options that basic HTML allows. Also, different browsers tend to display things differently nowadays, so it is much easier to create a text image, which will be shown consistently and not worry about styles, operating systems, etc.

7. Use descriptive "ALT" tags.

The "ALT" tag is used as a text alternative (hence the name) for images and image links and was designed so that text browsers (such as Lynx) do not just display a generic 'Image' for every picture link you might have. If all your links say 'Image', how would a potential visitor know what they are?

Make sure that the text description is meaningful and accurate. Take our promotional items company as an example. Let's say they have a picture of a tradeshow display, as an example of a service they provide outside the ordinary imprinted mint boxes, calculators and keychains. If the "ALT" tag only says "display", that is what Googlebot will see and index. If the tag says something like "example of a tradeshow display design", that is certainly more useful and more Googlebot friendly.

Please note that although the "ALT" tag does count and Google seems to put a high price on this tag, it ranks lower than plain text.

8. Use meaningful descriptions for links

With the risk of sounding like a scratched CD, I'll have to say this again: Whether you use picture links or text links, please use meaningful text inside your tags so that Googlebot can associate that text with that href link.

In other words, let's pretend again that we are designing that website for that imaginary promotional items company we called JDPromos. If you intend to put a link to a set of sample coffee mugs promos, say something like "link to JDPromos samples of branded coffee mugs", not just "coffee mugs", or even worse, "click here for pictures". Never use link text like "read more" or "go here" or "download it", "click here", "don't click here", you get the picture - I hope.

Don't try to fool the Googlebot with hidden links or duplicate content or irrelevant pages of words like "sex" and "hot girls." The Googlebot doesn't like being played and you will be penalized, one way or another, in the long run.

9. Use a "description" tag for every page

Include a

<meta name="description" content="[insert your site's description here]">

tag in your page header to summarize your site. Use a meaningful one or two sentence description, do not keyword spam.

Even better, include descriptive text on the site's front page where users can actually read it. This text will appear as the description for your site in Google results.

Place more important content higher in the page than less important content in a page, Google does categorize text on a page based on it's position, text at the bottom of a page is considered less important, or 'relevant', to use one of Google's own terms.

10. Use short query strings

Use URLs with query strings sparingly, if at all possible. Query strings are also called dynamic pages. You can usually recognize dynamic pages by the presence of the "?" character. Keep in mind that the shorter the list of query string parameters, the better. Be aware that not every search engine robot can crawl dynamic pages as well as static pages. It helps to keep the parameters short and the number of them few.

11. Never use the "&id=" parameter

If you must use query strings, or dynamic pages, never use the "&id=" parameter as part of the string.

I know this might sound ridiculous, as it might be hard or impossible for you not to use the "&id=" parameter, but if you are a programmer and you can change the variable's name, replace "id" with something else. Otherwise, Googlebot will just skip that page.

Google says: "Don't use "&id=" as a parameter in your URLs, as we don't include these pages in our index."

12. Use robots.txt

Use robots.txt to show the Googlebot around your site. This ancient and very standard mechanism for directing well-behaved robots like the Googlebot will allow you to specify places where the robot is not welcome, whether for privacy reasons, or for reasons of avoiding Google penalties. You might want to keep the robot away from your cgi-bin directory and other places you maybe don't want available to the entire searching population of the globe. Remember this is a guideline, not a barrier, robots that are not programmed to comply, will disregard. Bottom line, use the robots.txt to guide Googlebot, but not to enforce strict security.

Google says: "Make use of the robots.txt file on your web server. This file tells crawlers which directories can or cannot be crawled."

13. Make a sitemap

A site map is just a page on your website where you guide your users through the structure of your site. The most basic form of sitemap is a page that lists all of your pages, with a brief description and a link - all text, of course. When you make the sitemap, follow all the rules above and don't forget that the purpose of the sitemap is to guide your human visitor.

Google says: "Offer a site map to your users with links that point to the important parts of your site. If the site map is larger than 100 or so links, you may want to break the site map into separate pages."

14. Use the Google Sitemaps project

At the time of this writing, the fastest, best and most accurate way to make sure your site is properly crawled and indexed by Googlebot is to participate in the Google Sitemaps project.

In a nutshell, you make a sitemap as an XML page and submit it directly to Google. Google then sends Googlebot to index your site. Besides the speedy free submission, you also get a good amount of statistics and the opportunity to fix potential errors in your site.

Please note that the XML sitemap needed for the Google Sitemap project is intended specifically for Googlebot, and is different from the sitemap described in the previous Rule, which is intended solely for human users.

Also, do not be afraid of XML, Google's sitemap is a very simple text file and they give you all the necessary information and directions at: https://www.google.com/webmasters/sitemaps

Good luck!

---
Andrei co-owns Bsleek - a company that specializes in web design, hosting, promotional items, printing, tradeshow displays, logos, CD presentations, SEO and more. Andrei has amassed an extensive technical knowledge and experience through his career as the CIO for a major travel management company and through his past careers in military research, data acquisition and airspace engineering. He also consults for Trinity Investigations, a New York based PI firm.



---
Bsleek - Redefining cheap web hosting [http://www.bsleek.com/hosting/]

Article Source: http://EzineArticles.com/?expert=Andrei_Smith



Getting Google to Index Your Website

When you first build a website, it can take anything from several days to weeks and months for search engines to include its pages in their indexes. Certain engines, such as Yahoo!, offer a premium service for sites to appear more quickly, whereas Google has no paid version for site inclusion.

Google's crawler is a process known as GoogleBot, which follows links on sites to discover new content and understand the connection of any given page to the rest of the Internet. GoogleBot then updates Google's indexes accordingly, and once the spider has visited your site, there's nothing you need to do to keep it coming back. GoogleBot will decide based upon your site's size, links and frequency of changing content how often to revisit to check for updates.

Contrary to common belief, there's actually no way to submit your site to Google. Even though they offer a "Submit URL" form, there's no evidence that it causes GoogleBot to view your webpages or has an impact on their crawling whatsoever. But although site submission is not possible, the simplest way to get Google to notice you is to have an inbound link from another site already in Google's index. This works because GoogleBot follows links it discovers, so if you can develop inbound links to your site, it will follow the trail and read your pages.

Another useful tool once you are indexed is Google Webmaster Tools, which allows you to register any site you administer, and provides statistics about how GoogleBot views your site. This can help uncover common problems, such as broken links, missing pages, malware and other issues that can cause your pages to lose rankings in the search engines.

James Beswick is the author of "Ranking #1: 50 Essential Tips to Boost Your Search Engine Results" available on Amazon.com at http://www.amazon.com/gp/product/1452849900?ie=UTF8&ref_=sr_1_1&s=books&qid=1279571154&sr=1-1&linkCode=shr&camp=213733&creative=393177&tag=ekcy-20.

For more information, visit http://ranking-number1.com.

Article Source: http://EzineArticles.com/?expert=James_Beswick


Article Source: http://EzineArticles.com/4713221

Saturday, October 15, 2011

Surviving Google's Aging Delay

Google has always been the search industry's innovator and that's just what Google's aging delay symbolizes, the evolution of search innovation... yet another significant step forward for Google.

Google's success as a search engine can undeniably be attributed to its ability to consistently return the most relevant search engine results. That's what kept the search giant on top of the pack and leading the multi-billion dollar search industry & that's what's going to keep them there!

Now that said, is it any wonder why Google incorporated the infamous aging delay into their ranking criterion? The simple truth is, Google's aging delay is a full frontal assault on artificial link inflation.

With the induction of multiple clever off-page reciprocal-linking strategies engineered to artificially inflate link popularity and PageRank, Googles aging delay wasn't only necessary and long overdue; it was the next logical step in the evolution of search.

The confusion and misunderstanding of the aging delay among site owners is nothing short of amazing. Many of my clients are confounded because their new sites are well positioned in Yahoo, MSN & the other large search engines while they're site is no where to be found in Google's search engine result's pages (SERPs)... except for perhaps on the most obscure search terms.

Current and unconfirmed speculation has been misplacing the blame on Google's 'sandbox' effect. While this is a possibility I believe it's also highly improbable.

The sandbox holding period is typically anywhere between 90 to 120 days, the aging delay appears to be much longer. I've seen new sites delayed for up to 6-8 months.

The premise of the sandbox delay theory suggests that new sites are being penalized for gaining too many links too fast. To date I haven't seen a scrap of evidence to support that claim.

The sandbox theory is further disproved by the fact that newer sites engaged in procuring relevant links experience the same delay in climbing Google's SERPs as other new sites utilizing scores of purchased text links. This lends credibility to my thought that new sites are not being penalized on the premise of acquisition or quantity of inbound links and; supports my theory that it's the reciprocated links that are being delayed by an aging filter.

It just doesn't seem 'reasonable' for Google to penalize sites for acquiring legitimate directory listings & building an optimized reciprocal link based network. In my opinion, mainstream SEOs are confusing the existing sandbox effect, with Google's new 'aging filter' that arrived on the search scene earlier this year.

It seems more likely that Google's aging filter is weighing the 'maturity' of inbound links and not the new site itself. Meaning that in addition to the traditional ranking criterion, the age of a sites inbound links are also now considered.

My own theory is that newly acquired inbound links are placed on a 'probationary' status until they've 'matured' before they're considered. For example, a new and relevant inbound PR 6 link would not be given the same weight or consideration as a 'grandfathered' PR 6 link until the aging delay expired.

By placing newly acquired links on a probationary period and delaying the ranking of newer sites Google has effectively offset the instant free ride to the top of their SERPs. Purchasing volumes of brokered links to that end is now a moot point. After all, your site will still be delayed regardless of the amount of links you purchase and you won't see any return on investment (ROI) for at least 6 - 8 months.

Existing Site owners interested in immediate (ROI) are now strongly motivated to build new pages or expand existing sites in order to avoid Googles lengthy aging delay. With the 'all-the-rage' mini-network strategy shifting to more of a long-term commitment it seems likely that's exactly what will happen!

Whether by clever design or not, the only alternative to riding out the aging delay that produces immediate results in Googles SERPs is to advertise through Google's AdWords Program. So it seems that Google's solution vis-à-vis the aging delay has turned out to be an excellent vehicle to promote Googles own AdWords Program as well. Hmm...

How do you survive Google's aging delay? By taking pro-active action!

I haven't seen any new sites with new domains appear at the top of Google's (SERPs) since early to mid 2004. I've consulted with and tracked many of my clients' new sites and despite the fact they have hundreds of #1 positions in Yahoo, Alta Vista, AllTheWeb & MSN for their keywords I've yet to see any remarkable results in Google until the 6 - 8 month period.

The trend I've noticed suggests that new sites are initially indexed; ranked accordingly in Google's SERPs for a week or so and then literally vanish from the SERPs for several months. In most cases they can't even be found with the most obscure search terms... including their own name and address.

If you're launching a new site don't panic. Once you've registered your domain name and configured the hosting, you should set up a few temporary pages. Obtain links to them from other sites in Google's index to start the aging delay count down. I recommend launching a site immediately with enough content to set up and facilitate the requirements for directory listings to start. The sooner Google is aware of your domain the better. Just don't hold your breath waiting to see results... It could be as long as 6 - 8 months!

Gauge your optimization efforts by where your site ranks in the other search engines. Provided you're not engaged in unethical practices and followed Google's Webmaster's guidelines this should give you a ballpark indication of where Google will rank your site after the aging delay, just be patient.

To that end, don't keep tweaking and changing your pages source code and trying to manipulate your rankings until your site has been in Google's index for at least 6 - 8 months. In other words, there's no need to reinvent the wheel here because it doesn't seem to matter what you do, your site will still be delayed regardless.

Don't keep submitting your pages to Google either! It won't make any difference.

Check your server logs to confirm Googlebots' crawl and then forget about it. Googlebot will find your site again if you're actively reciprocating links so your time would be better served building an optimized reciprocal link network to get your site out there and linked to as early as possible.

If traffic from Google is crucial to your marketing and promotion plan, and I don't know anyone who would argue otherwise... budget to run an AdWords campaign for a few months until the site is indexed and positioned. You might even consider running an Overture campaign as well!

If you purchase non-directory links, reallocate that budget to Adwords advertising.

It's pointless to purchase links when you can invest in an AdWords or Overture campaign. Purchasing links is an investment you won't see a red cent ROI for at least 6 - 8 months while an Adwords campaign will drive targeted traffic to your site that can convert immediately. Keep in mind that Lycos, HotBot, AOL, Ask Jeeves, Iwon, Netscape & Teoma also receive paid results from Google! MSN, AllTheWeb, AltaVista & Yahoo receives paid results from Overture.

Do other search engines have an aging delay?

Google provides primary search results to other search engines. It only seems reasonable to expect that your site may be delayed in Google's partner sites as well.

One-way you may be able to work around this, and I can't emphasize this enough; is to make sure you submit your site to DMOZ, the open source directory. Google, in addition to the other major engines, receives directory results directly from DMOZ.

Yahoo and its partner sites don't seem to be utilizing an aging delay, nor does MSN, so focusing your early efforts on these search engines might give you a competitive edge in the Yahoo network.

At the end of the day when it's all said and done surviving Google's aging delay is just a matter of time. The days of purchasing instant link popularity and PageRank are over and in due course you will see Google give your site the recognition it deserves.

Copyright 2005 Lawrence Deon

Lawrence Deon is a Search Engine Optimization/Marketing Consultant, Author and Developer of the popular search engine optimization and marketing model Ranking Your Way To The Bank. http://www.rankingyourwaytothebank.com

Article Source: http://EzineArticles.com/?expert=Lawrence_Deon



Sunday, August 28, 2011

Why Backlink Indexing Is Crucial To Your SEO Efforts

If you have created a large number of back links, and you cannot figure out why they are not improving your ranking with the big search engines like, Yahoo, or Bing, and Google, the reason is you need to do some backlink indexing to get the crawlers to see, and identify your links. You can set up links all day, but if you are not getting them indexed to where they draw attention, you are basically wasting your time.

The odds are only one in every two hundred of your links being found without any index help. Those are not good betting averages. You are going to have to be submitting those connector links day and night to win at those odds.

Everybody that has a web site on the internet that provides a service, or sells a product needs to have their site ranked in the top ten of the possibilities. So you want your keyword to cause the search engines to put your name on the top of their list, every time someone types it in. You started using search engine optimization to get this done.

Search engine optimization has taught you that by providing back links from other sites you can increase your popularity with the big guns, and inch your way to the top spot. You know that the proven method is to get some links on relevant sites, and to make sure everyone you have out there is still working. You are doing everything right, you just do not seem to be increasing your ranking. Why?

You simply need to get all of those you already have out there indexed so they start pulling the traffic to you. Sometimes the crawlers from the search engines need to be directed to your links. The SEO was right about you needing to direct that traffic your way in order to get those rankings up. They just did not tell you that you might have to give the links a little help to get recognized.

You can go to online web sites to find options for ways to get your links indexed. You will find that the price of the different things you need to get this job done is relatively inexpensive, when compared to the traffic it will provide to your web pages. Most of the things designed to help with this problem are simple to put to use.

The big search engines like Yahoo, and Bing, and Google, all have some sites that they go to on a more frequent basis than others. The options you will find for indexing your current links will allow you to get them noticed when the crawlers go to their most frequented sites. You could just create new ones to put out there, but you already have at least one hundred and ninety nine out of every two hundred that are just sitting there. To put your investment to work you need to make those get noticed. Backlink indexing can get you the recognition for the hard work you have already done.

Do you want to know exactly how to solve your problem with a low backlink indexing rate?

Visit http://www.ryantmalone.com today to find out.

Article Source: http://EzineArticles.com/?expert=Ryan_T_Malone