14th Colony

Wed 19 November 2008

14thC: Membership has it's privileges - Register Free


Subscribe via email!

Get the latest 14th Colony content delivered to your inbox. Just enter your email address:

 Subscribe in a reader


Subscribe with Bloglines Add to Technorati Favorites



Search 14th Colony


Advertise on 14th Colony!


Spread your message!


Reviews Articles

Let’s go for a sphinn

Randall McCarley
by Randall McCarley
July 17th, 2007

This is one of those posts where I’m not entirely sure why I’m writing it. I mean, odds are if you read this blog you already know about Sphinn the new brain-child of Search Engine Land’s (and general SEO Guru Supreme) Danny Sullivan aka 3rd Door Media. But in case you missed the announcement on any of the other 200 blogs commenting on it, here is what you need to know.

Sphinn is another Pligg-powered social bookmarking site that is a bit different in that a great deal of care has gone into making it look good and act a bit cleaner than most Pligg sites. It is also geared for people that normally get knocked as spammers at the mainstream social sites like Digg and reddit: marketers, SEOs, SEMs, etc.

People like you and me.

The branding matches Search Engine Land and a link to SELand is built right into the navigation. Using the site is the same as any other SB site - generate an account, submit your favorite stories and vote for others.

I have to admit I was a bit reluctant to get into Sphinn. Do we really need another SB site? Do we really need one filled with spammers SEOs?

I think the answer is a big, loud “yes”. While I never actually spammed the other SB sites just by being an SEO I was looked upon with suspicion and scorn. Getting stories to *pop* is a pain even when they are clearly deserving. It’s nice to hang out with peers that really “get it”… whatever “it” is.

Plus it’s nice to see some old, familiar faces. I don’t get around the forum and blog circuit as much as I used to and it seems everyone is now on Sphinn no matter what area of the internet they usually hang out in.

Sphinn is bringing us all back together.



Warning: WordPress permalinks and 301 redirects are not the same thing!

Randall McCarley
by Randall McCarley
July 13th, 2007

Say you have a couple hundred (or thousand) pages indexed in Google and you decide to change how the link structure of the site will be handled. How do you do this?

The best way is to 301 (permanent) redirect the old URLs to the new ones but this can get tricky. There are tutorials (Apache | IIS). But if you aren’t familiar with your web server and are not a programmer (like me) you may be in over your head (again, like me).

The 301 redirect is important because it lets the search engines know that the same page has been “moved” to a new location. This helps you avoid duplicate content penalties and keep the link juice, or PR, that you already accumulated for those pages. It also prevents viewers from clicking links to your website that turn into “404 - page not found” errors.

Put simply, a 301 redirect helps you keep the rankings you already earned.

It also keeps your pages in the index and lets the search engines transition to the new structure without dropping your whole site and re-indexing it from scratch.

At least, that’s the theory.

I’ve been working on a redesign for 14th Colony and part of the redesign is to make the site more search-engine friendly. I’ve learned a lot since this site was launched and want to put that knowledge into practice here like I already do on my client sites. One of the changes I want to implement is the link structure because having keyword in your URL (instead of a generic post number) helps with rankings. And on the advice of ex-Googler Vanessa Fox, I’ve decided to do the link structure update before I push the redesign.

But this site is a WordPress blog and WordPress offers something called “permalinks” that will automatically change your link structure for you. Permalinks are a type of URL designed to refer to a specific information item and to remain unchanged permanently, or at least for a lengthy period of time to prevent link rot.

At least that’s the definition I got from Google which pulled it from Wikipedia… and don’t get me started on Wikipedia again!

The way permalinks work in WordPress is you go into your admin panel, select Options > Permalinks and set the structure how you want. Easy, right?

Almost too easy.

I was getting ready to change the permalink structure when this little voice in the back of my head asked if permalinks 301 the “old” pages to the new. I asked around and didn’t get a clear answer. I did some research and saw a few posts about how long it would take Google to reindex a WordPress site that changed the permalink structure.

I sensed danger.

Sure enough WordPress will change the link structure but that does not include the very important 301 redirects needed to keep your link juice and prevent duplicate content penalties!

After more research (and trying to decide if there was enough value in what I’d learn by just flipping the switch to see what happens) I found a couple WordPress plug-ins that 301 redirect your site to the structure you want. The plug-in I decided to implement is by fucoder.com tittled Permalink Redirect WordPress Plugin.

I picked this particular plug-in because it was recently updated, easy to implement and also supports redirecting to FeedBurner which is an added bonus. It also had the best “total coverage” of what the 301 would do.

How easy is it to use?

Upload the file, activate it and if you use FeedBurner plug in your URL for that and you are done. This plug-in automates whatever the WordPress permalink setting is into a 301 - how cool is that?

It is important to note there will be a period of fluctuation while Google and the other search engines update their index to replace the URLs. Sometimes this process runs smoothly sometimes not. There is always risk when it comes to making major changes to your website but by using 301 redirects you can minimize that risk.

To speed the process along I suggest using this Google Sitemap Generator and then rebuild and resubmit the map after you change your permalink structure.

For the record, changing an existing site structure should really be avoided and if you can set the link structure up correctly to begin with you’ll avoid these headaches all together. The format I suggest is: http://www.domain.com/catagory/posttittle

If you are concerned that post titles may duplicate add a date function at the beginning like: http://www.domain.com/date/catagory/posttittle

Finally, another advantage of fucoder’s plug-in is that is makes sure a trailing slash gets removed and redirected. This is a small thing but I often see duplicate pages in my logs - one with the slash the other without, and this plug-in will clear that up. So even if you have an existing WordPress site with permalinks this plug-in can help you out.



Google - Wikipedia Connection Problematic

Randall McCarley
by Randall McCarley
July 2nd, 2007

Wikipedia is an online collaborative effort where anyone can edit articles to create a resource where “every single person on the planet is given free access to the sum of all human knowledge”. Wikipedia has always had problems but as time goes on these issues are getting more important – even dangerous - because Wikipedia dominates Google’s search results.

About Wikipedia

Wikipedia was launched January of 2001 by Jimmy Wales and Larry Sanger as part of Bomis – an Open Directory clone, web ring and porn site - as an offshoot of the Nupedia Project. Nupedia was a peer-reviewed online encyclopedia made up by qualified contributors. The problem with Nupedia was the time to launch new articles. Programmer Ben Kovitz suggested using a wiki format to speed up the process. This new format would be less formal than the original Nupedia project.

In June of 2003, Wikipedia moved under the new Wikimedia Foundation, a non-profit company whose vision is “to bring a free and accurate encyclopedia to every single person on the planet”. The Wikimedia Foundation now hosts several other open wiki projects including a dictionary and thesaurus. Wikimedia has less than 10 employees with the majority of work completed by volunteers.

Wikipedia is now in 100 languages and boasts more than 5,300,000 articles with over 75,000 active contributors.

Wikipedia: All the knowledge you wish you knew if it were true

Wikipedia has attracted media attention for various controversies. While generally quick to address these concerns some issues remain, many at the core of what Wikipedia actually is.

Lack of Authority

Because Wikipedia is open, anyone can edit any entry. Articles are written and edited by non-credentialed “experts”. Articles are open for attack (vandalism) and often the articles fail to cite sources or they cite sources that are not the best available.

Wikipedia inaccuracies are cited on many authoritative websites like answers.com. Having Wikipedia content on these sites adds credibility to false information.

Wikipedia has problems with disproportionate reporting and quality. Current events are covered in detail while important historical and scientific information does not get the coverage deserved. Some topics contain only “stubs” or brief definitions of the topic with incomplete content. Because the contributors to Wikipedia are not necessarily experts, the information presented may be out of context misleading the reader.

Wikipedia content is slanted because it can only be created by people that have the time, care about the topic and understand how to operate within the Wikipedia technical and cultural rules. Opposing viewpoints by people too busy or apathetic are not included.

Wikipedia articles change over time. An article you link to may be the target of vandalism or in the process of a rewrite. If you cite a Wikipedia article, that page may change and the citation will not make sense!

No Accountability

In addition to the lack of authority, Wikipedia also faces the issue of lack of accountability. This issue is a ticking bomb with two parts:

1. Anonymous Editing

Anyone can edit an existing entry on Wikipedia. This has led to several scandals including vandalism and hoaxes that could be liable. According to Jason Scott of textfiles.com, the entry for George W. Bush spends 80% of its time a victim of vandalism. While membership is required to start a new article, anonymous users can edit existing entries without logging in. In place of a member ID the user’s IP address is logged. IP logging is an extremely limited security measure easily countered with a proxy server that redirects your IP with another one that is difficult to trace.

2. Legal Issues

The content on Wikipedia falls into a legal gray area in which the courts may eventually create legislation that affects a broad range of websites, especially those that rely on user-generated content.

The problem is that subjects of articles may resent the information posted about them. That information may be false for various reasons including vandalism and human error. Because the content of Wikipedia is aggregated on several authority websites, it is often assumed fact. Living down a bad biography may be difficult to do.

In the case of vandalism who is to blame for bad information? The person that posted bad content or Wikipedia? Consider that Wikipedia allows anonymous editing and that even members can hide behind screen names – sometimes multiple screen names - making them difficult to track down. In addition, Wikipedia appeals to an international audience so even if the person behind the attack is discovered they may be beyond the reach of domestic legal action. The law is unclear on whether Wikipedia itself is actionable in these cases or not.

Wikipedia has a policy of correcting libelous articles. Wikipedia has put in systems for detecting vandalism through both automated and human methods but some bad posts have taken months to catch and correct. An error on Hillary Clinton’s biography was not discovered for 20 months. During that time over 4,000 other edits were made to her biography!

Even Wikipedia’s own internal tests have proven the system for catching errors is lacking. Individuals are not supposed to edit content about them as it violates the Wikipedia “neutral point of view” rule that is supposed to ensure articles are balanced.

Wikipedia has a problem with copyright laws. Wikipedia Watch estimates as much as 2% of all Wikipedia information is plagiarized. Wikipedia Founder Jimmy Wales has admitted many of the images on Wikipedia have been incorrectly labeled “fair use”. Unfortunately, Wikipedia does not have any active programs for discovering and removing illegal content.

People who request removal of their biographies are often denied. The oddest case of this is perhaps that of Daniel Brandt a former member of the Wikipedia Foundation’s Board of Trustees who requested his biography be removed several times with no result. If this is how Wikipedia treats its own contributors, how do they treat outsiders without influence in the Wikipedia system?

In a CNN interview John Seigenthaler, a writer and journalist, raised the issue of increased government regulation online based on his own experience with vandalism on his Wikipedia biography.

“Can I just say where I’m worried about this leading. Next year we go into an election year. Every politician is going to find himself or herself subjected to the same sort of outrageous commentary that hit me, and hits others. I’m afraid we’re going to get regulated media as a result of that. And I, I tell you, I think if you can’t fix it, both fix the history as well as the biography pages, I think it’s going to be in real trouble, and we’re going to have to be fighting to keep the government from regulating you.”

Wikipedia mirror sites can hold deleted or altered articles in public view making removal of false information even more difficult.

Google: Popularity over quality

Google has a special relationship with Wikipedia that placed Wikipedia results in the “one box” integrated results above the organic search results. Google recently shifted the Wikipedia results from the one box to the first position in the organic results.

Why does Google lean on Wikipedia so much? This makes some sense when you compare Wikipedia’s vision to bring a free and accurate encyclopedia to every single person on the planet with Google’s mission to organize the world’s information and make it universally accessible and useful. Wikipedia contains a lot of free information and Google provides a lot of viewers – over half of internet users worldwide use Google for their search.

Where things get confusing is Google’s premise of relevance. Wikipedia admits that they are not to be taken as an authority despite the title of “encyclopedia”. They claim instead that Wikipedia articles should be used as an overview or starting point for research.

By its own rules, Wikipedia does not create original content instead relying on content cited from other sources. The issue of reliance gets more confusing as Google has been diligently working to remove competing search results from their results under the guise of relevance. Since Wikipedia is another intermediary that passes viewers to more relevant source documents Google’s stance on this issue is a bit fuzzy.

Perhaps Wikipedia is the new DMOZ for Google. DMOZ, also known as the Open Directory Project is a human-edited directory of websites. Google has shown a lot of faith in DMOZ, cloning the ODP for its own Google Directory and sometimes using the descriptions DMOZ editors assign to websites in the search results pages instead of the descriptions websites set for themselves.

DMOZ has fallen victim to alleged corruption along with internal problems and was shut down for several months. It has since reopened but the faith in DMOZ seems to be broken by the internet population and Google.

Whatever the reasons are for Google’s reliance on Wikipedia, it does point out a fatal flaw in the Google system: popularity over correctness. People familiar with how Google works understand how important backlinks are as a voting mechanism. The websites with the most links get to the top of the search results even if their content is not the best. That same philosophy of determining relevance by popularity is also seen by promoting Wikipedia even though the information is unreliable.

More special treatment by Google for Wikipedia is noted by Tom C on YOUmoz and Russ Jones notes that 96.6% of Wikipedia Pages Rank in Google’s Top 10.

Google’s special treatment of Wikipedia is likely coming to an end but there are reasons Wikipedia has traditionally done well in Google’s organic results.

Wikipedia’s site structure makes the site easy to crawl for Google’s bots. There is plenty of cross-linking between articles with keywords used in the anchor text. Wikipedia nofollows external links boosting their internal PR flow. The one box placement helped Wikipedia get media attention and more backlinks.

Finally, Wikipedia handles detractors by naming their websites as “uncredible” which disqualifies those sites from being listed in Wikipedia, limiting their exposure. In instances where ignoring those sites is not feasible Wikipedia often lists unanchored text URLs instead of directly linking to the sites in question.

The next evolution of Wiki: Search

Jimmy Wales has announced a search engine as part of the Wikia initiative. Wikia is a for-profit company that Wales leads and is technically not associated with Wikipedia though a lot of the talent overlaps. The Wikia search engine would be in direct competition with Google and assumedly be the default search engine for Wikipedia, or at least gain significant exposure through the Wikipedia community.

Wikia search will be open source with the algorithm freely available for all to see. Google gained a lot of its initial traction by appealing to the open source community.

Conclusion

The Wikipedia concept is ambitious and idealistic. Wikipedia contributors follow a noble pursuit and work hard to make sure the best possible information is presented. The culture and infighting within the Wikipedia community speaks volumes about the passion of the editors.

Unfortunately, that does not solve the problems Wikipedia faces or make the content worthy of its current standing in our global internet culture. Wikipedia is exactly as it claims to be if you look at their disclaimers: a great place to start your research or get a general overview of a topic but not the same caliber resource expected from an encyclopedia.

The best practice for dealing with Wikipedia is to keep it in perspective. The information there is just as good as the information you get from your neighbors. They may be brilliant nuclear physicists or just full of crap – you just don’t know without doing a bit of real research on your own.

Resources:



SEO Fast Start: Introduction to SEO with Great Advanced Tips

Randall McCarley
by Randall McCarley
May 31st, 2007

Dan Thies has updated his groundbreaking Fast Start Guide to SEO - A simple, step-by-step system for better results. Even better - he is giving it away for free.

Anyone considering search should read this book. In fact, anyone already in search should read this book.

Dan does a great job of organizing the magic behind the search engines in a way that makes sense and demystifies the process. And Dan has the experience to know what he is talking about.

Dan has been around in the Search (and marketing) industry for over 6 years and earned a solid reputation for accurate information and being a generally nice guy. He speaks at SES conferences, teaches, mods for High Rankings and is a member of the SEOPros executive committee. Dan also wrote the Search Engine Marketing Kit for SitePoint.

So anything this guy puts his name on is going to be not just good… but great.

I’m only on page 25 and already feel compelled to give SEO Fast Start a bit of link love and a glowing review. I’ve been around a while and already picked up a couple neat tips and observations.

So now, go get the book and thank me later!



Lessons from Vista Babble

Randall McCarley
by Randall McCarley
March 19th, 2007

Vista Babble on Microsoft's websiteFirst, a note of congratulations to my friend Donna. Donna has been a wealth of inspiration and knowledge in the forums for years. She helped me get started with SEO and for that I will always be grateful. One of her latest ventures is Vista Babble, a forum dedicated to the new Microsoft OS and it is doing great!

As of this writing, there are over 2,000 members and 1,000 threads. Vista Babble is a “Top 100 Windows Vista Site”. And Vista Babble even picked up a link from Microsoft’s own Vista support pages.

Rumor has it that some really big things are coming for Vista Babble and that it is going to become an even more important part of the Vista community.

Congratulations Donna on all your success with Vista Babble - you definitely deserve it!

Now some lessons for the rest of us!

A couple months ago Rand posted Predicting Search Queries Before Demand Arrives where he discusses the tactic of selecting keyword phrases based on anticipated traffic before everyone else starts fighting for them. Donna’s Vista Babble is an excellent example of this in action. By getting in on the ground floor and putting a bit of work into a site before the demand arrives you can establish yourself as THE authority before there is any competition. How great is that?

This isn’t a new tactic and some SEOs specialize in small-scale websites that they expect will bring traffic based on emerging trends like new tv shows, celebrities, etc.

What sets Vista Babble apart is the scale of implementation. Donna didn’t just try to drive some keyword traffic; she thought long-term and created a community around a new theme: Windows Vista. She also put a lot of attention on crafting the look of the site making sure it would stand apart from any sites with a similar idea.

But what if you have an existing website? How can you get ahead of your competition?

I’ve enjoyed some success with “predictive search” at 14th Colony with the article About Social Bookmarks. Social Bookmarking was all the buzz in the forums but nobody had put together what it was all about in one place as a resource. That article got such a strong response I did a follow-up.

Depending on your industry you may need to do some clever thinking to get ahead of the game but the best two tips I have are:

1. Pay Attention - Listen to the buzz in your industry and pay attention to the news.

2. Give it a Shot - The problem with predictive search is that you can’t always be right. Some terms just won’t jump. But you’ll never get the ones that do unless you try!



« Previous Entries   Next Entries »