Drupal Path module SEO problem

J. Cohen's picture
public
J. Cohen - Sat, 2008-06-28 22:08

If using the Path Module (which is standard for SEO), the URLs become case insensitive even on *nix servers. That means you can get duplicate content if people accidentally link to versions of your URLs with different cases.

Example:

  • drupal.org/handbook
  • drupal.org/hAnDbOoK

Just something to watch out for until it's fixed.

I added an issue here:
http://drupal.org/node/276201


perhaps globalredirect can help

greggles's picture
greggles - Sat, 2008-06-28 23:49

Maybe we should add something to globalredirect so that it will do a 301 to the "right" version of the url...

--
Open Prediction Markets | Drupal Dashboard


global redirect

J. Cohen's picture
J. Cohen - Sun, 2008-06-29 00:26

Good idea... that way when the Path Module is updated, any incorrect links wouldn't break.


Right...

NikLP's picture
NikLP - Mon, 2008-06-30 13:54

So, I think I understand this, but what, if anything, is the temporary workaround?

Web Development in Nottingham, UK by Kineta Systems


path module workaround

J. Cohen's picture
J. Cohen - Mon, 2008-06-30 21:10

To keep things simple for me and visitors, I generally only use lowercase letters in the URLs. The only problem would be if someone linked to different case URLs either in internal links or from external sites.

I think this would cause the most problem on larger sites.

I worked on one Drupal site where they had created views that had weird arguments something like example.com/page/Tag1+Tag2

There were a lot of users so imagine if users started creating links to
example.com/page/tag1+tag2

(or vice versa)

The paginated URLs keep the case of the first URL, so example.com/page/Tag1+Tag2 would have two URLs with different cases, and so would every paginated version:
example.com/page/Tag1+Tag2?page=25
example.com/page/tag1+tag2?page=25
etc.

on a large site with a lot of pagination that could be a significant SEO problem.


Well...

NikLP's picture
NikLP - Tue, 2008-07-01 09:03

I suppose this is only a problem if people are creating links manually, right? I would think that this does minimise the likelihood of these links being created, but I see where you're going with that.

Perhaps it's time to look at the linking system again? I have been told that Flexifilter can't handle inline-ing things like node references and image fields, which is a pain, but perhaps there is another way to achieve this? I think that would go a long way towards standardising any/all internal links?

Web Development in Nottingham, UK by Kineta Systems


I suppose this is only a

J. Cohen's picture
J. Cohen - Tue, 2008-07-01 22:02

I suppose this is only a problem if people are creating links manually, right? I would think that this does minimise the likelihood of these links being created, but I see where you're going with that.

I think using all lowercase reduces the chance of people typing in URLs in the wrong case -- no need to remember which case to use.

I would tend to type in example.com/about from memory instead of example.com/About. Maybe that's just me though...


Well...

NikLP's picture
NikLP - Tue, 2008-07-01 23:06

... yeah of course I agree, but I had assumed we were talking about a scenario where "content editors" were not the main cause of the problem. On social sites, and indeed on any third party site, this is a potential issue (assuming you're correct in what you're saying - it's new to me but I'm not arguing) so I was really just referring to "users" as opposed to "editors", if you follow me? :)

Web Development in Nottingham, UK by Kineta Systems


experiment

J. Cohen's picture
J. Cohen - Wed, 2008-07-02 03:32

I'm going to test something with Google and URLs. I'll post the results here in a few days if I find anything interesting.