Views block exposed filters get indexed?
Hello recently I am hectic about my drupal sites seo.
Recently I have created a block in my site with exposed filters and I noticed that googlebot is trying all the filters and sometimes giving me...
"Message Illegal choice in field_age_value_many_to_one element"
should i block the spider crawl those filters?
Thanks
Groups:
Login to post comments

The handbook page Search
The handbook page Search engine optimization (SEO) for Views contains information on how to prevent this, basically the answer is: yes.
Stefan Kudwien
unleashed mind
Thanks for your fast reply I
Thanks for your fast reply I have already read that page but as I stated in the thread
I have a question?
If i use
Disallow: /?
Allow: /?page=
Disallow: /?page=&*
http://www.mysite.com/?page=1 is ok
http://www.mysite.com/?page=2 is ok
but what about
http://www.mysite.com/?page=0
it is the same as http://www.mysite.com/ so it is kind of you are duplicating the frontpage.
Is there a way to eliminate this problem?
In the mean time this discussion seams to be private. I am new to this groups and how can i make this public so everyone can benefit
Block "page=0" like
Block "page=0" like this:
Disallow: /*page=0$There is a more simple way to write the rules in that SEO for Views handbook page. Here' is what the handbook page says:
# Disallow all URL variables except for pageDisallow: /*?
Allow: /*?page=
Disallow: /*?page=*&*
[EDIT: removed mistake...]
You could rewrite it as:
Disallow: /*&That would block all URLs with more than one parameter.
--
My Drupal Tutorials
First Page
Does
Disallow: /*&work on the first page, since the page variable isn't in the url?robots
Do you mean this URL?
/?page=0This will block that one:
Disallow: /*page=0I think it would be better to fix the module so it doesn't link to that kind of URL.
This rule only blocks URLs with more than one parameter:
Disallow: /*&--
My Drupal Tutorials
yes it works on the first
yes it works on the first page.
There is a great tool to test it.
In Google WebMaster Tools page there is "Tools" link click on it then choose "Analyze robots.txt" in there you can find your latest downloaded robots.txt you can find for your site.
If you want to try some new disallow combinations just enter there and in the box below enter the urls to test if it is blocked or not.
in the mean time do you guys
in the mean time do you guys think
Disallow: /frontpageis a good ideaDo you guys think this is a okey robots.txt I have attached a image because for some reason "*" character strips out upon submission
/frontpage
You don't need
/frontpageif you have the Global Redirect Module installed. Global Redirect will automatically redirect it in a search engine friendly way.--
My Drupal Tutorials
well for some reason the
well for some reason the robots also indexing the http://www.example.com/frontpage?field_
?field_ is from the views exposed filter search query so I have included disallow: /frontpage*
Either this: Disallow:
Either this:
Disallow: /*?field_or
Disallow: /frontpageor
Disallow: /frontpage?would work... depending on what other URLs might be getting spidered.
You could grep the logs to be sure -- something like to see what weird URLs Google might be hitting:
$ grep 'Googlebot\/' access.log | grep 'field_' > googlebot_field_.txt--
My Drupal Tutorials
J. Cohen I really appreciate
J. Cohen I really appreciate your feedback.