Dariusz on Software Quality

20/05/2010

Watch your HTTPD logs

Filed under: en — Tags: , , — dariusz.cieslak @

Recently I observed that AdWords-generated traffic dissapeared from Analytics panel. I thought: WTH?

I checked the logs and saw that URL called by AdWords:

http://my-site.com/?gclid=342343445345....

Generated 403 (Forbidden) server response. That was caused by recent change in Lighttpd filtering rules. I was paying for AdWords traffic but customer hit 403 error page. Ops!

In order to easily spot such problems in future I created the following scanner to easily find all error server responses.

awk '$9>=400' /var/log/lighttpd/access.log | less

If you are boring of 404 errors you can filter them out as well (leaving only 403 / 500 errors for investigation):

awk '$9>=400 && $9 != 404' /var/log/lighttpd/access.log | less

I discovered that the following URLs were inaccessible:

  • /robots.txt (exclusion rules for web crawlers)
  • /favicon.ico (icon used by web browsers)

Next step could be automation of this check (cron job that will send an alert if errant responses count is higher than N). It’s left as exercise for the reader.

13/02/2010

Why you should disable “content network” in AdWords

Filed under: en — Tags: — dariusz.cieslak @

Google AdWords is popular marketing tool that allows you to attract customer attention. You “pay per click” – that means if potential customer is directed to your page (or e-commerce site) then you will pay.

There are two main variations of this tool:

  • matching advert is located based on keywords entered by user in Google search
  • matching advert is located based on content of a partner page (AdSense partner)

It seems both methods (keyword and content-based) are equal efficient, but wait: there’s a big difference. When user is searching for some keywords it’s very likely he will be interested in your product/service. When user is coming from AdSense page he may be directed to this page for different reason.

(more…)

Powered by WordPress