Date:      Mon, 24 Dec 2007 23:49:53 -0800 (PST)
From:      RSean <rseans@gmail.com>
To:        freebsd-questions@freebsd.org
Subject:   Re: performance impact of large /etc/hosts files
Message-ID:  <14493715.post@talk.nabble.com>
In-Reply-To: <20071212211015.439a673a@gumby.homeunix.com.>
References:  <475E0190.7030909@pacific.net.sg> <200712111718.05876.nvass@teledomenet.gr> <475EAC9D.1020902@pacific.net.sg> <20071211084309.A16234@wonkity.com> <475EB887.6070902@pacific.net.sg> <475EC215.8060004@dial.pipex.com> <475F4209.8080507@pacific.net.sg> <200712120920.46626.nvass@teledomenet.gr> <475F9648.804@pacific.net.sg> <20071212085939.F21510@wonkity.com> <47600D2B.70306@dial.pipex.com> <20071212120214.C22244@wonkity.com> <20071212211015.439a673a@gumby.homeunix.com.>

Hi guys,

Just curious whether anyone has tried using regular expressions to block
ads and banners.

We have a small network of about 10 users. We use SafeSquid as a proxy and
content filter. It supports regexes for defining rules.

The URL Filter section has two default rules for blocking ads and banners:

Hosts: 
(^ad(|s|v|server)\.|adtag\.|targetsearches.com|webconnect.net|imgis.com|atwola.com|fastclick.net|abz.com|tribalfusion.com|advertising.com|atdmt.com|spinbox\.(com|net)|linkexchange.com|hitbox.com|doubleclick.net|valueclick.com|click2net.com|mediaplex.com|247media.com|clickagents.com|adbutler.com|qkimg.net|realmedia.com|us.a1.yimg.com|clickheretofind.com|images.cybereps.com|adbureau.net|sfads.osdn.com|adflow.com|adprofs.com|zedo.com|digitalmedianet.com|ad-flow.com|/adsync/|adtech.de|netdirect.nl|rcm-images.amazon.com|pamedia.com|msads.net|valuead.com|smartadserver.com|thisbanner.com|aaddzz.com|scripps.com|ru4.com|adtrix.net|falkag.net)

File:
(/adimages/|/banner(|s)/|/ad(|s|v|(|_)banner(|s))/|/adx/|/sponsors/|/advert(ising|s|)/|/adcycle/|/track/|/promo/|/adspace/|/admentor/|/image\.ng/|/ajrotator/|/adview.php|/clickthru|/affiliates|banmat(\.cgi|.\.cgi)|/adproof/|/bannerfarm/|/BannerAds/|/banner_|sponsorid|/servfu.pl|/RealMedia/|/adsync/|_ad_|/adceptdelivery.cgi)

I am not a very technical person, but as far as I can tell, the first rule
is a regex that matches the host names of servers that serve ads, while the
second is a regex for words that the file (path) part of a URL may contain.
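
To illustrate the idea, here is a rough Python sketch of how a host rule and
a file rule like these could be applied to a URL. It is not how SafeSquid
itself works internally. The two patterns are abbreviated subsets of the
rules quoted above, and the is_blocked() helper, the unanchored "search"
matching, and the example URLs are all assumptions made for illustration.

```python
import re
from urllib.parse import urlsplit

# Abbreviated subsets of the two quoted rules (not the full lists).
HOST_RULE = re.compile(
    r"(^ad(|s|v|server)\.|adtag\.|doubleclick.net|fastclick.net|atdmt.com)"
)
FILE_RULE = re.compile(
    r"(/adimages/|/banner(|s)/|/sponsors/|/advert(ising|s|)/|/adview.php|_ad_)"
)


def is_blocked(url):
    """Hypothetical helper: True if the host or the path matches a rule.

    Assumes the host rule is tested against the host name only and the
    file rule against the path only, using unanchored matching.
    """
    parts = urlsplit(url)
    host = parts.hostname or ""
    path = parts.path or ""
    return bool(HOST_RULE.search(host) or FILE_RULE.search(path))


if __name__ == "__main__":
    for url in (
        "http://ads.example.com/index.html",       # host starts with "ads."
        "http://www.example.com/banners/top.gif",  # path contains "/banners/"
        "http://www.example.com/index.html",       # matches neither rule
    ):
        print(url, "->", "blocked" if is_blocked(url) else "allowed")
```

Note that most dots in the quoted rules are unescaped, so in a regex engine
like Python's an entry such as "doubleclick.net" matches any character in
place of the dot, not only a literal ".".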

These rules block ads and banners at the gateway very effectively, saving
bandwidth and improving the surfing experience.

Just thought I should mention this.

Cheers!
-- 
View this message in context: http://www.nabble.com/performance-impact-of-large--etc-hosts-files-tp14267018p14493715.html
Sent from the freebsd-questions mailing list archive at Nabble.com.



