HTTP referer

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license.

This is an old revision of this page, as edited by SusanLesch (talk | contribs) at 12:09, 9 November 2007 (-wl). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.


The referer, or HTTP referer, identifies, from the point of view of a web page or other resource, the address of the resource that links to it (commonly a URL, or more generally a URI or its internationalized form, an IRI). By checking the referer, the server handling the new page can see where the request came from. Referer logging allows websites and web servers to identify where their visitors are coming from, for promotional or security purposes. Because the referer can easily be spoofed (faked), however, it is of limited use for these purposes except on a casual basis.

A dereferer is a means of stripping the details of the referring website from a link request, so that the target website cannot identify the page on which the link was clicked.

Referer is a common misspelling of the word referrer. It is so common, in fact, that it made it into the official specification of HTTP – the communication protocol of the World Wide Web – and has therefore become the standard industry spelling when discussing HTTP referers.

Details

When visiting a webpage, the referer or referring page is the URL of the previous webpage from which a link was followed.

More generally, a referer is the URL of a previous item which led to this request. The referer for an image, for example, is generally the HTML page on which it is to be displayed. The referer is part of the HTTP request sent by the browser program to the web server.
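As a rough illustration of the paragraph above, the following Python sketch builds the text of a minimal HTTP/1.1 request for an image and then reads the referer back out of it. The host name, paths, and function names are invented for the example; only the header name Referer comes from the HTTP specification.

```python
from typing import Optional


def build_request(host: str, path: str, referer: Optional[str] = None) -> str:
    """Return the text of a minimal HTTP/1.1 GET request (hypothetical helper)."""
    lines = [f"GET {path} HTTP/1.1", f"Host: {host}"]
    if referer is not None:
        # The header name keeps the specification's misspelling.
        lines.append(f"Referer: {referer}")
    # A blank line ends the header section.
    return "\r\n".join(lines) + "\r\n\r\n"


def read_referer(request_text: str) -> Optional[str]:
    """Return the Referer value of a request, or None if it was not sent."""
    head, _, _ = request_text.partition("\r\n\r\n")
    # Skip the request line; every following line is "Name: value".
    for line in head.split("\r\n")[1:]:
        name, sep, value = line.partition(":")
        if sep and name.strip().lower() == "referer":
            return value.strip()
    return None


# An image request whose referer is the HTML page that displays the image.
request = build_request(
    "www.example.com",
    "/images/photo.jpg",
    referer="http://www.example.com/gallery.html",
)
```

Here read_referer(request) gives back the gallery page address, while a request built without the referer argument yields None, which is what a server sees when the browser withholds the header.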

Many websites log referers as part of their attempt to track their users, and most web log analysis software can process this information. Because referer information can violate privacy, some browsers allow the user to disable the sending of it. Some proxy and firewall software will also filter out referer information, to avoid leaking the location of non-public websites. This can in turn cause problems: some servers block parts of their site to browsers that do not send the expected referer information, in an attempt to prevent deep linking or unauthorised use of images (bandwidth theft). Some proxy software can give the top-level address of the target site as the referer, which usually prevents these problems while still not divulging the user's last visited site.
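The kind of referer check a server might use against unauthorised image use can be sketched as follows. This is a minimal example under stated assumptions, not any particular server's method: the allowed host names are hypothetical, and the allow_missing flag is an illustrative way of deciding what to do with visitors whose browser or proxy has removed the header.

```python
from typing import Optional
from urllib.parse import urlsplit

# Hypothetical hosts whose pages are allowed to embed this site's images.
ALLOWED_HOSTS = {"www.example.com", "example.com"}


def allow_image_request(referer: Optional[str], allow_missing: bool = True) -> bool:
    """Decide whether to serve an image, based only on the Referer value."""
    if not referer:
        # Header absent or blank: the browser or a proxy may have stripped it.
        return allow_missing
    host = urlsplit(referer).hostname  # None if the value is not a usable URL
    return host in ALLOWED_HOSTS
```

With allow_missing set to False, users who disable the header are locked out, which is the problem described above. Because the value is supplied by the client and is easily spoofed, a check like this deters only casual deep linking.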

Recently many blogs have started publishing referer information in order to link back to people who are linking to them, and hence broaden the conversation. This has led, in turn, to the rise of referer spam: the sending of fake referer information in order to popularize the spammer's site.
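A blog that publishes its referers has to tally them first. The sketch below assumes the referer values have already been collected from incoming requests; the function name and the sample addresses are hypothetical. It counts every external referer exactly as received, with no verification, which is the weakness that referer spam exploits.

```python
from collections import Counter
from typing import Iterable, Optional
from urllib.parse import urlsplit


def count_external_referers(referers: Iterable[Optional[str]], own_host: str) -> Counter:
    """Count referring pages, ignoring blank values and the site's own pages."""
    counts: Counter = Counter()
    for referer in referers:
        if not referer:
            continue  # header was not sent or was blanked
        host = urlsplit(referer).hostname
        if host is None or host == own_host:
            continue  # unusable value, or an internal link
        counts[referer] += 1
    return counts


# Sample values as they might appear in one day's requests (invented data).
seen = [
    "http://friend.example.org/post/1",
    "http://friend.example.org/post/1",
    "http://blog.example.com/archive",   # the blog's own page
    None,
    "http://spam.example.net/buy-now",   # could be real or forged
]
linkbacks = count_external_referers(seen, own_host="blog.example.com")
```

In this sample, linkbacks holds a count of 2 for the friend's post and 1 for the spam address; nothing in the data distinguishes a genuine visit from a forged header.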

Many pornographic paysites use referer information to secure their materials: only browsers arriving from a small set of approved pages (typically login pages) are given access, which makes it easy to share materials among a group of cooperating paysites. Referer spoofing is often used to gain free access to these sites.

Referer hiding

Most web servers maintain logs of all traffic and record the HTTP referer sent by the browser for each request. This raises a number of privacy concerns, and as a result several systems have been developed to prevent servers from being sent the real referring URL. These systems work either by blanking the referer header or by replacing it with inaccurate data. Generally, internet security suites blank the referer data, while web-based services replace it with a false URL, usually their own, which in turn contributes to the problem of referer spam. The technical details of each method are fairly consistent: software applications act as a proxy server and manipulate the HTTP request, while web-based methods load websites within frames, causing the browser to send the service's own address as the referer URL. Some web browsers give their users the option to turn off referer headers.
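The two approaches, blanking and replacing, can be sketched as operations on a request's header fields. This is an illustrative example only: headers are modelled as a plain dictionary, and the function names and addresses are hypothetical rather than taken from any real proxy product.

```python
from typing import Dict
from urllib.parse import urlsplit


def blank_referer(headers: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of the headers with any Referer field removed."""
    return {name: value for name, value in headers.items()
            if name.lower() != "referer"}


def replace_referer(headers: Dict[str, str], new_value: str) -> Dict[str, str]:
    """Return a copy of the headers with the Referer set to new_value."""
    cleaned = blank_referer(headers)
    cleaned["Referer"] = new_value
    return cleaned


def target_root(target_url: str) -> str:
    """Return the top-level address of the site being requested."""
    parts = urlsplit(target_url)
    return f"{parts.scheme}://{parts.netloc}/"


outgoing = {
    "Host": "www.example.org",
    "Referer": "http://intranet.example.com/private/page.html",
}
blanked = blank_referer(outgoing)
replaced = replace_referer(
    outgoing, target_root("http://www.example.org/images/a.jpg")
)
```

Here blanked no longer carries a Referer field at all, while replaced carries http://www.example.org/ instead of the private address. Substituting the target site's own top-level address is one choice of replacement value; a web-based service would more typically end up sending its own address.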

See also

  • Referer spoofing, changing referer information to gain unauthorized access to a web site.
  • Referer spam, providing fake referer information in order to popularize a spammer's website.

References and external links

  • RFC 2616: Hypertext Transfer Protocol – HTTP/1.1
  • IRI – Internationalized Resource Identifiers