WebCite

Psychology: Debates · Journals · Psychologists

WebCite is an on-demand archiving service, designed to digitally preserve scientific and educationally important material on the web by making snapshots of Internet contents as they existed at the time when a scholar cited or quoted from it. The indefinite preservation allows for verifiability of claims supported by the cited sources even when the original web pages are being revised, removed, or disappear for other reasons, an effect known as link rot.

Comparison to other services[]

The service differs from the short time Google Cache copies by having indefinite archiving, and from Internet Archive by being an on-demand service.

WebCite is a non-profit consortium supported by publishers and editors, and it can be used by individuals without charge. Rather than relying on a web crawler which archives pages in a "random" fashion, authors who want to cite web pages in a scholarly article can initiate the archiving process. They then cite — instead of or in addition to the original URL — the snapshot address archived by WebCite, with an identifier that specifies the cited source.

It can be used to preserve cited Internet content, such as the archived web pages through WebCite, in addition to citing the original URL of the Internet content. All types of web content, including HTML web pages, PDF files, style sheets, JavaScript and digital images can be preserved. It also archives metadata about the collected resources such as access time, MIME type, and content length.

History[]

Conceived in 1997 by Gunther Eysenbach, WebCite was publicly described the following year when an article on Internet quality control declared that such a service could also measure the citation impact of web pages.^[1] In the next year, a pilot service was set up at the address webcite.net (see Template:Waybackdate). Although it seemed that the need for WebCite decreased when Google's short term copies of web pages begun to be offered by Google Cache and web content begun to be archived by Internet Archive, only WebCite allows for "on-demand" archiving by users. WebCite also offers interfaces to scholarly journals and publishers to automate the archiving of cited links. By 2008, over 200 journals had begun routinely using WebCite.^[2]

WebCite used to be, but is no longer, a member of the International Internet Preservation Consortium.^[3] In a 2012 message on Twitter, Eysenbach commented that "WebCite has no funding, and IIPC charges 4000 Euro/yr in membership fees."^[4]

WebCite "feeds its content" to other digital preservation projects, including the Internet Archive.^[3] Lawrence Lessig, an American academic who writes extensively on copyright and technology, used WebCite in his amicus brief in the United States Supreme Court case of MGM Studios, Inc. v. Grokster, Ltd.^[5]

Fundraising[]

WebCite started a FundRazr in February 2013 with a target of $50,000 (though it is now $25,000), a sum which its operators stated is needed to sustain the submission service beyond the end of 2013.^[6]

Process[]

WebCite allows on-demand prospective archiving. It is not crawler-based; pages are only archived if the citing author or publisher requests it. No cached copy will appear in a WebCite search unless the author or another person has specifically cached it beforehand.

To initiate the caching and archiving of a page, an author may use WebCite's "archive" menu option or create a WebCite bookmarklet that will allow web surfers to cache pages just by clicking a button in their bookmarks folder.

One can retrieve or cite archived pages through a transparent format such as

http://webcitation.org/query?url=URL&date=DATE

where URL is the URL that was archived, and DATE indicates the caching date. For example,

http://webcitation.org/query?url=http%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMain_Page&date=2008-03-04

or the alternate short form http://webcitation.org/5W56XTY5h retrieves an archived copy of the URL http://en.wikipedia.org/wiki/Main_Page that is closest to the date of March 4, 2008.

It is important to note that WebCite does not work for pages which contain a no-cache tag. WebCite respects the author's request to not have their web page cached.

One can archive a page by simply navigating in their browser to a link formatted like this:

http://webcitation.org/archive?url=urltoarchive&email=youremail

replacing urltoarchive with the full URL of the page to be archived, and youremail with their e-mail address. This is how the WebCite bookmarklet works.^[7]

Business model[]

The term "WebCite" is a registered trademark.^[8] WebCite does not charge individual users, journal editors and publishers^[9] any fee to use their service. WebCite earns revenue from publishers who want to "have their publications analyzed and cited webreferences archived",^[10] and accepts donations. Early support was from the University of Toronto.^[11]

Copyright issues[]

WebCite maintains the legal position that its archiving activities^[2] are allowed by the copyright doctrines of fair use and implied license.^[3] To support the fair use argument, WebCite notes that its archived copies are transformative, socially valuable for academic research, and not harmful to the market value of any copyrighted work.^[3] WebCite argues that caching and archiving web pages is not considered a copyright infringement when the archiver offers the copyright owner an opportunity to "opt-out" of the archive system, thus creating an implied license.^[3] To that end, WebCite will not archive Web sites in violation of "do-not-cache" and "no-archive" metadata, as well as robot exclusion standards, the absence of which creates an "implied license" for web archive services to preserve the content.^[3]

In a similar case involving Google's web caching activities, on January 19, 2006, the United States District Court for the District of Nevada agreed with that argument in the case of Field v. Google (CV-S-04-0413-RCJ-LRL), holding that fair use and an "implied license" meant that Google's caching of Web pages did not constitute copyright violation.^[3] The "implied license" referred to general Internet standards.^[3]

Outages[]

June–July 2009 outages[]

In June 2009, attempts to create new citations failed. The project's creator wrote on June 19 that increased server load generated by Wikipedia's Template:Srlink prompted migration of the service to a new server.^[12] By the end of June 2009, attempts to access the project's website returned a message that it was "undergoing maintenance", and previously archived links became inaccessible. The archiving service resumed operation by the second week of July 2009, and previously archived links became accessible again.

September 2011 outage[]

On September 4, 2011 Gunther Eysenbach, owner of the "webcitation.org" domain, was notified that not only were new citation requests producing errors, but previous citations were failing as well. Subsequently the following message appeared on various WebCite pages:

Sep 6, 2011 – On Sep 3rd (just before the long labor day weekend), WebCite went down due to a hardware failure. While we are restoring the database from our backups, no new snapshots can be made, and old snapshots may be temporarily unavailable. We apologize for any inconvenience caused.

On September 11, 2011 at 7:38pm EST, WebCite appeared to be up-and-running again:

Sep 11, 2011 – We apologize for the recent outage following the week of Sep 3rd, 2011. WebCite went down due to a hardware failure, and restoring our huge database took a couple of days. Everything should be back to normal. We apologize for any inconvenience caused.

August - September 2013[]

On 8 August 2013, at approximately 1350h, the web site stopped responding altogether, or, starting about half an hour later, with a message indicating a database error. It is still not functioning.

References[]

↑ Eysenbach, Gunther, Diepgen, Thomas L. (28 November 1998). Towards quality management of medical information on the internet: evaluation, labelling, and filtering of information. British Medical Journal 317 (7171): 1496–1502. BL Shelfmark 2330.000000.
↑ ^2.0 ^2.1 Eysenbach, Gunther, Trudel, Mathieu (2005). Going, Going, Still There: Using the WebCite Service to Permanently Archive Cited Web Pages. Journal of Medical Internet Research 7 (5): e60.
↑ ^3.0 ^3.1 ^3.2 ^3.3 ^3.4 ^3.5 ^3.6 ^3.7 WebCite Consortium FAQ. webcitation.org. WebCite.
↑ Twitter post. URL accessed on 2013-03-10.
↑ includeonly>Cohen, Norm. "Courts Turn to Wikipedia, but Selectively", January 29, 2007.
↑ The 2013 WebCite fundraising campaign at FundRazr
↑ WebCite Bookmarklet.
↑ WebCite Legal and Copyright Information. WebCite Consortium. URL accessed on 2009-06-16.
↑ WebCite Member List. WebCite Consortium. URL accessed on 2009-06-16.
↑ WebCite Frequently Asked Questions. WebCite Consortium. URL accessed on 2009-06-16.
↑ WebCite Frequently Asked Questions – Who owns and runs WebCite at the moment?. WebCite Consortium. URL accessed on 2009-06-16.
↑ Gunther Eysenbach – Twitter update. Twitter.com. URL accessed on 2013-06-30.

External links[]

Script error: No such module "URL".

This page uses Creative Commons Licensed content from Wikipedia (view authors).

[1] Eysenbach, Gunther, Diepgen, Thomas L. (28 November 1998). Towards quality management of medical information on the internet: evaluation, labelling, and filtering of information. British Medical Journal 317 (7171): 1496–1502. BL Shelfmark 2330.000000.

[JMIR-2] 2.0 ^2.1 Eysenbach, Gunther, Trudel, Mathieu (2005). Going, Going, Still There: Using the WebCite Service to Permanently Archive Cited Web Pages. Journal of Medical Internet Research 7 (5): e60.

[faq-3] 3.0 ^3.1 ^3.2 ^3.3 ^3.4 ^3.5 ^3.6 ^3.7 WebCite Consortium FAQ. webcitation.org. WebCite.

[4] Twitter post. URL accessed on 2013-03-10.

[5] udeonly>Cohen, Norm. "Courts Turn to Wikipedia, but Selectively", January 29, 2007.

[funding-6] The 2013 WebCite fundraising campaign at FundRazr

[7] WebCite Bookmarklet.

[wc_license-8] WebCite Legal and Copyright Information. WebCite Consortium. URL accessed on 2009-06-16.

[wc_MEMBERS-9] WebCite Member List. WebCite Consortium. URL accessed on 2009-06-16.

[wc_FAQ-10] WebCite Frequently Asked Questions. WebCite Consortium. URL accessed on 2009-06-16.

[11] WebCite Frequently Asked Questions – Who owns and runs WebCite at the moment?. WebCite Consortium. URL accessed on 2009-06-16.

[12] Gunther Eysenbach – Twitter update. Twitter.com. URL accessed on 2013-06-30.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]