翻訳と辞書
Words near each other
・ Herm, Landes
・ Herma
・ Herma (Xenakis)
・ Heritage Woods, Alberta
・ Heritage365
・ HeritageWest Credit Union
・ Heritier Lumumba
・ Heritiera
・ Heritiera fomes
・ Heritiera littoralis
・ Heritiera longipetiolata
・ Heritiera parvifolia
・ Heritiera percoriacea
・ Heritiera utilis
・ Heritor
Heritrix
・ Herius Asinius
・ Herivelto Martins
・ Heriz rug
・ Herizo Razafimahaleo
・ Herizons
・ Herići
・ Herja
・ Herja River
・ Herjalf
・ Herjangsfjord
・ Herjava
・ Herjolfsfjord
・ Herjolfsnes (Norse Greenland)
・ Herjulf Bårdsson


Dictionary Lists
翻訳と辞書 辞書検索 [ 開発暫定版 ]
スポンサード リンク

Heritrix : ウィキペディア英語版
Heritrix

Heritrix is a web crawler designed for web archiving. It was written by the Internet Archive. It is free software license and written in Java. The main interface is accessible using a web browser, and there is a command-line tool that can optionally be used to initiate crawls.
Heritrix was developed jointly by the Internet Archive and the Nordic national libraries on specifications written in early 2003. The first official release was in January 2004, and it has been continually improved by employees of the Internet Archive and other interested parties.
Heritrix was not the main crawler used to crawl content for the Internet Archive's web collection for many years. The largest contributor to the collection is Alexa Internet.〔 Alexa crawls the web for its own purposes,〔 using a crawler named ''ia_archiver''. Alexa then donates the material to the Internet Archive.〔 The Internet Archive itself did some of its own crawling using Heritrix, but only on a smaller scale.〔
Starting in 2008, the Internet Archive began performance improvements to do its own wide scale crawling, and now does collect most of its content.〔http://blog.archive.org/2013/01/09/updated-wayback〕
== Projects using Heritrix ==

A number of organizations and national libraries are using Heritrix, among them:
* (Austrian National Library, Web Archiving )
* (Bibliotheca Alexandrina's Internet Archive )
* Bibliothèque nationale de France
* British Library
* (California Digital Library's Web Archiving Service )
* CiteSeerX
* (Documenting Internet2 )
* Internet Memory Foundation
* Library and Archives Canada
* Library of Congress ()
* National and University Library of Iceland
* National Library of Finland
* National Library of New Zealand
* National Library of the Netherlands (Koninklijke Bibliotheek)〔http://www.kb.nl/organisatie/onderzoek-expertise/e-depot-duurzame-opslag/webarchivering/technische-aspecten-bij-webarchivering〕
* (Netarkivet.dk )
* (Smithsonian Institution Archives )
* (National Library of Israel )

抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)
ウィキペディアで「Heritrix」の詳細全文を読む



スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース

Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.