The Smithsonian Seeks Volunteers for Massive Digitization Project
Today the Smithsonian launches its Transcription Center website to the public. The website is designed to leverage the power of crowds to help the Smithsonian unlock the content inside thousands of digitized images of documents, such as handwritten civil war journals, personal letters from famous artists, 100-year-old botany specimen labels and examples of early American currency.
The Smithsonian has already produced digital images for millions of objects, specimens and documents in its collection. Many of the digitized documents are handwritten or have text that computers cannot easily decipher. Transcription by humans is the only way to make the text of these items searchable, which will open them up for endless opportunities for research and discovery.
"We are thrilled to invite the public to be our partners in the creation of knowledge to help open our resources for professional and casual researchers to make new discoveries," said Smithsonian Secretary Wayne Clough. "For years, the vast resources of the Smithsonian were powered by the pen; they can now be powered by the pixel."
The Smithsonian's collection is so vast that transcribing its content using its own staff could take decades. By harnessing the power of online volunteers that goal can become a reality. During the past year of beta testing with nearly 1,000 volunteers, the Transcription Center completed more than 13,000 pages of transcription. In one instance-transcribing the personal correspondence of members of the Monuments Men held in the Smithsonian's Archives of American Art collection-49 volunteers finished the 200-page project in just one week. By some estimates, the volunteers are completing in a couple of days what it would take the Smithsonian months to complete without their help. Once a document is done, the work is reviewed by another volunteer before it is certified for accuracy by a Smithsonian expert.
Projects selected for transcription during the beta-test phase were chosen due to high demand from scientists, researchers and enthusiasts for certain items that presented accessibility challenges. For example, the Smithsonian's National Museum of Natural History has one of the world's largest bumble bee collections-nearly 45,000 specimens. Information about each bee, such as where it was collected and when it was collected, is extremely valuable to scientists studying the rapid decline of bee populations during the past few decades. The only way to obtain this information before digitization and transcription would be for a scientist to come to the museum and read each tiny, handwritten label (often as small as 3 millimeters by 7 millimeters) and record the information. Now, with the information digitized and transcribed, scientists anywhere in the world can understand more about the population history of the bumble bee and its recent population decline. The bumble bee transcription project is currently one of thehighlighted projects on the site.