"OCR" stands for optical character recognition. The scanned image pages of the documents on Profiles in Science are subjected to OCR to obtain the text within them. An example of a scanned document and its corresponding OCR text can be found at FSBBDL. OCR is often imperfect, particularly for historical documents such as those found on Profiles in Science. OCR is ineffective for handwritten documents. Profiles in Science is currently making the OCR text available as an experiment. In cases where the OCR text is accurate, it may be helpful to someone with low vision who is using a screen reader. No OCR text is listed for scanned documents that have already been transcribed since the transcript is more accurate than OCR. If you have suggestions for making the digitized materials on Profiles in Science more accessible for users with vision problems, please email them to firstname.lastname@example.org.
Most of the digital materials in Profiles in Science are protected by the U.S. Copyright Law (Title 17, U.S.C.). Under Copyright Law, some use of copyright protected documents may be allowed under "fair use." Transmission or reproduction of protected items beyond that allowed by fair use requires the written permission of the copyright owner. You are under obligation to determine and satisfy copyright restrictions when publishing or otherwise distributing digital materials found on Profiles in Science. Whenever possible, we have provided information about the copyright owner of each document; this information appears in the "Rights" section of each metadata page. The National Library of Medicine would like to hear from any copyright owners who are not properly identified (email@example.com). Our Historical Collections Copyright Information page contains information specific to copyright and historical collections. More information about fair use and copyright in general is available from the U.S. Copyright Office.
You will need either Quicktime or the RealMedia player to see the movies and play audio. You will need the Adobe Acrobat Reader to view PDF files. Please see the list of necessary viewer applications/plug-ins at https://profiles.nlm.nih.gov/Help/Helpers/.
Many factors are considered when choosing collections for Profiles in Science. We are looking for scientists who have had an outstanding career and whose research can stand as a model for scientific investigation. The availability of a scientist's papers, and the depth and complexity of the collection are important factors. Another factor we consider is how well the papers reflect the scientist's activities in promoting public understanding of and debate on science-related topics.
Yes. We are continually adding new collections as well as adding more digital objects to the collections that we already have up. Please check the What's New section for updates.
No. Our staff evaluates the documents according to their historical significance. Historians and archivists select documents from the collections that provide insight into both the development of modern biomedical science as well as the character and life of the individual scientists. The physical collections contain additional materials that are not selected for digitization.
NLM collections are available for use by contacting the History of Medicine Division at the NLM. (email: firstname.lastname@example.org, phone: 301-496-5405). For collections not owned by the NLM, researchers may contact the institution that holds the papers about their availability. Contact information is provided with each collection.
Metadata is broadly defined as "data about data." For example, a library's card catalog is metadata -- data about the books, prints, photographs, and films that make up the library's collections. On Profiles in Science, you will find a metadata record about each document -- letter, article, photograph, interview, etc. -- in the digital collection. Each metadata record that we create contains information such as the document's title, creator, date of creation, copyright statement, location within the collection, and page length. The metadata is particularly useful because it provides information that may not be apparent on the digitized document itself. You can search the information contained in the metadata by checking the box in front of the word "Metadata" on our search engine page.
Black and white (b&w) pages are scanned and stored in a lossless, compressed b&w TIFF format. These TIFF files are used to create Adobe image+text PDF files. The TIFF files are stored offline for future use, and the PDF files are made available over the Web.
Greyscale and color items are scanned and stored in a lossless, uncompressed TIFF format. Greyscale items are 8-bit (256 shades of grey). Color items are 24-bit (about 16 million colors). These TIFF files are used to create JPEG files. The TIFF files are stored offline for future use, and the JPEG files are made available over the Web.
Black and white pages are scanned and made available at 300 dpi. Greyscale and color items are scanned as 600 dpi TIFFs, and made available as 600 dpi and 75 dpi JPEGs. Small items are scanned at a higher resolution. For example, 35mm slides are scanned as 4,800 dpi TIFFs and made available as 4,800 and 600 dpi JPEGs.
The Dublin Core subset of the metadata associated with the Profiles in Science digitized items is available through the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Service. More information and examples are available on the OAI-PMH Service Help page.
For experimentation, the Dublin Core subset of the metadata associated with the Profiles in Science digitized items is available as linked data through RDF/XML, JSON, JSON-LD, N3/Turtle, and N-Triples files. These files are listed in the Linked Data section of every metadata record, for example here. Please note that these linked data files are subject to change, and currently there is no SPARQL endpoint.