Digitization of Books for Indic Language WikiSource
Event details
When
from 02:30 PM to 05:30 PM
Where
Contact Name
This workshop will be conducted by Malayalam Wikimedian Viswanathan Prabhakaran. Anyone interested in learning about the process of digitising old manuscripts, books and creating text based documents could join this workshop. The workshop will cover the following topics:
- Best practices in capturing images using a camera and tripod through demonstration;
- An introduction to the types of scanners;
- How to hold books and the need to treat old books with proper care;
- Discussion on image formats and some basic comparison (i.e. djvu, PDF, JPEG, TIFF, BMP, GIF);
- Introduction and practical use of SM Tether (using Nikon dSLR) in capturing images;
- Practical demonstration of using Scan Tailor (a free software) in post-processing of scanned pages. Splitting, deskewing, rearranging borders, and de-speckling of scanned pages;
- Some basic discussion on copyright and introduction to Wiki Source;
- Importance of online archival resources (DLI, DSAL, Archive.org, etc) and when to do or not to redo scanning of books (i.e., image resolution) that are already available in scanned format;
- OCR and Indian languages.