In what is a painstaking process, Wikipedians are digitizing Indian language, out-of-copyright texts online, trying to address the comparative paucity of Indic language texts online. Wikisource is a repository of documents and archived material that serves as a reference source for Wikipedia, and a means of improving access to information sources. Of the 64 languages Wikisource is available in, 8 are Indian: Tamil (stats), Malayalam (stats), Telugu (stats), Kannada (stats), Sanskrit (stats), Marathi (stats), Bengali (stats) and Gujarati (stats). What's particularly notable about this digitization is that the texts are being typed out by volunteers on their own time, one word at a time. How It Began Users were adding bhajans of Mirabai to Wikipedia, but according to Wikipedia's policies, recipes, poems and song lyrics belong to Wikibooks or Wikisource, Noopur Raval, Communications Consultant (India Program) at the Wikimedia Foundation told MediaNama. One user raised this issue, and following discussions, it was decided to create a Wikisource for Gujarati. The first text to be digitized, though, was Rachnatmak Karyakram, a book by Mahatma Gandhi. The project, involving the digitization of 60 pages, took six volunteers a week. This was followed by another project, the digitization of Gandhi's autobiography, with a group of 13 people typing out the book over a month. Identification & Prioritization Of Texts For Digitization Selection of text for digitization is entirely community driven: they decide what is important. Editors put up a notice for the project, and user participation is sought. For example, the Gujarati Wikisource editors chose…
- US Judge Allows $4.7 Billion Class Action Lawsuit Against Google By 21 Million Individuals November 29, 2022
- AIIMS denies ₹200 crores ransom demand, servers remain down for 7th day November 29, 2022
- Here’s How Indian Government Justifies Search and Seizures of Devices November 29, 2022
- DPDP Bill, 2022: A Small Amendment May Impact Indians’ Rights to Information and Accountability November 28, 2022
- Why has the Indian central bank stopped Paytm from onboarding new online merchants? November 28, 2022
MediaNama’s mission is to help build a digital ecosystem which is open, fair, global and competitive.
The Structure and Style of a Dogma Community: Conspiracy theories and organized Twitter engagement on Sushant Singh Rajput
Studying the 'community' supporting the late Sushant Singh Rajput (SSR) shows how Twitter was gamed through organized engagement
Do we have an enabling system for the National Data Governance Framework Policy (NDGFP) aiming to create a repository of non-personal data?
A viewpoint on why the regulation of cryptocurrencies and crypto exchnages under 2019's E-Commerce Rules puts it in a 'grey area'
India's IT Rules mandate a GAC to address user 'grievances' , but is re-instatement of content removed by a platform a power it should...
Why ‘group privacy’ should be recognised, and how ‘non-personal’ data becomes a regulatory blindspot
There is a need for reconceptualizing personal, non-personal data and the concept of privacy itself for regulators to effectively protect data
Please subscribe to MediaNama. Don't share prints and PDFs.
You May Also Like
Google has released a Google Travel Trends Report which states that branded budget hotel search queries grew 179% year over year (YOY) in India, in...
135 job openings in over 60 companies are listed at our free Digital and Mobile Job Board: If you’re looking for a job, or...
Twitter takes down tweets from MP, MLA, editor criticising handling of pandemic upon government request
By Aroon Deep and Aditya Chunduru You’re reading it here first: Twitter has complied with government requests to censor 52 tweets that mostly criticised...