Maharashtra Voter List to go Unicode!
The voter list in Maharashtra(India) is soon going to be Unicode. The voter list data already exists in computerized form and is available in local languages.But there is no provision in the system for a public interface in Indian Languages.Linux and free software, localized Indian Languages, and the Unicode standard alone can provide an affordable universal interface.
The voter list will be only a starting point in the move to generalize Indian language-enabled e-governance initiatives.
Professor Jitendra Shah’s IndicTrans team has come up with this idea.
Open source will be used in US in November elections and if it gets implemented then we’ll be the first in the world 
In Detail:-
Understanding the Issues in Detail
The first issue concerns the voter list as such. In its current form, the lists are composed in Indian languages–for example, in Marathi for Maharashtra–and stored in databases. The storage traditionally has been using ISCII, a national font-independent standard. But when the data has to be displayed it inevitably has to be done in vendor-specific font-encoding, such as ISFOC. In addition, in order to not expose this data to the public, the electronic data files are not published by the CEC. On a side-note, whole lists easily can be downloaded from state sites for Delhi, Kerala or Andhra Pradesh and so on. Hence in Maharashtra, the Electoral Office converts the rolls to PDF files that then are displayed on its Web site. This requires visual scanning of page after page to look up one’s name.
This method doesn’t make extracting information an easy process for non-skilled computer users. Although it is possible to use certain tools to extract the information from the PDF files, you can do this for only certain files, not all of them.
Now, suppose your name was misspelled. To get the information updated, you would have to write a letter to the CEC by hand. And guess what? The content in the files (in its current state) would require some proprietary software to make any amendments.
The Ideal World Case
What Prof. Shah is proposing is “citizen-friendly access”. According to him, the same data should be available in a public, font-independent standard that is multilingual and accepted in all major operating systems–Unicode.
Shah also believes the CEC should relax its policy of restricting access to this data. If CEC decides to share the data–with appropriate security checks, of course–it would be imperative to shift to standards that can be accessed without proprietary software.
Prof. Shah asserts that public information, such as voter lists, must be available in formats that follow open public standards and must be available for amending and interaction without any expenditure on or binding to a closed software system of a specific vendor. It is, of course, subject to the law of the land as to how much access is given to the lay citizen.
Access to free software ensures that an official at any administrative level could make the amendments and submit the same to a higher authority for approval.
Proof of Concept
Now that we understand the difficulties in the existing system and how we want the situation to be handled, let’s look at the technology that bridges the gap.
As already stated, the government has to adopt an accepted standard for maintaining its records. Unicode is the best option available, and according to Prof. Shah, both the state and the central government has accepted it as its future direction. All e-Governance projects most likely will be funded for conversion from older standards or non-standards to Unicode. For people not yet convinced about the usability of Unicode, Prof. Shah has tabulated various configurations to demonstrate its compatibility. The table is available here.
As a proof of concept, Prof. Shah and his team have converted the content for the voter list from the non-standard font-encoded format to the standard Unicode format. You can view screenshots, see sample converted files and search on sample data here.
As a demonstration of the power and use of Unicode, the team also has converted the Marathi files into Gujarati. This proves that interlingual translatability is not exclusive to ISCII and even Unicode can achieve the same.
Prof. Shah’s team now is adept at converting various formats of data to and from Unicode. They parse the files from their software-specific structure, say .rtf, .dbf or .html, to a generic structure. This often is done implicitly or explicitly. Then, the information is converted either as files or on the fly. The same can be restored to its original structure whenever needed.
The Response
The Election Office has yet to decide on the adoption of the solution. The software solution has been proposed first as working for standalone machines for telephone help-lines. It may be extended for use on the Internet.
Prof. Shah feels that the bureaucracy hesitates with the technology because many commercial software vendors have been promising a lot, but so far they have not delivered. Often, support is a problem. The need for support is felt even more acutely in the open-source domain. Plus, there is the added responsibility that comes with the freedom associated with open-source software, which the bureaucracy is not equipped to handle.
Prof. Shah, acting as a teacher, finds the bureaucrats to be quite amenable to open source. This is partly because no price tag is attached to his opinions and nor a hidden agenda other than making the democratic process more democratic.
Source link
If you liked this post, please buy me a beer & encourage me to write more














Tags:
Related Posts:



Cool. Bharat improving.
Yeah Mera Bharat Mahan
Please note that the correct URL of IndicTrans website is http://www.indictrans.in/
Please use this URL instead of the one given in article. We are trying to acquire the old indictrans.org and indictrans.com domains back so that the links referenced earlier still work.
regards,
Swapnil
IndicTrans Team
ple’s send me the voter list in pune .
please send me the voter list of pune