Searching in Languages with Non-Roman Scripts

Columbia Libraries have extensive collections in many languages, but searching for languages in non-Roman scripts can be tricky. Some catalog records can be found using the language's native script, but many library resources have been cataloged using romanization tables for transliteration. Older catalog records may also use variant romanization schemes. When searching for non-roman script items in the library catalog, it can be useful to try several spelling variants.

Search tools and tips for specific languages and scripts can be found below. When available, romanization tables from the American Library Association and Library of Congress (ALA-LC) are included, which can be used as guides for searching library catalogs in the United States.

See ALA-LC romanization tables for all available languages

African Scripts

Columbia selectively collects a small number of publications in African languages written in indigenous scripts, including the following. Where possible, the ALA-LC romanization table is linked.

Amharic Romanization Table
Guinean Pular (no romanization table available)
Mande languages in N'ko Romanization Table
Moroccan Tamazight Romanization Table
Senegalese Wofolal (no romanization table available)
Tamashek of Algeria, Mali, and Niger Romanization Table
Tigrinya Romanization Table
Other West African languages using the African ajami script (no romanization table available)

Contact a librarian for more help:

Yuusuf Caruso

Meet with me

African Studies Librarian

Global Studies

caruso@columbia.edu

(212) 854-8045
Global Studies - 309 International Affairs

Arabic

CLIO can be searched using Arabic script. However, only books cataloged after 2007 will have a bibliographic description in Arabic. Older records can only be found using Latin alphabet. CLIO and most other library catalogs use the following ALA-LC transliteration rules.

Arabic ALA-LC Romanization Tables:

Please note that these rules vary slightly from those used by IJMES or Encyclopaedia of Islam.

We also recommend searching in WorldCat, where a large percentage of books has description in Arabic script and records that link back to CLIO catalog.

Contact a librarian for more help:

Peter Magierski

Middle East & Islamic Studies Librarian

Global Studies

pm2650@columbia.edu

(212) 854-3995
Global Studies - 309 International Affairs

Chinese

Contact a librarian for more help:

Chengzhi Wang

Meet with me

Chinese Studies Librarian

Starr East Asian Library

cw2165@columbia.edu

(212) 854-3721
C.V. Starr East Asian Library - 307M Kent Hall

Cyrillic

Cyrillic ALA-LC Romanization Tables:

Hebrew

Nearly 60% of the records for Hebrew works in CLIO include Hebrew characters, so it is worthwhile to try searching in Hebrew if you are looking for a specific text.

Library transliteration can differ slightly from typical academic transliteration of Hebrew. Hebrew transliteration has also changed significantly over the 20th century, and so a book that was cataloged in the 1940s might look different than a book from 2003. If you cannot find your book, it is also helpful to seach by author.

Another useful tool is to search Worldcat, the "catalog of catalogs," which will indicate if Columbia holds a title, and will give a direct link to that title in CLIO, if it is available.

Contact a librarian for more help:

Michelle Margolis

Meet with me

Norman E. Alexander Librarian for Jewish Studies

Global Studies

mc3395@columbia.edu

(212) 854-8046
Global Studies - 309 International Affairs

Japanese

Contact a librarian for more help:

Chiaki Sakai

Meet with me

Japanese Studies Librarian

Starr East Asian Library

cs3948@columbia.edu

(212) 854-1506
C.V. Starr East Asian Library - 308M Kent Hall

Korean

Contact a librarian for more help:

Hee-Sook Shin

Meet with me

Korean Studies Librarian

Starr East Asian Library

hs2148@columbia.edu

(212) 854-1507
C.V. Starr East Asian Library - 310M Kent Hall

South Asian Scripts

OCLC currently claims to support South Asian scripts such as Assamese, Bengali, Devanagari, Hindi, Marathi, Nepali, Persian, Sanskrit, Sherpa, Tamil, Urdu. In practice, only a small percentage of materials cataloged in Worldcat are directly searchable by non-Roman scripts. Hence it is necessary to know how to search using non-Roman script standard transliteration conventions.

ALA-LC transliteration guidelines for specific languages and scripts may vary from standards common to introductory South Asia language learning texts. A few examples:

In Hindi transliteration, the vowel a is implicit after all consonants and consonant clusters and is supplied in transliteration unless another vowel is specifically indicated or the absence of a vowel is clearly indicated by the virāma. Thus, for example, the Hindi word for ‘book’ is transliterated as kitāba and pustaka (rather than as kitāb and pustak).
Transliteration of Sanskrit, Pali and Prakrit conjunct compounds may vary; sometimes rendered as a single word/unit; sometimes with dashes separating individual elements; and sometimes with spaces between individual ‘words.’