Black Lives Matter. Support the Equal Justice Initiative and read our statement here.
Unicopedia Sinica

Unicopedia Sinica

Set of Unicode utilities related to ideographs, wrapped into one single app.

Unicopedia Sinica icon UNICOPEDIA SINICA

Unicopedia Sinica is a developer-oriented set of Unicode utilities related to ideographs, wrapped into one single app, built with Electron.

This desktop application works on macOS, Linux and Windows operating systems.

Unicopedia Sinica social preview

Utilities

The following utilities are currently available:

  • CJK Components
    • Look Up IDS
    • Parse IDS
    • Match IDS
    • Find by Components
  • CJK Sources
  • JavaScript Runner

CJK Components

Look Up IDS

CJK Components - Look Up IDS screenshot

CJK Components - Look Up IDS - Show Graphs screenshot

Parse IDS

CJK Components - Parse IDS screenshot

Match IDS

  • The Match IDS feature of the CJK Components utility displays a list of IDS-matching Unihan characters, including through regular expressions. It makes use of the IDS (Ideographic Description Sequences) defined in the IDS.TXT data file, maintained by Andrew West.
  • After entering a query, click on the Search button to display a list of all relevant matches, if any, ordered by code point value.
  • Click on the Nested Match toggle button to extend the search to IDS-nested characters whose IDS match the query string.
  • Click on the Code Points checkbox to display the code point under each matching Unihan character.
  • It is possible to choose how many characters are shown one page at a time.
  • The search is performed on the set of 92,865 Unihan characters (excluding CJK compatibility ideographs) defined in Unicode 14.0.
  • The results may include the searched component itself when it happens to be a proper Unihan character too.
  • Use the Results ▾ pop-up menu to perform an action among:
    • Copy Results [copy the results as string to the clipboard]
    • Save Results.. [save the results as string to a text file]
    • Clear Results [clear the current list of results]
  • Various examples of regular expressions are provided for quick copy-and-paste.
  • Notes:

CJK Components - Match IDS screenshot

Find by Components

CJK Components - Find by Components screenshot

CJK Sources

CJK Sources screenshot

JavaScript Runner

  • The JavaScript Runner utility lets you execute JavaScript code, and comes with several sample scripts related to CJK and IDS; it is useful for quick testing/prototyping or data processing.

JavaScript Runner screenshot

Using

You can download the latest release for macOS.

Building

You'll need Node.js (which comes with npm) installed on your computer in order to build this application.

Clone method

# Clone the repository
git clone https://github.com/tonton-pixel/unicopedia-sinica
# Go into the repository
cd unicopedia-sinica
# Install dependencies
npm install
# Run the application
npm start

Note: to use the clone method, the core tool git must also be installed.

Download method

If you don't wish to clone, you can download the source code, unZip it, then directly run the following commands from a Terminal opened at the resulting unicopedia-sinica-master folder location:

# Install dependencies
npm install
# Run the application
npm start

Packaging

Several scripts are also defined in the package.json file to build OS-specific bundles of the application, using the simple yet powerful Electron Packager Node module.
For instance, running the following command (once the dependencies are installed) will create a Unicopedia Sinica.app version for macOS:

# Build macOS (Darwin) application
npm run build-darwin

License

The MIT License (MIT).

Copyright © 2021 Michel Mariani.

not_used

Something missing? Edit this app.

Keyboard Shortcuts

Key Action
/ Focus the search bar
Esc Focus the search bar and cleans it
Select the next search result
Select the previous search result
Enter Open the selected search result
cmdEnter Ctrl+Enter Open the selected search result in a new tab