You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
qmk_firmware/docs/feature_unicode.md

10 KiB

Unicode Support

There are three Unicode keymap definition methods available in QMK:

UNICODE_ENABLE

Supports Unicode input up to 0xFFFF. The keycode function is UC(n) in keymap file, where n is a 4 digit hexadecimal.

UNICODEMAP_ENABLE

Supports Unicode up to 0xFFFFFFFF. You need to maintain a separate mapping table const uint32_t PROGMEM unicode_map[] = {...} in your keymap file. The keycode function is X(n) where n is the array index of the mapping table.

And you may want to have an enum to make reference easier. So you'd want to add something like this to your keymap:

enum unicode_names {
  BANG,
  IRONY,
  SNEK,
};

const uint32_t PROGMEM unicode_map[] = {
  [BANG]    = 0x203D,  // ‽
  [IRONY]   = 0x2E2E,  // ⸮
  [SNEK]    = 0x1F40D, // 🐍
}:

Make sure that the order for both matches.

UCIS_ENABLE

Supports Unicode up to 0xFFFFFFFF. As with UNICODE_MAP, you may want to main a mapping table in your keymap file. However, there is no keycodes for this feature, you will have to add a keycode or function to call qk_ucis_start(). Once you've run that, you can just type the text for your unicode, and then hit space or enter to complete it, or ESC to cancel it. And if it matches an entry in your table, it will automatically "backspace" the trigger word (from your table) and then will input the unicode sequence.

For instance, you would need to have a table like this in your keymap:

const qk_ucis_symbol_t ucis_symbol_table[] = UCIS_TABLE
(
 UCIS_SYM("poop", 0x1f4a9),
 UCIS_SYM("rofl", 0x1f923),
 UCIS_SYM("kiss", 0x1f619)
);

You run the function, and then type "rofl" and hit enter, it should backspace remove "rofl" and input the 0x1f923 unicode.

Customization

There are several functions that you can add to your keymap to customize the functionality of this feature.

  • void qk_ucis_start_user(void) - This runs when you run the "start" function, and can be used to provide feedback. By default, it types out a keyboard emoji.
  • void qk_ucis_success(uint8_t symbol_index) - This runs when the unicode input has matched something, and has completed. Default doesn't do anything.
  • void qk_ucis_symbol_fallback (void) - This runs if the input text doesn't match anything. The default function falls back to trying that input as a unicode code.

The default code for these are:

void qk_ucis_start_user(void) { // outputs keyboard emoji
  unicode_input_start();
  register_hex(0x2328);
  unicode_input_finish();
}

void qk_ucis_success(uint8_t symbol_index) {
}

void qk_ucis_symbol_fallback (void) { // falls back to manual unicode entry
  for (uint8_t i = 0; i < qk_ucis_state.count - 1; i++) {
    uint8_t code = qk_ucis_state.codes[i];
    register_code(code);
    unregister_code(code);
    wait_ms(UNICODE_TYPE_DELAY);
  }
}

Input Modes

Unicode input in QMK works by inputting a sequence of characters to the OS, sort of like a macro. Unfortunately, the way this is done differs for each platform. Specifically, each platform requires a different combination of keys to trigger Unicode input. Therefore, a corresponding input mode has to be set in QMK.

The following input modes are available:

  • UC_OSX: Mac OS X built-in Unicode hex input. Supports code points up to 0xFFFF (0x10FFFF with UNICODEMAP).

    To enable, go to System Preferences > Keyboard > Input Sources, add Unicode Hex Input to the list (it's under Other), then activate it from the input dropdown in the Menu Bar.

  • UC_OSX_RALT: Same as UC_OSX, but sends the Right Alt key for unicode input

  • UC_LNX: Linux built-in IBus Unicode input. Supports all possible code points (0x10FFFF).

    Enabled by default and works almost anywhere on IBus-enabled distros. Without IBus, this mode works under GTK apps, but rarely anywhere else.

  • UC_BSD: (non operational) Unicode input method under BSD.

  • UC_WIN: (not recommended) Windows built-in hex numpad Unicode input. Supports code points up to 0xFFFF.

    To enable, create a registry key under HKEY_CURRENT_USER\Control Panel\Input Method\EnableHexNumpad of type REG_SZ called EnableHexNumpad and set its value to 1. This can be done from the Command Prompt by running reg add "HKCU\Control Panel\Input Method" -v EnableHexNumpad -t REG_SZ -d 1 with administrator privileges. Afterwards, reboot. This mode is not recommended because of reliability and compatibility issues; use the UC_WINC mode instead.

  • UC_WINC: Windows Unicode input using WinCompose. As of v0.8.2, supports code points up to 0xFFFFF.

    To enable, install the latest release. Once installed, WinCompose will automatically run on startup. Works reliably under all version of Windows supported by the app. This mode uses the right Alt key (KC_RALT), and this needs to be set to this in WinCompose's settings.

!> There is an input mode option for BSD, but it's not implemented at this time. If you use BSD and want to help with adding support for it, please open an issue on GitHub.

Switching Input Modes

There are two ways to set the input mode for Unicode: by keycode or by function. Keep in mind that both methods write to persistent storage (EEPROM), and are loaded each time the keyboard starts. So once you've set it once, you don't need to set it again unless you want to change it, or you've reset the EEPROM settings.

You can switch the input mode at any time by using one of the following keycodes. The easiest way is to add the ones you use to your keymap.

Keycode Alias Input mode Description
UNICODE_MODE_OSX UC_M_OS UC_OSX Switch to Mac OS X input.
UNICODE_MODE_LNX UC_M_LN UC_LNX Switch to Linux input.
UNICODE_MODE_WIN UC_M_WI UC_WIN Switch to Windows input.
UNICODE_MODE_BSD UC_M_BS UC_BSD Switch to BSD input (not implemented).
UNICODE_MODE_WINC UC_M_WC UC_WINC Switch to Windows input using WinCompose.
UNICODE_MODE_OSX_RALT UC_M_OR UC_OSX_RALT Switch to Mac OS X input using Right Alt.

You can also switch the input mode by calling set_unicode_input_mode(x) in your code, and this works the same way as the above keycodes.

Audio Feedback

If you have the Audio feature enabled on the board, you can set melodies to be played when you press the above keys. That way you can have some audio feedback when switching input modes.

For instance, you can add these definitions to your config.h file:

#define UNICODE_SONG_OSX  COIN_SOUND
#define UNICODE_SONG_LNX  UNICODE_LINUX
#define UNICODE_SONG_BSD  MARIO_GAMEOVER
#define UNICODE_SONG_WIN  UNICODE_WINDOWS
#define UNICODE_SONG_WINC UNICODE_WINDOWS
#define UNICODE_SONG_OSX_RALT COIN_SOUND

Additional Customization

The functions for starting and finishing Unicode input on your platform can be overridden locally. Possible uses include customizing input mode behavior if you don't use the default keys, or adding extra visual/audio feedback to Unicode input.

  • void unicode_input_start(void) This sends the initial sequence that tells your platform to enter Unicode input mode. For example, it presses Ctrl+Shift+U on Linux and holds the Option key on Mac.
  • void unicode_input_finish(void) This is called to exit Unicode input mode, for example by pressing Space or releasing the Option key.

You can find the default implementations of these functions in process_unicode_common.c.

send_unicode_hex_string

To type multiple characters for things like (ノಠ痊ಠ)ノ彡┻━┻, you can use send_unicode_hex_string() much like SEND_STRING() except you would use hex values separate by spaces. For example, the table flip seen above would be send_unicode_hex_string("0028 30CE 0CA0 75CA 0CA0 0029 30CE 5F61 253B 2501 253B")

There are many ways to get a hex code, but an easy one is this site. Just make sure to convert to hexadecimal, and that is your string.

Additional Language Support

In quantum/keymap_extras/, you'll see various language files - these work the same way as the alternative layout ones do. Most are defined by their two letter country/language code followed by an underscore and a 4-letter abbreviation of its name. FR_UGRV which will result in a ù when using a software-implemented AZERTY layout. It's currently difficult to send such characters in just the firmware.

International Characters on Windows

AutoHotkey allows Windows users to create custom hotkeys among others.

The method does not require Unicode support in the keyboard itself but depends instead of AutoHotkey running in the background.

First you need to select a modifier combination that is not in use by any of your programs. CtrlAltWin is not used very widely and should therefore be perfect for this. There is a macro defined for a mod-tab combo LCAG_T. Add this mod-tab combo to a key on your keyboard, e.g.: LCAG_T(KC_TAB). This makes the key behave like a tab key if pressed and released immediately but changes it to the modifier if used with another key.

In the default script of AutoHotkey you can define custom hotkeys.

<^<!<#a::Send, ä
<^<!<#<+a::Send, Ä

The hotkeys above are for the combination CtrlAltGui and CtrlAltGuiShift plus the letter a. AutoHotkey inserts the Text right of Send, when this combination is pressed.

US International

If you enable the US International layout on the system, it will use punctuation to accent the characters.

For instance, typing "`a" will result in à.

You can find details on how to enable this here.