hangul-asm
Disassemble the complex character set of Korean letters (Hangul) into a serialized (disassembled) form, and assemble it back from that form. This way, Korean text can be fed into a neural network just like English text.
$ node app
안녕하세요 한글 테스트입니다
§' ¥¥« §
$ node app decode
§' ¥¥« §
안녕하세요 한글 테스트입니다
var lib =
var encoded = lib
var decoded = liblib;
// output(using_char) : §' ¥¥« §
// output(using_charcode) : [256, 123, 145, ...]
Natural representation for Hangul (Korean letters)
않다
-->ㅇㅏㄴㅎㄷㅏ
아니하다
-->ㅇㅏㄴㅣㅎㅏㄷㅏ
이걸
-->ㅇㅣㄱㅓㄹ
이것을
-->ㅇㅣㄱㅓㅅㅇㅡㄹ
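The decompositions above can be reproduced with plain Unicode arithmetic. The following is a minimal sketch, not hangul-asm's actual code: the function name `decompose` and the three tables are hypothetical, and it assumes that compound final consonants are split into their parts (as in 않 --> ㅇㅏㄴㅎ), while compound vowels are left whole. It does not produce the compact encoded characters or charcodes shown earlier.

```javascript
// Sketch only: standard Hangul syllable arithmetic, not the library's implementation.
// A precomposed syllable is 0xAC00 + (initial * 21 + vowel) * 28 + final.
const INITIALS = [...'ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ']; // 19 initial consonants
const VOWELS = [...'ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ']; // 21 vowels
// 28 finals (index 0 = no final); compound finals are written as their parts (assumption).
const FINALS = [
  '', 'ㄱ', 'ㄲ', 'ㄱㅅ', 'ㄴ', 'ㄴㅈ', 'ㄴㅎ', 'ㄷ', 'ㄹ', 'ㄹㄱ',
  'ㄹㅁ', 'ㄹㅂ', 'ㄹㅅ', 'ㄹㅌ', 'ㄹㅍ', 'ㄹㅎ', 'ㅁ', 'ㅂ', 'ㅂㅅ', 'ㅅ',
  'ㅆ', 'ㅇ', 'ㅈ', 'ㅊ', 'ㅋ', 'ㅌ', 'ㅍ', 'ㅎ',
];

const SYLLABLE_BASE = 0xac00; // '가'
const SYLLABLE_LAST = 0xd7a3; // '힣'

// Hypothetical helper: turn each Hangul syllable into its letters,
// passing every other character (spaces, Latin text, ...) through unchanged.
function decompose(text) {
  let out = '';
  for (const ch of text) {
    const code = ch.codePointAt(0);
    if (code < SYLLABLE_BASE || code > SYLLABLE_LAST) {
      out += ch;
      continue;
    }
    const offset = code - SYLLABLE_BASE;
    const initial = Math.floor(offset / (21 * 28));
    const vowel = Math.floor((offset % (21 * 28)) / 28);
    const tail = offset % 28;
    out += INITIALS[initial] + VOWELS[vowel] + FINALS[tail];
  }
  return out;
}

console.log(decompose('않다')); // ㅇㅏㄴㅎㄷㅏ
```

Because related forms such as 않다 and 아니하다 share most of their letter sequence after decomposition, a model reading the serialized text sees them as similar inputs rather than as unrelated syllable blocks.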
Version history
- v1.1.0
  - Add charcode option
  - Refactor code