hangul-asm-tmp

1.1.0 • Public • Published

hangul-asm

Disassembles the composed syllable blocks of Korean text into a serialized (disassembled) sequence of letters, and assembles that sequence back into the original text. This way, Korean text can be fed into a neural network one letter at a time, just like English text.

$ node app
안녕하세요 한글 테스트입니다
§' ¥¥« §

$ node app decode
§' ¥¥« §
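안녕하세요 한글 테스트입니다

var lib = require('hangul-asm')

// Default mode: the encoded form is a string of serialized characters
var encoded = lib.encode('한글 풀어쓰기')
var decoded = lib.decode(encoded)
// output(using_char) : §' ¥¥« §

// With the using_charcode option, the encoded form is an array of character codes
lib.option({ "using_charcode": true });
// output(using_charcode) : [256, 123, 145, ...]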
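
To illustrate the difference between a character-based and a code-based representation, here is a minimal sketch in plain JavaScript. The helper names `toCharCodes` and `fromCharCodes` are hypothetical and are not part of hangul-asm; the numbers are ordinary Unicode code points, which are not necessarily the codes the package itself emits.

```javascript
// Hypothetical helpers, NOT part of hangul-asm: convert a string to an
// array of Unicode code points and back again.
function toCharCodes(str) {
  // Array.from iterates by code point, so each letter yields one number.
  return Array.from(str, (ch) => ch.codePointAt(0));
}

function fromCharCodes(codes) {
  return String.fromCodePoint(...codes);
}

const codes = toCharCodes('ㅎㅏㄴ');
console.log(codes);                // numeric form, e.g. for an embedding lookup
console.log(fromCharCodes(codes)); // back to the original letters
```

A numeric array like this is usually the more convenient input for a model, since each code can be mapped directly to an embedding index.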

Natural representation for Hangul (Korean letters)

  • 않다 --> ㅇㅏㄴㅎㄷㅏ
  • 아니하다 --> ㅇㅏㄴㅣㅎㅏㄷㅏ
  • 이걸 --> ㅇㅣㄱㅓㄹ
  • 이것을 --> ㅇㅣㄱㅓㅅㅇㅡㄹ
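
The mappings above can be reproduced with standard Unicode arithmetic on precomposed Hangul syllables. The following is a minimal sketch under that assumption, not the package's actual implementation: the function name `disassemble` and the lookup tables are hypothetical. Splitting compound final consonants (for example ㄶ into ㄴㅎ) is inferred from the 않다 example.

```javascript
// Hypothetical sketch, NOT the hangul-asm implementation.
// A precomposed syllable has the code point
//   0xAC00 + (initial * 21 + medial) * 28 + final
const INITIALS = 'ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ';       // 19 initial consonants
const MEDIALS = 'ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ';    // 21 vowels
const FINALS = [                                          // 28 finals, index 0 = none
  '', 'ㄱ', 'ㄲ', 'ㄱㅅ', 'ㄴ', 'ㄴㅈ', 'ㄴㅎ', 'ㄷ', 'ㄹ', 'ㄹㄱ',
  'ㄹㅁ', 'ㄹㅂ', 'ㄹㅅ', 'ㄹㅌ', 'ㄹㅍ', 'ㄹㅎ', 'ㅁ', 'ㅂ', 'ㅂㅅ', 'ㅅ',
  'ㅆ', 'ㅇ', 'ㅈ', 'ㅊ', 'ㅋ', 'ㅌ', 'ㅍ', 'ㅎ',
];

function disassemble(text) {
  let out = '';
  for (const ch of text) {
    const offset = ch.codePointAt(0) - 0xAC00;
    // Pass through anything outside the syllable block U+AC00..U+D7A3.
    if (offset < 0 || offset > 11171) {
      out += ch;
      continue;
    }
    const initial = Math.floor(offset / (21 * 28));
    const medial = Math.floor((offset % (21 * 28)) / 28);
    const final = offset % 28;
    out += INITIALS[initial] + MEDIALS[medial] + FINALS[final];
  }
  return out;
}

console.log(disassemble('않다'));
console.log(disassemble('이것을'));
```

Reassembly is the harder direction, because a consonant between two vowels may belong to either the preceding or the following syllable; this sketch does not attempt it.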

Version history

  • v1.1.0
    • Add charcode option
    • Refactor code

Install

npm i hangul-asm-tmp

License

ISC

Unpacked Size

7.52 kB

Total Files

5

Collaborators

  • keicoon