utf8-byte-chunks

A utility that splits a given string into chunks of a maximum size in bytes.

Use case

The primary use case was feeding user input to a chat API that accepts at most 255 bytes per message.

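Limiting the number of input characters was not an option. Users should be able to submit longer text, e.g. copied and pasted from documents, that may contain multibyte characters. Because a UTF-8 character occupies between 1 and 4 bytes, a character count does not reliably predict the size of a message in bytes.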

The most viable option seemed to be sending multiple requests, each carrying up to 255 bytes of text, until the entire message was sent.
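
To illustrate why the limit has to be applied to bytes rather than characters, here is a minimal sketch of byte-based chunking. It is not the package's actual implementation, which may differ; the names `utf8ByteLength` and `chunkByBytes` are hypothetical and used only for this illustration. The sketch assumes an environment with a global `TextEncoder` (modern browsers and Node.js).

```javascript
// Illustrative sketch only: hypothetical helpers, not this package's API.
const encoder = new TextEncoder();

// Number of bytes a string occupies when encoded as UTF-8.
function utf8ByteLength(text) {
    return encoder.encode(text).length;
}

// Split `text` into chunks of at most `maxBytes` UTF-8 bytes each,
// without cutting a character (code point) in half.
function chunkByBytes(text, maxBytes) {
    const chunks = [];
    let current = '';
    let currentBytes = 0;

    // for...of iterates over whole code points, so surrogate pairs
    // (e.g. emoji) are never separated.
    for (const char of text) {
        const charBytes = utf8ByteLength(char);
        if (charBytes > maxBytes) {
            throw new RangeError('maxBytes is smaller than a single character');
        }
        // Start a new chunk when this character would exceed the limit.
        if (currentBytes + charBytes > maxBytes) {
            chunks.push(current);
            current = '';
            currentBytes = 0;
        }
        current += char;
        currentBytes += charBytes;
    }
    if (current !== '') {
        chunks.push(current);
    }
    return chunks;
}

console.log(utf8ByteLength('a'));  // 1
console.log(utf8ByteLength('€'));  // 3
console.log('😀'.length);          // 2 (UTF-16 code units)
console.log(utf8ByteLength('😀')); // 4
console.log(chunkByBytes('a€b😀c', 4)); // [ 'a€', 'b', '😀', 'c' ]
```

Each resulting chunk can then be sent as a separate request without exceeding the byte limit.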

Installation

Via npm:

npm install --save utf8-byte-chunks

Via yarn:

yarn add utf8-byte-chunks

Example usage

 
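import getBytesizedChunks from 'utf8-byte-chunks';

// Shown as a method of a component class (this.state and this.setState
// suggest e.g. a React component). ChatStore stands in for the
// application's own chat API client and is not part of this package.
async handleSubmit(event) {
    event.preventDefault();

    // Split the current text into chunks of at most 255 bytes,
    // then submit each of them individually, in order.
    const chunks = getBytesizedChunks(this.state.text, 255);
    for (const chunk of chunks) {
        // Skip sending if the component was unmounted in the meantime.
        if (this._isMounted) {
            await ChatStore.submitMessage(chunk);
        }
    }

    // Clear the input and restore focus once everything was sent.
    if (this._isMounted) {
        this.setState({ text: '' }, this.focusInput);
    }
}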

Options

Split on spaces

Pass true as the third argument (splitOnSpace) to prevent the input from being split in the middle of a word. Chunks are then broken at spaces instead.
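
The idea behind splitting on spaces can be sketched as follows. This is an assumption-laden illustration, not the package's implementation: `chunkOnSpaces` and `splitLongWord` are hypothetical names, the handling of words longer than the limit (a hard split by bytes) is an assumption, and the package may treat spaces at chunk boundaries differently. The sketch assumes a global `TextEncoder` and a limit of at least 4 bytes, so that any single character fits.

```javascript
// Illustrative sketch only: hypothetical helpers, not this package's API.
const encoder = new TextEncoder();
const byteLength = (text) => encoder.encode(text).length;

// Hard-split a single over-long word by bytes, keeping code points intact.
// Assumes maxBytes >= 4 so that every single character fits into a chunk.
function splitLongWord(word, maxBytes) {
    const parts = [];
    let current = '';
    for (const char of word) {
        if (current !== '' && byteLength(current + char) > maxBytes) {
            parts.push(current);
            current = '';
        }
        current += char;
    }
    if (current !== '') parts.push(current);
    return parts;
}

// Build chunks word by word; the space at a chunk boundary is dropped.
function chunkOnSpaces(text, maxBytes) {
    const chunks = [];
    let current = '';
    for (const word of text.split(' ')) {
        const candidate = current === '' ? word : current + ' ' + word;
        if (byteLength(candidate) <= maxBytes) {
            current = candidate; // the word still fits into the current chunk
            continue;
        }
        if (current !== '') chunks.push(current);
        if (byteLength(word) <= maxBytes) {
            current = word; // start a new chunk with this word
        } else {
            // The word alone exceeds the limit: fall back to a byte split
            // and continue filling its last part.
            const parts = splitLongWord(word, maxBytes);
            current = parts.pop();
            chunks.push(...parts);
        }
    }
    if (current !== '') chunks.push(current);
    return chunks;
}

console.log(chunkOnSpaces('the quick brown fox', 10));
// [ 'the quick', 'brown fox' ]
console.log(chunkOnSpaces('hi abcdefghijkl yo', 5));
// [ 'hi', 'abcde', 'fghij', 'kl yo' ]
```

With this approach a chunk may be shorter than the limit, since the last word that does not fit is moved to the next chunk as a whole.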

Package details

Version: 0.2.1
License: MIT
Unpacked size: 3.45 kB
Total files: 5
Weekly downloads: 428
Collaborators: loopmode