Document the encoding used when hashing strings

The README mentions

> Per bcrypt implementation, only the first 72 bytes of a string are used. Any extra bytes are ignored when matching passwords. Note that this is not the first 72 characters. It is possible for a string to contain less than 72 characters, while taking up more than 72 bytes (e.g. a UTF-8 encoded string containing emojis).

But that is of little value because this library accepts JavaScript strings as parameters, e.g. for `hash`, so how are those strings being represented as bytes? Does it use UTF-8, like in the example? My investigation and tests suggest so.

This is some very important information that I can't find written anywhere. The README even seems to dodge this question since it also mentions

> Compatibility with hashes generated by other languages is not 100% guaranteed due to difference in character encodings. However, it should not be an issue for most cases.

Again keeping it vague.

These are the lines I think are the most relevant and confirm that UTF-8 is being used, in case it saves time for someone.
https://github.com/kelektiv/node.bcrypt.js/blob/2a3c44550e6e32e098dcb4d6fdacee0d28af697f/src/bcrypt_node.cc#L170
https://github.com/nodejs/node-addon-api/blob/7e1aa06132558fcc3de4ef5f4f6b84ff10c32502/napi-inl.h#L1105

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document the encoding used when hashing strings #1007

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Document the encoding used when hashing strings #1007

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions