Class CharsetRecog_Unicode

    • Constructor Detail

      • CharsetRecog_Unicode

        CharsetRecog_Unicode()
    • Method Detail

      • getName

        abstract java.lang.String getName()
        Description copied from class: CharsetRecognizer
        Get the IANA name of this charset.
        Specified by:
        getName in class CharsetRecognizer
        Returns:
        the charset name.
      • match

        abstract CharsetMatch match​(CharsetDetector det)
        Description copied from class: CharsetRecognizer
        Test the match of this charset with the input text data which is obtained via the CharsetDetector object.
        Specified by:
        match in class CharsetRecognizer
        Parameters:
        det - The CharsetDetector, which contains the input text to be checked for being in this charset.
        Returns:
        A CharsetMatch object containing details of match with this charset, or null if there was no match.
      • codeUnit16FromBytes

        static int codeUnit16FromBytes​(byte hi,
                                       byte lo)
      • adjustConfidence

        static int adjustConfidence​(int codeUnit,
                                    int confidence)