convert character set (German)

Tomas By

I have a text file that uses various characters in the 128+ range in currently non-standard ways. The file command just says Non-ISO extended-ASCII.

From the context I can recognise these:

Octal 201: u + unlaut
      204: a + umlaut
      216: A + umlaut
      224: o + umlaut
      341: double s

(There are many others, which I suspect are graphical symbols, not characters.)

Addition, example:

 example:   E0X A ANCIENT.IMG 2 0 C:\DOS\DISKOPT.EXE A: /O /Sa /M2
              ДВД В ДДВДДДДДДДД В Д ДДДДДДДВДДДДДДДДДД ДДДДДДДВДДДДД
           і  і   і         і          і                  і
     load E0X ДЩ  АДДДДДДДДДї   і          і                  і
                      і     і   і          і                  і
     with ANCIENT.IMG Щ     і   і          і                  і
                            і   і          і                  і
     for drive A: ДДДДДДДДДДЩ   і          і                  і
                                і          і                  і
     let DISKOPT work ДДДДДДДДДДіДДДДДДДДДДБДДДДДДДДДДДДДДДДДДЩ
                    і
     and write the result back to disk if finished.

(The graphical chars are octal 263, 277, 302, 304, 331.)

And here is the link to the file: e0x.arj. It is the E0X.ENG, but I guess it is the same encoding in all the text files.

Which character set is this, and how can I make it readable on a modern computer?

Francesco Potortì

Most probably the character positions you mention are octal numbers: 201 (which is customarily written as 0201 to make it clear it's octal) is decimal 129, or 0x81.

Those characters are consistent with several DOC codepages:

  • VGA codepage 437 (VGA ROM charset)
  • Codepage 437 (IBM-PC: default)
  • Codepage 775 (IBM-PC: Baltic)
  • Codepage 850 (IBM-PC: European)
  • Codepage 852 (IBM-PC: East European)
  • Codepage 857 (IBM-PC: Turkish)
  • Codepage 861 (IBM-PC: Icelandic)
  • Codepage 865 (IBM-PC: Nordic European)

If it's German, I'd bet that it's 437 or 850. Any editor should be able to read that text file and write it in a different character set.

For example you can read it with Notepad++ and write it in UTF-8 if you are sure you need that.

P.S. after reading the file that you attached, I can see that E0X.ENG charset is MS-DOS codepage 437. You can see it converted to utf-8 at https://pastebin.com/LdnQCpk4.

If you run on Linux, you can automate conversion with GNU recode. If you run on DOS, I see this recode utility https://docs.seneca.nl/Smartsite-Docs/Features-Modules/Features/Tools/Recode-commandline-utility.html should do the same

이 기사는 인터넷에서 수집됩니다. 재 인쇄 할 때 출처를 알려주십시오.

침해가 발생한 경우 연락 주시기 바랍니다[email protected] 삭제

에서 수정
0

몇 마디 만하겠습니다

0리뷰
로그인참여 후 검토

관련 기사

분류에서Dev

convert character linestring to geometry in sf

분류에서Dev

how to convert a character array to decibel

분류에서Dev

VIM - Bat to convert character encodings

분류에서Dev

Trying to Convert String to Character Pointer?

분류에서Dev

How to convert decimal value to character in c language

분류에서Dev

How to convert unicode to its original character in Python

분류에서Dev

Swift Convert Hex String or Character to Integer

분류에서Dev

How to convert a json string into an individual character array in bash?

분류에서Dev

Convert an array of character to string from index i to j in c++

분류에서Dev

Convert keycode to character using javascript,like keyCode 27 to esc

분류에서Dev

Delete weird ANSI character and convert accented ones using Python

분류에서Dev

How do I convert a TTF into individual PNG character images?

분류에서Dev

INSERT 값은 SET NAMES, SET CHARACTER SET로 인코딩됩니까?

분류에서Dev

How to replace set of repeating String pattern with specific character?

분류에서Dev

Change character set of downloaded file via response headers

분류에서Dev

mysql_query("SET NAMES 'UTF8'"); to solve '???' character

분류에서Dev

Is there a termination character for bq in interactive mode? How do I set it?

분류에서Dev

set caret position to last character in masked text box - winform?

분류에서Dev

GAE의 SQLAlchemy 'character_set_name'오류

분류에서Dev

Set if-condition for new line every 10th character

분류에서Dev

How to use regex to select a character outside of and after a set of quotations

분류에서Dev

Can't display german umlaut

분류에서Dev

In bash, how can I convert a Unicode Codepoint [0-9A-F] into a printable character?

분류에서Dev

Convert escape characters in strings (like "\\n" - two characters) into ASCII character (newline)

분류에서Dev

Inconsistent date time format for German locale

분류에서Dev

LibreOffice - getting German spell checking to work (Debian)

분류에서Dev

How to change directory to desktop on german ubuntu?

분류에서Dev

How to convert string to variable and set attributes of them in jquery

분류에서Dev

Python Convert set to list, keep getting TypeError: 'list' object is not callable

Related 관련 기사

  1. 1

    convert character linestring to geometry in sf

  2. 2

    how to convert a character array to decibel

  3. 3

    VIM - Bat to convert character encodings

  4. 4

    Trying to Convert String to Character Pointer?

  5. 5

    How to convert decimal value to character in c language

  6. 6

    How to convert unicode to its original character in Python

  7. 7

    Swift Convert Hex String or Character to Integer

  8. 8

    How to convert a json string into an individual character array in bash?

  9. 9

    Convert an array of character to string from index i to j in c++

  10. 10

    Convert keycode to character using javascript,like keyCode 27 to esc

  11. 11

    Delete weird ANSI character and convert accented ones using Python

  12. 12

    How do I convert a TTF into individual PNG character images?

  13. 13

    INSERT 값은 SET NAMES, SET CHARACTER SET로 인코딩됩니까?

  14. 14

    How to replace set of repeating String pattern with specific character?

  15. 15

    Change character set of downloaded file via response headers

  16. 16

    mysql_query("SET NAMES 'UTF8'"); to solve '???' character

  17. 17

    Is there a termination character for bq in interactive mode? How do I set it?

  18. 18

    set caret position to last character in masked text box - winform?

  19. 19

    GAE의 SQLAlchemy 'character_set_name'오류

  20. 20

    Set if-condition for new line every 10th character

  21. 21

    How to use regex to select a character outside of and after a set of quotations

  22. 22

    Can't display german umlaut

  23. 23

    In bash, how can I convert a Unicode Codepoint [0-9A-F] into a printable character?

  24. 24

    Convert escape characters in strings (like "\\n" - two characters) into ASCII character (newline)

  25. 25

    Inconsistent date time format for German locale

  26. 26

    LibreOffice - getting German spell checking to work (Debian)

  27. 27

    How to change directory to desktop on german ubuntu?

  28. 28

    How to convert string to variable and set attributes of them in jquery

  29. 29

    Python Convert set to list, keep getting TypeError: 'list' object is not callable

뜨겁다태그

보관