Mercurial > hg > nginx-ranges
view contrib/unicode2nginx/unicode-to-nginx.pl @ 518:4b0d7f0bf22b NGINX_0_8_6
nginx 0.8.6
*) Feature: the ngx_http_geoip_module.
*) Bugfix: XSLT filter may fail with message "not well formed XML
document" for valid XML document.
Thanks to Kuramoto Eiji.
*) Bugfix: now in MacOSX, Cygwin, and nginx/Windows locations given by
a regular expression are always tested in case insensitive mode;
*) Bugfix: now nginx/Windows ignores trailing dots in URI.
Thanks to Hugo Leisink.
*) Bugfix: name of file specified in --conf-path was not honored during
installation; the bug had appeared in 0.6.6.
Thanks to Maxim Dounin.
author | Igor Sysoev <http://sysoev.ru> |
---|---|
date | Mon, 20 Jul 2009 00:00:00 +0400 |
parents | fa32d59d9a15 |
children |
line wrap: on
line source
#!/usr/bin/perl -w # Convert unicode mappings to nginx configuration file format. # You may find useful mappings in various places, including # unicode.org official site: # # http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1251.TXT # http://www.unicode.org/Public/MAPPINGS/VENDORS/MISC/KOI8-R.TXT # Needs perl 5.6 or later. # Written by Maxim Dounin, mdounin@rambler-co.ru ############################################################################### require 5.006; while (<>) { # Skip comments and empty lines next if /^#/; next if /^\s*$/; chomp; # Convert mappings if (/^\s*0x(..)\s*0x(....)\s*(#.*)/) { # Mapping <from-code> <unicode-code> "#" <unicode-name> my $cs_code = $1; my $un_code = $2; my $un_name = $3; # Produce UTF-8 sequence from character code; my $un_utf8 = join('', map { sprintf("%02X", $_) } unpack("C*", pack("U", hex($un_code)))); print " $cs_code $un_utf8 ; $un_name\n"; } else { warn "Unrecognized line: '$_'"; } } ###############################################################################