[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#923948: Encoding error ... I think



Package: anthy
Version: 1:0.3.0
Severity: minor

While playing with Debian source package, I found minor error in source
(due to new directory structure of Debian forked situation) and possible
encoding problem.

See my proposed patches are here.

I think this encoding inconsistency may be the cause of some complain
for recent conversion problem after migrating to UTF-8.


-- System Information:
Debian Release: buster/sid
  APT prefers testing
  APT policy: (500, 'testing'), (10, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 4.19.0-2-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE=en_US:en (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages anthy depends on:
ii  anthy-common    1:0.3.0
ii  libanthy1       1:0.3.0
ii  libanthyinput0  1:0.3.0
ii  libc6           2.28-7

anthy recommends no packages.

anthy suggests no packages.

-- debconf information excluded
>From fe83538be797cb4440eaa86ed93042b09eb0ccaa Mon Sep 17 00:00:00 2001
From: Osamu Aoki <osamu@debian.org>
Date: Sun, 3 Feb 2019 21:36:32 +0900
Subject: [PATCH 1/7] Fix file path to the correct one

Signed-off-by: Osamu Aoki <osamu@debian.org>
---
 doc/POS | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/doc/POS b/doc/POS
index ab07c1a..286d654 100644
--- a/doc/POS
+++ b/doc/POS
@@ -23,7 +23,7 @@ wtype_tは
 であるときに対象側は基準側に含まれる。
 
 
-src-worddic/wtype.hに CC_??として、活用のしかたを分類してある。
+anthy/wtype.hに CC_??として、活用のしかたを分類してある。
 活用表は最後の数文字を見て、語幹と活用形がわかればよいので、
 上一段と下一段は統合して扱う(本当はcannadicの都合)。
  CC_NONE 無活用
-- 
2.20.1

From 185282ff9da4bc8eb79ff613faf4829479a5f574 Mon Sep 17 00:00:00 2001
From: Osamu Aoki <osamu@debian.org>
Date: Sun, 3 Feb 2019 22:08:59 +0900
Subject: [PATCH 3/7] Convert to UTF-8

This is very odd to see this file in non UTF-8 format.
So converted to UTF-8.

Signed-off-by: Osamu Aoki <osamu@debian.org>
---
 depgraph/master.depword | 38 +++++++++++++++++++-------------------
 1 file changed, 19 insertions(+), 19 deletions(-)

diff --git a/depgraph/master.depword b/depgraph/master.depword
index d897485..bb760f4 100644
--- a/depgraph/master.depword
+++ b/depgraph/master.depword
@@ -1,37 +1,37 @@
 # UTF-8
 #
-# �°���������# 付属語の列を定義する
 #
-# indepword.txt�˳�ʻ줴�Ȥκǽ��Ρ��ɤ������Ƥ��롣
+# indepword.txtに各品詞ごとの最初のノードが定義してある。
 #
-# ��ɽ
+# 活用表
 # \\include conjugate.table
-# ��������+# 活用するもの
 # \\include conjugate.depword
-# �����ʤ���
+# 活用しないもの
 # \\include fix.depword
 #
-# ̾��# 名詞
 # \\include noun.depword
 # \\include noun-variant.depword
-# ���# 副詞
 # \\include av.depword
-# ư��# 動詞
 # \\include v.depword
-# ��ƻ�# 形容詞
 # \\include a.depword
-# ������# 形容動詞
 # \\include ajv.depword
 #
 #
-@�³��" Sy@ Sy@�����# �����Τ��ˤ����ʤ����ΤǴ�����Ȥ���
-#@������� "�� "��" @
-@�����" @
-@Ϣ���" St@
-#@����"��" "����" "������"�� "��" "��" "�Ȥ�"�ä� "�� "��"��"��@ @���ѥ졼��
-@������ "" @
-@����� "" @
-@�@
+@接続詞 "" Sy@ Sy@間投助詞
+# 誤変換のもとにしかならないので間投助詞は使わない
+#@間投助詞 "ね" "な" "さ" @
+@間投助詞 "" @
+@連体詞 "" St@
+#@終助詞 "か" "かい" "かしら" "な" "ぞ" "ぜ" "とも" "って" "の" "わ" "や" "よ" @ @セパレータ
+@開き括弧 "" @
+@閉じ括弧 "" @
+@単漢字 "" @
 #
 #
-- 
2.20.1


Reply to: