Search in directory encoding exception

Submitted by yohann.martineau on Thursday, 26 April, 2012 - 16:00

I try to use jedit to make recursive text search in all files in a directory.

When I perform a search, jedit complains about cp1252 encoding not applicable on pdf files. Yes, I'm on windows xp...

The file could not be loaded correctly (some data might be lost) with
the encoding "Cp1252".
(java.nio.charset.UnmappableCharacterException: Input length = 1)
Try selecting a different encoding.
It can be selected with the menu File->Reload with Encoding.
If you want it to be done automatically, add the candidates into
"List of fallback encodings" in Encodings pane of Global Options.

I don't understand why jedit tries to search in pdf files as pdf files are binary files. Does it try to extract text from pdf files and then apply an encoding to find text?

It seems that jedit can skip binary files, it seems to be an interesting feature, but I don't know how jedit considers files as binary. If I uncheck "skip binary files", the same exception pops out but for bin files, obj files, etc. I've been looking at the documentation, but cannot find anything about binary files association. I think this behavior is expected, but not very user friendly.

Is it possible to use jedit to search in all binary files as well?

Does it just try to decode the file using its default encoding (cp1252, I'm on windows xp) and considers the file as binary if it fails? (I guess it's not that simple)

thanks,

yohann

« May 2025
Mo	Tu	We	Th	Fr	Sa	Su
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

file	ver	dls
GdbPlugin for jEdit 4.5+	0.5	1163
Hypersearch results analysis	1.0	2248
German Language Pack for jEdit 5 (up-to-date)	5.3	4157
Goal column macros	1.0	4047
Hyper-search all .txt files in home dir	1	3303
Select line	1.0	3459
Open_Copied_Path.bsh	1.0	8518
Select_All_or_Lines.bsh	1.0	3428
A BeanShell macro script to search and open a recent file or a file in the current directory.	1.0	5653
Select contents in between parentheses (excluding parentheses)	1.0	3557

file	ver	dls
German Localization light	4.4.2.1	108254
Context Free Art (*.cfdg)	0.31	46074
BBEdit scheme	1.0	18609
JBuilder scheme	.001	18511
ColdFusion scheme	1.0	18044
R Edit Mode - extensive version	0.1	17490
Advanced HTML edit mode	1.0	16226
Matlab Edit Mode	1.0	16088
jEdit XP icons	1.0	15248
XP icons for jEdit	1.1	14312

RSS

XML

HTML