Differences
Below are the differences between two revisions of the page.
Next revision | Previous revision | ||
fichiers:pdf:produire_des_pdf_cherchables [2018/05/22 11:37] – created joseph.wright | fichiers:pdf:produire_des_pdf_cherchables [2018/12/04 00:22] (current version) – jejust | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | # Copy-paste-able/ | + | ====== |
- | + | ||
Line 7: | Line 5: | ||
hold their text in the encoding of the original TeX font used by | hold their text in the encoding of the original TeX font used by | ||
the document. | the document. | ||
- | |||
When PDF readers, etc., offer copy-paste or searching | When PDF readers, etc., offer copy-paste or searching | ||
Line 14: | Line 11: | ||
documents (in English, at least); the problem comes when you're using | documents (in English, at least); the problem comes when you're using | ||
an inflected language (with accented letters, or composite glyphs | an inflected language (with accented letters, or composite glyphs | ||
- | such as 'æ')& | + | such as 'æ' |
encoding, and there are likely to be problems, since PDF readers | encoding, and there are likely to be problems, since PDF readers | ||
assume the text is presented in Unicode. | assume the text is presented in Unicode. | ||
- | |||
For PDF generated from LaTeX (the DVI being | For PDF generated from LaTeX (the DVI being | ||
- | converted, by whatever means), or from PDFLaTeX, the character | + | converted, by whatever means), or from pdfLaTeX, the character |
codes used in the PDF file are in fact those of the document' | codes used in the PDF file are in fact those of the document' | ||
- | [font encoding](./ | + | [[FAQ-whatenc|font encoding]]; if you're using OT1 or |
T1, your document will be OK for almost all ASCII | T1, your document will be OK for almost all ASCII | ||
- | characters, but it's likely that anything | + | characters, but it's likely that anything |
not be represented properly. | not be represented properly. | ||
XeTeX- or LuaTeX-based formats is going to be OK, since | XeTeX- or LuaTeX-based formats is going to be OK, since | ||
- | those engines work in Unicode | + | those engines work in Unicode |
The solution comes from the character-mapping facilities in the | The solution comes from the character-mapping facilities in the | ||
Line 35: | Line 30: | ||
version of the characters. | version of the characters. | ||
- | + | Packages [[ctanpkg> | |
- | Packages [`cmap`](http:// | + | generating such tables ([[ctanpkg> |
- | generating such tables ([`mmap`](http:// | + | the various maths encodings); both work with pdfTeX |
- | the various maths encodings); both work with PDFTeX | + | |
engine. | engine. | ||
- | ```latex | + | |
+ | < | ||
\documentclass{article} | \documentclass{article} | ||
\usepackage{mmap} % (or cmap) | \usepackage{mmap} % (or cmap) | ||
Line 47: | Line 42: | ||
\begin{document} | \begin{document} | ||
... % your actual text | ... % your actual text | ||
- | ``` | + | </ |
Unfortunately, | Unfortunately, | ||
- | encoded, such as the default (Computer Modern, i.e., `cm` | + | encoded, such as the default (Computer Modern, i.e., '' |
- | fonts, and things such as `cm-super` or the `Latin` | + | fonts, and things such as '' |
- | `Modern` sets. Fonts like Adobe | + | '' |
Times Roman (which are encoded for (La)TeX use via virtual fonts) | Times Roman (which are encoded for (La)TeX use via virtual fonts) | ||
are not amenable to this treatment. | are not amenable to this treatment. | ||
+ | ----- | ||
+ | //Source:// [[faquk> | ||
- | + | {{htmlmetatags> | |
- | + | metatag-og: | |
- | + | metatag-og: | |
- | + | }} | |
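The diff elides part of the preamble in the example, so here is a minimal complete sketch of the same idea for pdfLaTeX. It assumes the T1 font encoding and the Latin Modern fonts (one of the directly-encoded families the text names). The `lmodern` and `inputenc` lines and the sample sentence are illustrative additions, not part of the original page. Loading `cmap` ahead of `fontenc` follows that package's own documentation.

```latex
\documentclass{article}
\usepackage{cmap}            % load early, before fontenc, so the mapping tables are attached
\usepackage[T1]{fontenc}     % assumption: T1-encoded text fonts
\usepackage[utf8]{inputenc}  % assumption: UTF-8 source (already the default in recent LaTeX)
\usepackage{lmodern}         % assumption: Latin Modern, a directly-encoded font set
\begin{document}
% Sample text with accented letters and composite glyphs
\OE uvre, \ae ther, na\"{\i}ve, caf\'e.
\end{document}
```

To try it, compile with pdfLaTeX, then search the resulting PDF for one of the accented words or copy the sentence out of a viewer. Swapping `cmap` for `mmap` should additionally cover the maths encodings, as the text describes.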