PHWinfo banniere

Titres
PORTAIL ANNUAIRE ARTICLES COMPARATEUR HÉBERGEURS DEVIS FORUMS RÉDUCTEUR D'URL
Précédent   PHWinfo > Forums Hébergement > Forum Serveur - Sécurité et techniques > linux.debian.user > word list for document index
S'inscrire FAQ Membres Recherche Messages du jour Marquer les forums comme lus
linux.debian.user debian-user@lists.debian.org.

word list for document index

Réponse
 
LinkBack Outils de la discussion
Vieux 12/04/2007, 09h10   #1
Russell L. Harris
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut word list for document index

I wish to add an index to documents I am writing. I am using XEmacs
and LaTeX.

It appears that makeindex is obsolescent and has been superseded by
xindy, so I plan to use xindy.

>From an article titled "MakeIndex: An Index Processor for LaTeX",

written by Leslie Lamport (17 February 1987), it appears that the
first step is to create an alphabetical list of every word in the
document, with duplicates removed. Lamport recommends use of the
commands:

delatex myfile.tex | sort -uf > foo

It would appear that delatex purges from the file all the LaTeX
commands.

However:

(1) It appears that delatex now is obsolete. I have searched with
Google, but I have not yet discovered how to obtain or create an
executable version of delatex.

(2) I do not know if delatex takes account of "include" files in a
LaTeX document (that is, files specified by the \include command).
However, it would be no great problem to run delatex on each of the
include files, and then concatenate the resulting word lists.

Accordingly, I am trying to start out with the .dvi file of the
document. Now the problem is that, in the .dvi file, many words are
hyphenated at the end of the line, and there is a newline character
(and perhaps also a carriage return?) at the end of each line.

Using sed and tr, I have attempted to remove combination of [hyphen
and newline character], but I have not been successful. Two sed
script which I have tried are:

sed -e 's/" \n"//g'

sed -e 's/( \n)//g'

Is there a better way to generate a list of words in a LaTeX document?

RLH



--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
  Réponse avec citation
Vieux 12/04/2007, 10h40   #2
Kushal Kumaran
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: word list for document index

On 4/12/07, Russell L. Harris <rlharris@oplink.net> wrote:
> I wish to add an index to documents I am writing. I am using XEmacs
> and LaTeX.
>
> It appears that makeindex is obsolescent and has been superseded by
> xindy, so I plan to use xindy.
>
> >From an article titled "MakeIndex: An Index Processor for LaTeX",

> written by Leslie Lamport (17 February 1987), it appears that the
> first step is to create an alphabetical list of every word in the
> document, with duplicates removed. Lamport recommends use of the
> commands:
>
> delatex myfile.tex | sort -uf > foo
>
> It would appear that delatex purges from the file all the LaTeX
> commands.
>
> <snipped detailed query on converting latex to plain text>


Have you seen the TeX FAQ? The section
http://www.tex.ac.uk/cgi-bin/texfaq2html?label=toascii might .

catdvi is at least packaged for debian. I haven't checked the others.
--
Kushal


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
  Réponse avec citation
Vieux 12/04/2007, 11h00   #3
Russell L. Harris
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: word list for document index

* Kushal Kumaran <kushal.kumaran+debian@gmail.com> [070412 04:42]:
> On 4/12/07, Russell L. Harris <rlharris@oplink.net> wrote:
>> I wish to add an index to documents I am writing. I am using
>> XEmacs and LaTeX.

> Have you seen the TeX FAQ? The section
> http://www.tex.ac.uk/cgi-bin/texfaq2html?label=toascii might .
>
> catdvi is at least packaged for debian. I haven't checked the others.



I thank you, Kushal. I forgot about the TeX FAQ.

I am using multi-columns, so dvi2tty is not suitable, but catdvi with
the --sequential option works.

The TeX FAQ provided a link to a complete delatex archive, so perhaps
I can get delatex running; it appears to be a much better solution.

For some reason, xindy is not in the Debian archive, but I did find
www.xindy.org. It may be better for me to use makeindex for the
present time.

RLH


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
  Réponse avec citation
Vieux 12/04/2007, 11h30   #4
Florian Kulzer
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: word list for document index

On Thu, Apr 12, 2007 at 04:59:36 -0500, Russell L. Harris wrote:
> * Kushal Kumaran [070412 04:42]:
> > On 4/12/07, Russell L. Harris wrote:
> >> I wish to add an index to documents I am writing. I am using
> >> XEmacs and LaTeX.

> > Have you seen the TeX FAQ? The section
> > http://www.tex.ac.uk/cgi-bin/texfaq2html?label=toascii might .
> >
> > catdvi is at least packaged for debian. I haven't checked the others.

>
>
> I thank you, Kushal. I forgot about the TeX FAQ.
>
> I am using multi-columns, so dvi2tty is not suitable, but catdvi with
> the --sequential option works.
>
> The TeX FAQ provided a link to a complete delatex archive, so perhaps
> I can get delatex running; it appears to be a much better solution.


Try "detex -w somefile.tex" to get the word list. Detex processes \input
and \include commands automatically and it can deal with both plain TeX
and LaTeX source files. Detex is included in texlive-extra-utils.

--
Regards,
Florian


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
  Réponse avec citation
Vieux 12/04/2007, 11h40   #5
Russell L. Harris
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: word list for document index

* Florian Kulzer <florian.kulzer@icfo.es> [070412 05:21]:
> Try "detex -w somefile.tex" to get the word list. Detex processes \input
> and \include commands automatically and it can deal with both plain TeX
> and LaTeX source files. Detex is included in texlive-extra-utils.


Thanks, Florian.

I now am installing TeXLive on this machine.

Regrettably, this necessitates removing muttprint, which is the reason
I did not install TeXLive several months ago.

Also, I think that lilypond may not work yet with TeXLive.

But a working and supported version of detex is more important to me.

RLH


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
  Réponse avec citation
Vieux 12/04/2007, 11h50   #6
Florian Kulzer
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: word list for document index

On Thu, Apr 12, 2007 at 05:35:45 -0500, Russell L. Harris wrote:
> * Florian Kulzer [070412 05:21]:
> > Try "detex -w somefile.tex" to get the word list. Detex processes \input
> > and \include commands automatically and it can deal with both plain TeX
> > and LaTeX source files. Detex is included in texlive-extra-utils.

>
> Thanks, Florian.
>
> I now am installing TeXLive on this machine.
>
> Regrettably, this necessitates removing muttprint, which is the reason
> I did not install TeXLive several months ago.
>
> Also, I think that lilypond may not work yet with TeXLive.
>
> But a working and supported version of detex is more important to me.


I think you could simply extract the detex binary from the texlive
package and put it somewhere in your path; it only links against libc6
if I am not mistaken. This would allow you to keep tetex.

muttprint seems to specify an alternative dependency "tetex-extra |
texlive-latex-extra" now, so you should be able to use this package with
texlive also (not sure about lilypond, though).

--
Regards,
Florian


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
  Réponse avec citation
Vieux 12/04/2007, 23h04   #7
Russell L. Harris
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: word list for document index

* Florian Kulzer <florian.kulzer@icfo.es> [070412 05:49]:
> On Thu, Apr 12, 2007 at 05:35:45 -0500, Russell L. Harris wrote:
>> * Florian Kulzer [070412 05:21]:


>>> Try "detex -w somefile.tex" to get the word list. Detex processes
>>> \input and \include commands automatically and it can deal with
>>> both plain TeX and LaTeX source files. Detex is included in
>>> texlive-extra-utils.

>>
>> I now am installing TeXLive on this machine. Regrettably, this
>> necessitates removing muttprint, which is the reason I did not
>> install TeXLive several months ago.

>
> muttprint seems to specify an alternative dependency "tetex-extra |
> texlive-latex-extra" now, so you should be able to use this package with
> texlive also (not sure about lilypond, though).


Hi, Florian.

(1) detex does a marvelous job; many thanks.

(2) Although muttprint was removed when I installed TeXLive, I was
able to reinstall muttprint; thanks.

(3) After reinstallation, muttprint gives an error message, saying
that there is not DVI file; but I think that this is related to the
the fact that TeXLive doesn't work (see the next item).

(4) Now when I try to run LaTeX on a document, there is an error message:

This is pdfeTeX, Version 3.141592-1.30.5-2.2 (Web2C 7.5.5)
---! /home/rlh/.texmf-var/web2c/latex.fmt doesn't match pdfetex.pool
(Fatal format file error; I'm stymied)

But I installed TeXLive without difficulty on another i386 system on
which it runs without error on the same document.

So perhaps I need to remove TeXLive, purge the configuration files,
and reinstall?

And if that doesn't work, I need to start another thread for this topic.

RLH



--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
  Réponse avec citation
Vieux 13/04/2007, 09h10   #8
Florian Kulzer
Aucun Avatar
 
Messages: n/a
Hébergeur:
Par défaut Re: word list for document index

On Thu, Apr 12, 2007 at 15:20:39 -0500, Russell L. Harris wrote:

[ ...transition from tetex to texlive; so far so good, but... ]

> (4) Now when I try to run LaTeX on a document, there is an error message:
>
> This is pdfeTeX, Version 3.141592-1.30.5-2.2 (Web2C 7.5.5)
> ---! /home/rlh/.texmf-var/web2c/latex.fmt doesn't match pdfetex.pool
> (Fatal format file error; I'm stymied)
>
> But I installed TeXLive without difficulty on another i386 system on
> which it runs without error on the same document.


My first guess would be that you have some old files in your
~/.texmf-var directory which are not upwards compatible. Try to move
this directory out of the way and run latex again; I assume the proper
files will be created automatically then. You could first check if a
"fresh" user has the same problem when running latex.

--
Regards,
Florian


--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
  Réponse avec citation
Réponse


Outils de la discussion

Règles de messages
Vous ne pouvez pas créer de nouvelles discussions
Vous ne pouvez pas envoyer des réponses
Vous ne pouvez pas envoyer des pièces jointes
Vous ne pouvez pas modifier vos messages

Les balises BB sont activées : oui
Les smileys sont activés : oui
La balise [IMG] est activée : oui
Le code HTML peut être employé : non
Trackbacks are oui
Pingbacks are oui
Refbacks are oui


Fuseau horaire GMT +1. Il est actuellement 23h11.


Édité par : vBulletin® version 3.7.2
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Search Engine Friendly URLs by vBSEO 3.2.0 RC5 Tous droits réservés.
Version française #16 par l'association vBulletin francophone
PHWinfo est un site Éducation Sans Frontières
Ad Management by RedTyger
©Tous droits réservés par les parties respectives
Page generated in 0,19819 seconds with 16 queries