comp.info.servers.unix Web servers for UNIX platforms.

How to find out all the image links on a site?

27/07/2006, 16:50   #1
dubing@gmail.com

Hello,

Our web site is hosted on Apache 2.0 on RHEL4. There are various
image directories that make the directory structure messy, so we are
thinking about merging them. To do that, I need a good idea of which
image paths are referenced most frequently by the web pages. I'm aware
of link checker tools, but I don't know if any of them can simply list
all the image links on a web site. Has anybody done this and would be
willing to share the experience?

Thanks in advance,

Bing

27/07/2006, 17:20   #2
MikeDawg


dubing@gmail.com wrote:
> Hello,
>
> Our web site is hosted on Apache 2.0 on RHEL4. There are various
> image directories that make the directory structure messy, so we are
> thinking about merging them. To do that, I need a good idea of which
> image paths are referenced most frequently by the web pages. I'm aware
> of link checker tools, but I don't know if any of them can simply list
> all the image links on a web site. Has anybody done this and would be
> willing to share the experience?
>
> Thanks in advance,
>
> Bing


Why don't you just use grep to go through the web directories you
have and search for everything that begins with "<img"? If you don't
have direct access to those files via a shell, you can use wget to
fetch a copy of the HTML pages and search through that with grep.
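
As a rough illustration of the "search for <img" idea, here is a minimal Python sketch that pulls the src of every <img> tag out of HTML text and counts how often each image directory is referenced. It is only one possible approach, not something from the thread: the names ImgSrcCollector, image_sources and count_image_dirs are made up for this example, and the sample pages are invented. The HTML can come from files on disk, from pages fetched with wget, or from any other source of page text.

```python
from collections import Counter
from html.parser import HTMLParser
import posixpath
from urllib.parse import urlsplit


class ImgSrcCollector(HTMLParser):
    """Collect the src attribute of every <img> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <IMG SRC=...>
        # is handled the same way as <img src=...>.
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def image_sources(html_text):
    """Return the list of image sources found in one HTML document."""
    parser = ImgSrcCollector()
    parser.feed(html_text)
    parser.close()
    return parser.sources


def count_image_dirs(pages):
    """Count how often each image directory is referenced across pages."""
    counts = Counter()
    for html_text in pages:
        for src in image_sources(html_text):
            path = urlsplit(src).path  # ignore host and query string
            counts[posixpath.dirname(path)] += 1
    return counts


if __name__ == "__main__":
    # Invented sample pages, for illustration only.
    sample = [
        '<p><img src="/images/logo.gif"> <IMG SRC="/img/old/banner.jpg"></p>',
        '<img alt="x" src="/images/photo.png">',
    ]
    for directory, hits in count_image_dirs(sample).most_common():
        print(hits, directory)
```

Unlike a plain grep, this only looks at <img> tags; images referenced from CSS or scripts would need separate handling.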

27/07/2006, 18:49   #4
dubing@gmail.com


MikeDawg wrote:
> dubing@gmail.com wrote:
> > Hello,
> >
> > Our web site is hosted on Apache 2.0 on RHEL4. There are various
> > image directories that make the directory structure messy, so we are
> > thinking about merging them. To do that, I need a good idea of which
> > image paths are referenced most frequently by the web pages. I'm aware
> > of link checker tools, but I don't know if any of them can simply list
> > all the image links on a web site. Has anybody done this and would be
> > willing to share the experience?
> >
> > Thanks in advance,
> >
> > Bing

>
> Why don't you just use grep to go through the web directories you
> have and search for everything that begins with "<img"? If you don't
> have direct access to those files via a shell, you can use wget to
> fetch a copy of the HTML pages and search through that with grep.


Thanks for the reply.

We're using an open source content management system to manage our
websites. Only the templates and CSS are stored on the file system;
the actual page contents are all stored in MySQL. I have shell access
and know how to use grep, but that won't work for me in this
situation. My knowledge of wget is limited. I've used it before, but
I don't know if it can retrieve hundreds of pages at once. I'll
check its manual. In the meantime, I'm looking forward to other
ideas.

Thanks,

Bing
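
Since the page contents live in the database and are only visible as rendered pages, one way to collect them in bulk is to follow links from the home page. The Python sketch below is a hedged illustration of that idea, not a tested tool: crawl_images, LinkCollector and fetch are hypothetical names, the 500-page cap is arbitrary, and it assumes pages are reachable by ordinary links on a single host. It returns the image URLs found on each page.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlsplit
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collect href values of <a> tags and src values of <img> tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.images = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "img" and attrs.get("src"):
            self.images.append(attrs["src"])


def fetch(url):
    """Download one page and return its text (assumes HTML content)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl_images(start_url, fetch_page=fetch, max_pages=500):
    """Walk same-host pages breadth-first; return {page_url: [image_urls]}."""
    host = urlsplit(start_url).netloc
    queue = deque([start_url])
    seen = {start_url}
    found = {}
    while queue and len(found) < max_pages:
        page = queue.popleft()
        try:
            text = fetch_page(page)
        except OSError:
            continue  # skip pages that cannot be downloaded
        parser = LinkCollector()
        parser.feed(text)
        parser.close()
        # Resolve relative image paths against the page they appear on.
        found[page] = [urljoin(page, src) for src in parser.images]
        for href in parser.links:
            target = urldefrag(urljoin(page, href)).url  # drop "#fragment"
            if urlsplit(target).netloc == host and target not in seen:
                seen.add(target)
                queue.append(target)
    return found
```

Calling crawl_images with the site's home page URL would return a dictionary that can then be tallied per image directory. The fetch_page parameter exists so the crawl logic can be tried against canned pages before pointing it at a live server.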

27/07/2006, 23:56   #6
Max Dittrich

On 27 Jul 2006 08:50:33 -0700, dubing@gmail.com wrote:

> Hello,
>
> Our web site is hosted on Apache 2.0 on RHEL4. There are various
> image directories that make the directory structure messy, so we are
> thinking about merging them. To do that, I need a good idea of which
> image paths are referenced most frequently by the web pages. I'm aware
> of link checker tools, but I don't know if any of them can simply list
> all the image links on a web site. Has anybody done this and would be
> willing to share the experience?


You can examine your log files as one source for this information.
Extend the log format with the value of the response header
"Content-Type" and grep for "image/".

..max
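
To make the log-file idea concrete, here is a small Python sketch that counts requests whose logged Content-Type starts with "image/". The line layout is an assumption, not something given in the thread: Common Log Format with the response Content-Type appended in quotes, as a directive along the lines of the one in the comment might produce. The function name count_image_requests is invented for this example.

```python
import re
from collections import Counter

# Assumed line layout: Common Log Format plus the response Content-Type in
# quotes at the end, e.g. from a (hypothetical) Apache directive such as
#   LogFormat "%h %l %u %t \"%r\" %>s %b \"%{Content-Type}o\"" with_ctype
LINE_RE = re.compile(
    r'"(?:GET|HEAD) (?P<path>\S+)[^"]*" \d{3} \S+ "(?P<ctype>[^"]*)"\s*$'
)


def count_image_requests(lines):
    """Count requests per path for responses served as image/*."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.search(line)
        if not match:
            continue  # line does not follow the assumed layout
        if match["ctype"].startswith("image/"):
            # Drop any query string so /a.gif?v=1 and /a.gif count together.
            counts[match["path"].split("?", 1)[0]] += 1
    return counts
```

Passing an open log file to count_image_requests and printing the result of most_common() would list the most requested image paths first. Lines that do not match the assumed layout are silently skipped, so the pattern has to be adapted to the format actually configured.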
28/07/2006, 14:17   #8
dubing@gmail.com


Max Dittrich wrote:
> On 27 Jul 2006 08:50:33 -0700, dubing@gmail.com wrote:
>
> > Hello,
> >
> > Our web site is hosted on Apache 2.0 on RHEL4. There are various
> > image directories that make the directory structure messy, so we are
> > thinking about merging them. To do that, I need a good idea of which
> > image paths are referenced most frequently by the web pages. I'm aware
> > of link checker tools, but I don't know if any of them can simply list
> > all the image links on a web site. Has anybody done this and would be
> > willing to share the experience?

>
> You can examine your log files as one source for this information.
> Extend the log format with the value of the response header
> "Content-Type" and grep for "image/".


Thanks, Max. That's an interesting idea. It would work great if all
the images get accessed and recorded in the log.

Bing
