|
|
|
|
||||||
| comp.info.servers.unix Web servers for UNIX platforms. |
![]() |
|
|
LinkBack | Outils de la discussion |
|
|
#1 |
|
Messages: n/a
Hébergeur: |
Hello,
Our web site is hosted on Apache 2.0 on RHEL4. There are various images directories that make the directory structure messy. So we are think about merging the image directories. For doing that, I need to get a good idea which image paths are referenced frequently by the web pages. I'm aware of link checker tools. But I don't know if any of them can just show me all the image links on a web site. Anybody have ever done this and would like to share the experience? Thanks in advance, Bing |
|
|
|
#2 |
|
Messages: n/a
Hébergeur: |
dubing@gmail.com wrote: > Hello, > > Our web site is hosted on Apache 2.0 on RHEL4. There are various > images directories that make the directory structure messy. So we are > think about merging the image directories. For doing that, I need to > get a good idea which image paths are referenced frequently by the web > pages. I'm aware of link checker tools. But I don't know if any of > them can just show me all the image links on a web site. Anybody have > ever done this and would like to share the experience? > > Thanks in advance, > > Bing Why don't you just use grep and go through the web directories you have, and just search for everything that begins with "<img" ? If you don't have direct access to those files via a shell or anything, you can use wget and get a copy of the pure html and just search through it with grep. |
|
|
|
#3 |
|
Messages: n/a
Hébergeur: |
dubing@gmail.com wrote: > Hello, > > Our web site is hosted on Apache 2.0 on RHEL4. There are various > images directories that make the directory structure messy. So we are > think about merging the image directories. For doing that, I need to > get a good idea which image paths are referenced frequently by the web > pages. I'm aware of link checker tools. But I don't know if any of > them can just show me all the image links on a web site. Anybody have > ever done this and would like to share the experience? > > Thanks in advance, > > Bing Why don't you just use grep and go through the web directories you have, and just search for everything that begins with "<img" ? If you don't have direct access to those files via a shell or anything, you can use wget and get a copy of the pure html and just search through it with grep. |
|
|
|
#4 |
|
Messages: n/a
Hébergeur: |
MikeDawg wrote: > dubing@gmail.com wrote: > > Hello, > > > > Our web site is hosted on Apache 2.0 on RHEL4. There are various > > images directories that make the directory structure messy. So we are > > think about merging the image directories. For doing that, I need to > > get a good idea which image paths are referenced frequently by the web > > pages. I'm aware of link checker tools. But I don't know if any of > > them can just show me all the image links on a web site. Anybody have > > ever done this and would like to share the experience? > > > > Thanks in advance, > > > > Bing > > Why don't you just use grep and go through the web directories you > have, and just search for everything that begins with "<img" ? If you > don't have direct access to those files via a shell or anything, you > can use wget and get a copy of the pure html and just search through it > with grep. Thanks for the reply. We're using an open source content management system to manage our websites. Only the templates and CSS are stored on the file system. Actual page contents are all stored in MySQL. I have a shell access and know how to use grep. But that won't work for me in this situation. My knowledge about wget is limited. I've ever used it but I don't know if it's possible to retrieve hundreds of pages once. I'll check its manual. At the same time, I'm looking forward to other ideas. Thanks, Bing Bing |
|
|
|
#5 |
|
Messages: n/a
Hébergeur: |
MikeDawg wrote: > dubing@gmail.com wrote: > > Hello, > > > > Our web site is hosted on Apache 2.0 on RHEL4. There are various > > images directories that make the directory structure messy. So we are > > think about merging the image directories. For doing that, I need to > > get a good idea which image paths are referenced frequently by the web > > pages. I'm aware of link checker tools. But I don't know if any of > > them can just show me all the image links on a web site. Anybody have > > ever done this and would like to share the experience? > > > > Thanks in advance, > > > > Bing > > Why don't you just use grep and go through the web directories you > have, and just search for everything that begins with "<img" ? If you > don't have direct access to those files via a shell or anything, you > can use wget and get a copy of the pure html and just search through it > with grep. Thanks for the reply. We're using an open source content management system to manage our websites. Only the templates and CSS are stored on the file system. Actual page contents are all stored in MySQL. I have a shell access and know how to use grep. But that won't work for me in this situation. My knowledge about wget is limited. I've ever used it but I don't know if it's possible to retrieve hundreds of pages once. I'll check its manual. At the same time, I'm looking forward to other ideas. Thanks, Bing Bing |
|
|
|
#6 |
|
Messages: n/a
Hébergeur: |
Am 27 Jul 2006 08:50:33 -0700 schrieb dubing@gmail.com:
> Hello, > > Our web site is hosted on Apache 2.0 on RHEL4. There are various > images directories that make the directory structure messy. So we are > think about merging the image directories. For doing that, I need to > get a good idea which image paths are referenced frequently by the web > pages. I'm aware of link checker tools. But I don't know if any of > them can just show me all the image links on a web site. Anybody have > ever done this and would like to share the experience? You can examine your logfiles as one source for this information. Extend the format with the value of the response-header "Content-Type" and grep for "image/". ..max |
|
|
|
#7 |
|
Messages: n/a
Hébergeur: |
Am 27 Jul 2006 08:50:33 -0700 schrieb dubing@gmail.com:
> Hello, > > Our web site is hosted on Apache 2.0 on RHEL4. There are various > images directories that make the directory structure messy. So we are > think about merging the image directories. For doing that, I need to > get a good idea which image paths are referenced frequently by the web > pages. I'm aware of link checker tools. But I don't know if any of > them can just show me all the image links on a web site. Anybody have > ever done this and would like to share the experience? You can examine your logfiles as one source for this information. Extend the format with the value of the response-header "Content-Type" and grep for "image/". ..max |
|
|
|
#8 |
|
Messages: n/a
Hébergeur: |
Max Dittrich wrote: > Am 27 Jul 2006 08:50:33 -0700 schrieb dubing@gmail.com: > > > Hello, > > > > Our web site is hosted on Apache 2.0 on RHEL4. There are various > > images directories that make the directory structure messy. So we are > > think about merging the image directories. For doing that, I need to > > get a good idea which image paths are referenced frequently by the web > > pages. I'm aware of link checker tools. But I don't know if any of > > them can just show me all the image links on a web site. Anybody have > > ever done this and would like to share the experience? > > You can examine your logfiles as one source for this information. Extend > the format with the value of the response-header "Content-Type" and grep > for "image/". Thanks, Max. That's an interesting idea. It would be great if all the images get accessed and recorded in the log. Bing |
|
![]() |
| Outils de la discussion | |
|
|