Need an album, not the whole gallery!

BRosewood

Joined: 2002-10-02
Posts: 16
Posted: Sat, 2003-06-21 22:30

So, I have the new version of gallery (1.3.4) and I am following the instructions http://gallery.menalto.com/modules.php?op=modload&name=phpWiki&file=index&pagename=Creating%20an%20Offline%20Copy there.

I hit up the command "wget -r -k --html-extension 'http://rosewood.dynu.com/gallery/marthawed?set_offline=true' "

Well, this does things kinda weird. I find the .html file for the album I need but some stuff just doesnt work. Its like things arent pointing to the right locations. None of the full images seem to work until I go find them myself.

I also compiled httrack. I blocked the listing for the albums directory. However, when I do the command " httrack -%F "" -K0 "http://rosewood.dynu.com/gallery/login.php?uname=admin&gallerypassword=hidden \ &submit=Login&set_offline=true" http://rosewood.dynu.com/gallery/marthawed http://rosewood.dynu.com/albums/ " all on one line, it decides to grab my whole gallery :(

Tips please?

-- EDIT --
Heh. Figured it out with wget. I am able to grab everything. However, windows really does not like the filenames with the ? in them I take it. Either in Firebird or IE, when I try to pull up the full version of a pic, it freaks out :(

 
ezra

Joined: 2003-06-24
Posts: 7
Posted: Tue, 2003-06-24 06:36

Me, too.
I do
wget -r -k --html-extension 'http://www.jezra.org/gallery?set_offline=true'
and get very few files, 3 directories and 3 files to be precise. I was really hoping to get this feature to work.

 
ezra

Joined: 2003-06-24
Posts: 7
Posted: Tue, 2003-06-24 06:46

By the way, I do want the whole gallery.

 
joan
joan's picture

Joined: 2002-10-21
Posts: 3473
Posted: Tue, 2003-06-24 07:21
ezra wrote:
Me, too.
I do
wget -r -k --html-extension 'http://www.jezra.org/gallery?set_offline=true'
and get very few files, 3 directories and 3 files to be precise. I was really hoping to get this feature to work.

wget -r -k --html-extension 'http://jezra.org/gallery/?set_offline=true'

[thanks to Jesse Mullan]

 
ezra

Joined: 2003-06-24
Posts: 7
Posted: Tue, 2003-06-24 07:35

Joan,
So I am staring and staring at what you posted, and the only change I see is that you changed the 'www.jezra.org' portion to 'jezra.org' and I'm like, "Nah, that can't be it," and I try it and it fails, and I am staring and staring at what you wrote until I finally see the backslash, and well, needless to say, it's working, and I only posted that question a couple of minutes ago, you answered so quickly (is it the middle of the night for you too?). I wish you have a question that I see that I know the answer to some day!
Thanks,
(I never woulda seen it)

the gallery-people rock! :D

 
jmullan
jmullan's picture

Joined: 2002-07-28
Posts: 974
Posted: Tue, 2003-06-24 08:01

Actually, Joan enlisted the aid of a whole slew of gallery people. I accidentally downloaded your whole gallery twice without ever really looking at it. Good luck and enjoy! ^_^

 
joan
joan's picture

Joined: 2002-10-21
Posts: 3473
Posted: Tue, 2003-06-24 08:03
BRosewood wrote:
Heh. Figured it out with wget. I am able to grab everything. However, windows really does not like the filenames with the ? in them I take it. Either in Firebird or IE, when I try to pull up the full version of a pic, it freaks out :(

have you tried httrack again? that doesn't get the funny characters

httrack -%F "" -K0 'http://rosewood.dynu.com/gallery/fishy?set_offline=true' http://rosewood.dynu.com/albums/

 
joan
joan's picture

Joined: 2002-10-21
Posts: 3473
Posted: Tue, 2003-06-24 08:06
jmullan wrote:
Actually, Joan enlisted the aid of a whole slew of gallery people. I accidentally downloaded your whole gallery twice without ever really looking at it. Good luck and enjoy! ^_^

well technically Beckett started the whole thing. But your gallery was the center of the IRC channel for a while, and Jesse came up with the solution.

[www.jezra.org vs jezra.org is a very important division. wget will only download things from the same domain as the original URL, otherwise if you ran wget on a page with a link on it, you could end up downloading the entire internet]

 
jdr0606

Joined: 2003-01-06
Posts: 63
Posted: Wed, 2003-06-25 01:54

BTW!

What version of WGET is everyone using?

Thanks

 
joan
joan's picture

Joined: 2002-10-21
Posts: 3473
Posted: Wed, 2003-06-25 06:48

1.8.2

 
BRosewood

Joined: 2002-10-02
Posts: 16
Posted: Fri, 2003-07-04 18:48
joan wrote:
BRosewood wrote:
Heh. Figured it out with wget. I am able to grab everything. However, windows really does not like the filenames with the ? in them I take it. Either in Firebird or IE, when I try to pull up the full version of a pic, it freaks out :(

have you tried httrack again? that doesn't get the funny characters

httrack -%F "" -K0 'http://rosewood.dynu.com/gallery/fishy?set_offline=true' http://rosewood.dynu.com/albums/

I tried that again and once again, it grabbed EVERYTHING in the albums directory. So, then I tried making that last part /albums/fishy/ and (well its running now) it has not grabbed other images from the albums but its still grabbing html files from all my other albums. Not quite sure why it keeps doing that.

Also, even with httrack, it is still pulling html files like: marthawed?sef_offline=true.html

Any ? in files in windows is going to not work right.

As for what version of wget I am using when I use wget? 1.8.2

 
hybernaut

Joined: 2003-06-22
Posts: 1
Posted: Wed, 2003-07-30 03:56

It's possible to restrict wget to certain directories on the (ever-growing) command line.

I've had success with the following, to archive gallery 'album25' to a directory, from which
I can burn a cd-r.

wget -r -k --html-extension -nH --cut-dirs=1 \
'http://www.myserver.com/user/gallery/album25?offline=true' \
-I /user/gallery/images,/user/gallery/album25,/user/albums/album25

So for each album you want to mirror, you need to restrict wget to gallery/images,
gallery/album, and albums/album

or your config.php is significantly different from mine, that would be:

$photoAlbumURL/images, $photoAlbumURL/$album, and $albumDirURL/$album

A big shout out to the Gallery team--Gallery is a beautiful thing.

 
brucew

Joined: 2003-08-16
Posts: 1
Posted: Sat, 2003-08-16 19:23

After hitting the same problem, I couldn't see how httrack was getting to the other albums when there weren't any visible links (I had disabled the return link).

I figured out that the <link tags in the head section with rel="up", "top", etc. are the culprit. It seems httrack is following those even though they aren't visible links.

I couldn't figure out a way to get httrack to not traverse them so I commented out the link tags in the head section (except the stylesheets) in view_album.php and view_photo.php.

I'm sure there's a better solution and I'm not sure what I'm losing by not having those in there but it works for the offline browsing.

ciao,
bruce

 
joan
joan's picture

Joined: 2002-10-21
Posts: 3473
Posted: Sat, 2003-08-16 21:51

You DA MAN! This has been bothering me for months, and this is the solution (I hope). Thanks! Now, I've got 10 minutes before the code freeze I announced to get it into 1.4.

wish me luck!

 
joan
joan's picture

Joined: 2002-10-21
Posts: 3473
Posted: Sat, 2003-08-16 22:50

Made it in just a little extra time. The fix is in 1.4-cvs-b132, or you can wait for RC1.

[I'd got the one in view_album.php, it was the "top" link in view_photo.php that had eluded me]

 
nukestud
nukestud's picture

Joined: 2003-03-25
Posts: 114
Posted: Sat, 2003-08-30 22:26
hybernaut wrote:
It's possible to restrict wget to certain directories on the (ever-growing) command line.

I've had success with the following, to archive gallery 'album25' to a directory, from which
I can burn a cd-r.

wget -r -k --html-extension -nH --cut-dirs=1 \
'http://www.myserver.com/user/gallery/album25?offline=true' \
-I /user/gallery/images,/user/gallery/album25,/user/albums/album25

So for each album you want to mirror, you need to restrict wget to gallery/images,
gallery/album, and albums/album

or your config.php is significantly different from mine, that would be:

$photoAlbumURL/images, $photoAlbumURL/$album, and $albumDirURL/$album

A big shout out to the Gallery team--Gallery is a beautiful thing.

I would LOVE something similar this feature to be included in a link under each album for regular users to download a zip for the album. Does anyone have any ideas how I could accomplish this?

Have a good one!

 
JoergSchulz

Joined: 2003-05-21
Posts: 31
Posted: Sun, 2004-10-31 18:13

Solved: Reason: rss.php

Howto:
either -> include rss.php in your robots.txt
or -> add "-*rss.php" to the end of your command line.