[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: reverse mirroring
* Hagen Fritsch <hagen@itooktheredpill.dyndns.org> [2004-08-29 10:00]:
> Hi,
> such a feature is not planned, as there is nearly noone who would
> use it. Therefore c-coding it into wput would just waste time. For
> such purposes there exist scripting languages that come with every
> unixoid system. For example you could write a simple perl-script
> that collects all the referenced files and then pipe this list to
> wput so that it uploads these files. It is more easy to write a
> perl-script for the filtering since perl has the
> regexp-capabilities. In C you would have to spend hours just for
> this little parsing task.
Okay, thanks for the feedback. I brought this up on the wget list,
and folks suggested taking the wget components that already do this,
and integrating them into wput. Assuming wget is a C project, it may
be the easiest way for me (since I don't know perl). If I find myself
uploading web pages very often, I might consider doing this.
> *argh* these ads are quite disturbing. maybe i create a own
> mailman-list or st. similar.
Yes they are irritating. I have some procmail recipes that eliminate
yahoo groups ads:
# Yahoo groups mailing lists
#
:0
*$ ^Mailing-List:.*$YAHOO_GROUPS@yahoogroups\.com
* ^Mailing-List:.*\/[a-z0-9._-]+@
* MATCH ?? ()\/[^@]+
{
LISTNAME=`echo $MATCH | tr 'A-Z' 'a-z'`
:0 fhw
| sed -e "/^Subject:/ s/ *\[$LISTNAME\] */ /I" \
-e '/^Subject:/ s/ *\[MacPP\] */ /' \
-e '/^Subject:/ s/ *\[EL-M\] */ /' \
-e '/^Subject:/ s/ *\[PaperPort Preservation Society\] */ /' \
-e '/^Reply-To:.*/d'
# | formail -I "Reply-To: "
:0 fbw
| sed -e "/^${QUOTE}-* *Yahoo! Groups Sponsor/,/^${QUOTE}-\{5\}~?-*>/d" \
-e "/^${QUOTE}Yahoo! Groups Links/,/http:\/\/docs.yahoo.com\/info\//d" \
-e "/^${QUOTE}Yahoo! Groups Links/,/Groups is subject to the/d" \
-e "/^${QUOTE}Your use of Yahoo! Groups is subject to/,/http:\/\/docs.yahoo.com\/info\//d" \
-e "/^${QUOTE}-*~-->$DOLLAR/,/^${QUOTE}-*~->$DOLLAR/d"
:0 fbw
* ^Mailing-List:.*paperport@yahoogroups\.com
| sed -e "/^${QUOTE}PDF Flavors.*single most confusing/,/^${QUOTE}PDF Formatted Text/d"\
-e "/^${QUOTE}To reply: mailto:PaperportWin/,/~PaperportWin$DOLLAR/d"\
-e "/^${QUOTE}These User Communities are not monitored by ScanSoft/,/omnipage/d"\
-e "/^${QUOTE}Unsubscribe, or change your delivery options here/,/omnipage/d"\
-e "/^${QUOTE}Related websites:/,/omnipage/d"\
-e "/^${QUOTE}Yahoo! Groups SponsorADVERTISEMENT/d"\
-e '/<!--.*begin egp html banner/,/end egp html banner.* -->$/d'\
-e '/websites:<BR><A href=/,/<tt>/d'\
-e "/^${QUOTE}Official Forum Semaphores$DOLLAR/,/\/message\/28092$DOLLAR/d"\
-e "/^${QUOTE}Official Forum Semaphores$DOLLAR/,/\/group\/omnipage/d"
:0 fbw
* ^Mailing-List:.*paperport-tips@yahoogroups\.com
| sed "/^${QUOTE}Unsubscribe, or change your delivery options here:$DOLLAR/,/group\/paperport-feature-requests$DOLLAR/d"
:0 fbw
* ^Mailing-List:.*vpfree@yahoogroups\.com
| sed "/^${QUOTE}vpFREE Links: http:\/\/members.cox.net\/vpfree\/Links.htm/d"
:0
* ^Mailing-List:.*paperport@yahoogroups\.com
* ^From:.*(listmanager|PaperportWin)[.0-9]*@webboard\.scansoft\.com
{
# TRUEFROM=`grep 'From: \".*\"' | sed 's/.*<\([^>]*\)>/\1/g' | tail -1`
TRUEFROM = `grep '^From: \".*\"\ <.*@.*>$' | sed 's/From: \(.*\)/\1/g' | tail -1`
LOG = "truefrom is set to $TRUEFROM"
:0 fhw
| formail -i "From: $TRUEFROM"\
-i "Mail-Followup-To: paperport@yahoogroups.com, $TRUEFROM"\
-i "To: paperport@yahoogroups.com"
}
:0 fhw
*$ ^${FROM_}.*stevesnitzer@msn[.]com
*$ ! RECEIVED_YEAR ?? $STATED_YEAR
| formail -i "Date: $PROCESSED_DATE "
:0 :
* ^Mailing-List:.*paperport@yahoogroups\.com
* ^Subject:.*\<OT\>
mailing_lists/${LISTNAME}_${YEAR:-0000}ot
:0 E:
mailing_lists/${LISTNAME}_${YEAR:-0000}
}