[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: reverse mirroring



* Hagen Fritsch <hagen@itooktheredpill.dyndns.org> [2004-08-29 10:00]:
> Hi,
> such a feature is not planned, as there is nearly noone who would
> use it.  Therefore c-coding it into wput would just waste time. For
> such purposes there exist scripting languages that come with every
> unixoid system.  For example you could write a simple perl-script
> that collects all the referenced files and then pipe this list to
> wput so that it uploads these files. It is more easy to write a
> perl-script for the filtering since perl has the
> regexp-capabilities. In C you would have to spend hours just for
> this little parsing task.

Okay, thanks for the feedback.  I brought this up on the wget list,
and folks suggested taking the wget components that already do this,
and integrating them into wput.  Assuming wget is a C project, it may
be the easiest way for me (since I don't know perl).  If I find myself
uploading web pages very often, I might consider doing this.

> *argh* these ads are quite disturbing. maybe i create a own
> mailman-list or st. similar.

Yes they are irritating.  I have some procmail recipes that eliminate
yahoo groups ads:

# Yahoo groups mailing lists
#
:0
*$ ^Mailing-List:.*$YAHOO_GROUPS@yahoogroups\.com
* ^Mailing-List:.*\/[a-z0-9._-]+@
* MATCH ?? ()\/[^@]+
{
  LISTNAME=`echo $MATCH | tr 'A-Z' 'a-z'`

  :0 fhw
  | sed -e "/^Subject:/ s/ *\[$LISTNAME\] */ /I" \
        -e '/^Subject:/ s/ *\[MacPP\] */ /' \
        -e '/^Subject:/ s/ *\[EL-M\] */ /' \
        -e '/^Subject:/ s/ *\[PaperPort Preservation Society\] */ /' \
        -e '/^Reply-To:.*/d'

#  | formail -I "Reply-To: "

  :0 fbw
  | sed -e "/^${QUOTE}-* *Yahoo! Groups Sponsor/,/^${QUOTE}-\{5\}~?-*>/d" \
        -e "/^${QUOTE}Yahoo! Groups Links/,/http:\/\/docs.yahoo.com\/info\//d" \
        -e "/^${QUOTE}Yahoo! Groups Links/,/Groups is subject to the/d" \
        -e "/^${QUOTE}Your use of Yahoo! Groups is subject to/,/http:\/\/docs.yahoo.com\/info\//d" \
        -e "/^${QUOTE}-*~-->$DOLLAR/,/^${QUOTE}-*~->$DOLLAR/d"

  :0 fbw
  * ^Mailing-List:.*paperport@yahoogroups\.com
  | sed -e "/^${QUOTE}PDF Flavors.*single most confusing/,/^${QUOTE}PDF Formatted Text/d"\
        -e "/^${QUOTE}To reply: mailto:PaperportWin/,/~PaperportWin$DOLLAR/d"\
        -e "/^${QUOTE}These User Communities are not monitored by ScanSoft/,/omnipage/d"\
        -e "/^${QUOTE}Unsubscribe, or change your delivery options here/,/omnipage/d"\
        -e "/^${QUOTE}Related websites:/,/omnipage/d"\
        -e "/^${QUOTE}Yahoo! Groups SponsorADVERTISEMENT/d"\
        -e '/<!--.*begin egp html banner/,/end egp html banner.* -->$/d'\
        -e '/websites:<BR><A href=/,/<tt>/d'\
        -e "/^${QUOTE}Official Forum Semaphores$DOLLAR/,/\/message\/28092$DOLLAR/d"\
        -e "/^${QUOTE}Official Forum Semaphores$DOLLAR/,/\/group\/omnipage/d"

  :0 fbw
  * ^Mailing-List:.*paperport-tips@yahoogroups\.com
  | sed "/^${QUOTE}Unsubscribe, or change your delivery options here:$DOLLAR/,/group\/paperport-feature-requests$DOLLAR/d"

  :0 fbw
  * ^Mailing-List:.*vpfree@yahoogroups\.com
  | sed "/^${QUOTE}vpFREE Links: http:\/\/members.cox.net\/vpfree\/Links.htm/d"

  :0
  * ^Mailing-List:.*paperport@yahoogroups\.com
  * ^From:.*(listmanager|PaperportWin)[.0-9]*@webboard\.scansoft\.com
  { 
#    TRUEFROM=`grep 'From: \".*\"' | sed 's/.*<\([^>]*\)>/\1/g' | tail -1`
    TRUEFROM = `grep '^From: \".*\"\ <.*@.*>$' | sed 's/From: \(.*\)/\1/g' | tail -1`
    LOG      = "truefrom is set to $TRUEFROM"

    :0 fhw
    | formail -i "From: $TRUEFROM"\
              -i "Mail-Followup-To: paperport@yahoogroups.com, $TRUEFROM"\
              -i "To: paperport@yahoogroups.com"
  }

  :0 fhw
  *$ ^${FROM_}.*stevesnitzer@msn[.]com
  *$ ! RECEIVED_YEAR ?? $STATED_YEAR
  | formail -i "Date: $PROCESSED_DATE "

  :0 :
  * ^Mailing-List:.*paperport@yahoogroups\.com
  * ^Subject:.*\<OT\>
  mailing_lists/${LISTNAME}_${YEAR:-0000}ot

  :0 E:
  mailing_lists/${LISTNAME}_${YEAR:-0000}
}