[hashcash] Re: PR Problem?

From: "Eric S. Johansson" <esj@xxxxxxxxxx>
To: hashcash@xxxxxxxxxxxxx
Date: Tue, 12 Dec 2006 01:44:34 -0500

Mario 'BitKoenig' Holbe wrote:

Eric S. Johansson <esj@xxxxxxxxxx> wrote:

DeLesley SpamBox wrote:
I'm not convinced that even a naive sender pays wouldn't be helpful.
Make assumptions about the number of zombies and how much leakage you will permit, and you can get the stamp size. It's quite entertaining.


I don't think this is a good argument. Even now/today all these zombies
could be used to generate spam mails directly. So the question should
rather be: how far would the amount of spam messages decrease when all
these zombies would additionally need to pay CPU for sender stamps.

Stamps work in three ways in this context: as a load on the spammer, as an indicator of the quality of the e-mail source, and as an introducer.

The load on the spammer is self-evident. The more work they have to do to generate each piece of spam, the lower the profit margin by virtue of the lowered visibility of their traffic. The quality of the e-mail source is only apparent if you have a feedback mechanism between sender and receiver of e-mail. If I get spam from a variety of addresses, the cost of getting e-mail from those addresses goes up. I can also increase costs by specifying always stamp versus first time stamp. But like I said, this only works if the stamper queries the recipient domain/domains to find out how much the postage should be. See previous conversation about DNS/HTTP-based mechanisms for communicating postage costs. The introducer model says that a stamp introduces you to somebody else as a way of bypassing filters. This allows you to bias your spam filter scores so that those with stamps get through in preference to those without. This may or may not be a good thing but I have been known to be a little biased at times.
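
A rough sketch of the receiver side of that feedback mechanism, in Python. Everything here is hypothetical: the class name, the bit values and the per-report penalty are illustrative assumptions, and the DNS/HTTP query itself is not modelled, only the price a recipient would answer with.

```python
class PostagePolicy:
    """Hypothetical per-recipient postage table (all numbers are assumptions)."""

    def __init__(self, base_bits=20, penalty_bits=2, always_stamp=False):
        self.base_bits = base_bits        # price for an unknown sender
        self.penalty_bits = penalty_bits  # extra bits per spam report
        self.always_stamp = always_stamp  # "always stamp" vs "first time stamp"
        self.spam_reports = {}            # sender address -> number of reports
        self.known = set()                # senders already introduced

    def mark_known(self, sender):
        """Record that this sender has been introduced by a valid stamp."""
        self.known.add(sender)

    def report_spam(self, sender):
        """Feedback from the recipient: this address sent spam."""
        self.spam_reports[sender] = self.spam_reports.get(sender, 0) + 1

    def required_bits(self, sender):
        """Postage a stamper would be told to pay for this sender."""
        reports = self.spam_reports.get(sender, 0)
        if reports == 0 and sender in self.known and not self.always_stamp:
            return 0  # first-time-stamp mode: known clean senders go free
        return self.base_bits + self.penalty_bits * reports


policy = PostagePolicy()
policy.mark_known("friend@example.org")
policy.report_spam("zombie@example.net")
policy.report_spam("zombie@example.net")
```

With these assumed values a stranger pays 20 bits, the known friend pays nothing, and the twice-reported address pays 24 bits, so a spamming address gets more expensive with every report.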

You are falling into the classic trap of assuming that the cost of hardware means something. This is the fallacy behind the Ben Laurie paper. It's important to remember that the cost per stamp drops with every stamp generated with a given piece of hardware. The first step is
I didn't read the Ben Laurie paper, if I should do, because it proves my
below arguments wrong, please tell me :)
Of course, sender stamps can only reduce the total amount of spam by a
linear factor. A big linear factor probably, but linear.
However, the nice thing about sender stamps is that this linear factor
is very easily adjustable to the average current hardware out there. And
this is why hardware costs begin to mean something.

Stamp "size" shifts with time as hardware performance improves. But if you lay out $1000 for a fast machine, every stamp you calculate drops the cost per stamp, ranging from a very expensive first stamp to the $1000 spread over the number of stamps per day * 365. Which means stamps are frighteningly cheap if you use that metric. Even with power and cooling averaged in, it's still amazingly cheap. As a friend said, how expensive can it be to run a dozen PCs in a back room in Jakarta with the fan blowing in "cooler" air.
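
To put rough numbers on that amortization, here is a small Python sketch. The $1000 figure is from the paragraph above; the 10 seconds of CPU per stamp is purely an assumption for illustration, and power and cooling are left out.

```python
def cost_per_stamp(hardware_cost, stamps_per_day, days):
    """Hardware cost spread evenly over every stamp minted so far."""
    total_stamps = stamps_per_day * days
    if total_stamps <= 0:
        raise ValueError("need at least one stamp")
    return hardware_cost / total_stamps


HARDWARE_COST = 1000.0     # dollars, from the text above
SECONDS_PER_STAMP = 10     # assumption: CPU time needed for one stamp
stamps_per_day = 86400 // SECONDS_PER_STAMP   # 8640 stamps per day

first_stamp = cost_per_stamp(HARDWARE_COST, 1, 1)                 # the very first stamp
after_year = cost_per_stamp(HARDWARE_COST, stamps_per_day, 365)   # one year of minting

print(f"first stamp: ${first_stamp:.2f}")
print(f"after one year: ${after_year:.6f} per stamp")
```

Under those assumptions the first stamp carries the whole $1000, while after a year of continuous minting each stamp costs a small fraction of a cent.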

By just calculating the "average price" (quantiled average over the size
of stamps - quantiled to prevent DoS) of all emails you get, your MUA
can easily find out how much it *needs* to pay for the stamp to get a
good probability for the delivery of your mails. Of course, it can
always calculate bigger stamps, if it or his user likes. By using a
min() function over the above average and what the MUA is able to
calculate within a user-defined time, the above average slides over
time and thus adjusts to the average hardware out there and to what
users are willing to pay.
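
One possible reading of this scheme, sketched in Python. The trimmed mean stands in for the "quantiled average", and the trim percentage, hash rates and time budget are all assumptions made for illustration; stamp size is counted in hashcash bits, where an n-bit stamp costs about 2**n hashes on average.

```python
import math


def quantiled_average(bits_seen, trim_pct=10):
    """Mean stamp size after dropping the top and bottom trim_pct percent."""
    if not bits_seen:
        raise ValueError("no stamps observed yet")
    ordered = sorted(bits_seen)
    k = len(ordered) * trim_pct // 100      # values cut from each end
    kept = ordered[k:len(ordered) - k]
    return sum(kept) / len(kept)


def affordable_bits(hashes_per_second, time_budget_seconds):
    """Largest n such that 2**n expected hashes fit in the time budget."""
    return math.floor(math.log2(hashes_per_second * time_budget_seconds))


def choose_stamp_bits(bits_seen, hashes_per_second, time_budget_seconds):
    """min() of the going rate and what this machine can afford."""
    going_rate = math.ceil(quantiled_average(bits_seen))
    return min(going_rate, affordable_bits(hashes_per_second, time_budget_seconds))


# Stamp sizes seen on incoming mail; the 60-bit outlier is a DoS attempt.
observed = [20, 21, 22, 22, 23, 24, 20, 21, 22, 60]
fast = choose_stamp_bits(observed, 1_000_000, 10)   # desktop-class machine
slow = choose_stamp_bits(observed, 10_000, 10)      # PDA-class machine
```

With these numbers the outlier is trimmed away, the fast machine pays the going rate of 22 bits, and the slow one is capped at the 16 bits it can compute in its budget.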

On the other side, MTAs, i.e. spam filters, can adjust their
price-acceptance function for sender stamps just as simply.

This is why the feedback mechanism proposed is a good idea. You don't need to guess the cost of a stamp, you just do it. In fact, the technique described has the advantage that it gradually renders all of the spam-generating zombie addresses effectively useless. This is not to say they can't deliver spam, it's just that it costs a whole lot to do so.

Of course the average price is hard for PDAs, slow machines etc.
However, at any time MTAs can calculate sender stamps themselves on
behalf of the sender (as sendmail-hashcash shows). So MTAs could
easily generate sender stamps for authenticated well-known clients.
Of course, the best solution would be some incremental algorithm, where
you can subsequently increase the stamp size just by investing a bit
more CPU time.

If you use the appropriate variable postage mechanism, once you establish communications with someone, postage ceases to be an issue. This becomes a freebie given to you by the service provider because you're paying them $30, $40, $75, or more a month. They can afford to spend some of the money you leave on the table for the five or 10 stamps a month you'll need. Yet you also have the option of saying "I'll take my chances" and not sending a stamp.

stay in business. The number of zombies will decrease and be more easily targeted.
Well, then users need to be willing to pay more for their own stamps.

If you pay attention to basic human factors, sure, they will. Generate in the background so the user doesn't see anything, and if you use variable postage mechanisms, the number of stamps per user per day will be trivially small.

This is probably a philosophical disagreement. I absolutely abhor false positives. I look in the dumpster maybe once every couple of months if somebody tells me something was lost. I look in my spam trap about once a week. If somebody is going to send me a message with a stamp, I have no problem with it coming through directly. If it's a spammer, I want to be able to mark it as spam and then permanently blacklist the IP address and tell all of my friends about it automatically.
Well, I personally think this is a bit of a blue-eyed point of view. If you
think this is really feasible, just think about why you don't just do
the same today without stamps.
The more stamps become widely accepted, the more spammers will use them
as well. And... wasn't this the idea anyways? Spammers should be forced
to use them to increase the cost for spam :)

Well, it's how I live, and without stamps. It's been rather successful I might add. Anyone who uses twopenny blue also basically lives the same way. If I didn't have this ability to not look at my spam trap, I probably would have ditched e-mail a long time ago for something more useful like the telephone.

But the addition of stamps as filter bypass improves the quality of system behavior because it now becomes predictable. It used to be that e-mail was predictable, for the most part, in that it either was delivered or it wasn't. It was so reliable that people didn't care about the unreliability warnings, it just worked. But now with spam and the probabilistic content filters, e-mail has become unreliable because it's unpredictable. You have no idea if a customer's spreadsheet with HTML framework talking about the shipment of strawberries out of California is going to pass your content filter or not. This is unacceptable for businesses.

Lest you think I'm making this scenario up, this is one of many I've lived with one of my customers, a fruit and vegetable wholesaler. Their salespeople send invoices, quotes etc. and receive the same by e-mail. If e-mail is down, you can hear the thousand dollar counter clicking rather rapidly in the background. An e-mail lost in a spam trap for a day can literally cost them tens of thousands of dollars. And this is a small operation. If you want e-mail to become reliable again, you need a predictable and deterministic event which says "this stamp will get through". If there is some way to combine the two models, I'm open. Let's see what can fly for those who don't really care if a message arrives versus those that do.

To use a stamp or even a stamp size as a scoring factor actually works in the spammer's favor. By crafting a message the right way and just putting on a little stamp, maybe 10 seconds' worth, they would be able to almost guarantee delivery. While at the same time, you would still end
Hehe, so there are methods out there to reduce the amount of work that
is needed to calculate a stamp? :)
If not: the automatic adaptation of MUAs and MTAs to the stamp size works
against spammers using too small stamps.

I was concerned that stamps as a modifier for content filter scores could give spammers a leg up at making their messages more visible for very little work. It's very simple to analyze the reduction of spam on the net using a stamp as filter bypass. You can only know what effect a small stamp would have in conjunction with a filter if you looked at the scores of a test case and then offset some number of them with stamps. If I had the time to start analyzing, I would probably start with an even distribution of scores.
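
A toy version of that analysis, sketched in Python. The score range, the threshold and the size of the stamp offset are all made-up assumptions; the only thing taken from the text is the idea of starting from an even distribution of scores and offsetting them with a stamp.

```python
def delivered_fraction(scores, threshold, stamp_offset=0):
    """Fraction of messages whose score, minus any stamp bonus, stays under the threshold."""
    passed = sum(1 for score in scores if score - stamp_offset < threshold)
    return passed / len(scores)


# Assumption: spam scores spread evenly over 0..99, anything at 50 or above is trapped.
spam_scores = list(range(100))
THRESHOLD = 50
SMALL_STAMP_OFFSET = 20    # assumption: score bonus a small stamp earns

without_stamp = delivered_fraction(spam_scores, THRESHOLD)
with_stamp = delivered_fraction(spam_scores, THRESHOLD, SMALL_STAMP_OFFSET)

print(f"spam delivered without stamp: {without_stamp:.0%}")
print(f"spam delivered with small stamp: {with_stamp:.0%}")
```

In this toy case a small stamp lifts spam delivery from 50% to 70%, which is the leg up being worried about. Real score distributions are not even, so the numbers only show the shape of the effect.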

This is another reason for direct delivery on stamps. Your stamp is an introducer. It guarantees delivery to the inbox. This is a win. This means customers don't have to worry about their mail getting through.
This is also a good reason for adaptive stamp sizes: You yourself can
increase the chance for your mail to get delivered by just paying more.
So on the one hand companies could accept small stamps in mails to their
support-addresses to increase their chance that they miss no customer
mail and on the other hand they could just pay enough for their own
mailings to make sure they get read.

come up with a model, I would like to see it. Personally I think a real-time dynamic pricing structure is far more appropriate because it can:


  o reduce stamp load on legitimate senders
  o increase costs on commercial and spam mail
  o make systems more resilient with regard to Moore's Law inflation

  o make systems more resilient in the face of concentrated computation attacks (i.e. lots of zombies generating stamps aimed at a very small number of machines)

remember, transition costs are really expensive. We want to do it good
Using stamps as just another spam +/- indicator plus it being adaptive
is IMHO a really simple transition strategy.

Read the archive. There's lots of geek psychological resistance to using stamps. I'm not going to go into it again.


---eric

PS, here are a few thoughts about the psychology of spam filters and their owners. I need to repost it somewhere and I'm not sure where quite yet.


--- Spam filters are like dogs. ---

Spam filters are very much like dogs. The similarity is apparent to anyone with experience with both. They both need training, they both need daily care, and they both require dedicated owners.

But the similarity goes beyond this. They are both used in competitive events where they are judged on how high a score they can get. Spam filters are rated on their percentage accuracy. One filter that I know quite well, CRM114, boasts a highly impressive 99.99% classification accuracy rate. For dogs there are competitive obedience trials where they are scored on how accurately they perform the exercises. There are dogs I know scoring 198 out of 200 points at top-level difficulty dog obedience trials.

My dog on the other hand knows basic obedience, is reasonably well mannered, yet barks her fool head off any time anyone makes a noise outside the house. My spam filter accuracy runs around 90%. Fortunately I have the rest of the camram system (http://www.camram.org) to make up the difference and make spam a non-issue.

But back to the comparison. What's the difference between my dog and the high-scoring obedience dogs? Breed, temperament, and most importantly, owner dedication. I'm willing to spend, in the beginning, the dedicated 15 week, 5 hrs/week effort it takes to train my dog to obey me. In that same 15 weeks, the dog teaches me to hear a little bit about how it works. Then, I spend the rest of its life communicating in the way that it wants to hear and reinforcing good behavior whenever possible. As a result, I have a reasonably well-behaved dog that is not going to win any prizes at an obedience trial.

In contrast, the high-scoring obedience folks work with their dogs four or five hours every single day, really intense training of both themselves and the dog so that they can get those high scores. They get inside the dog's head and understand how it learns, how it will best hear the trainer. This training process is the owner's life. They live with and for the dog.

How does this relate to spam filters? High scoring spam filtration only happens if you dedicate your life to the spam filter, work with it every single day, and learn how to train it in the way it wants you to. A spam filter is not something you can train intensively in the beginning and then just kind of leave alone. It needs constant attention in order to keep it working right.

There's one more set of comparisons about spam filters and dogs. Dogs and spam filters both have accidents and leave something unpleasant where you need to deal with it. A major difference is that dogs can train you to recognize their signals and need to go outside so that they won't have accidents. Spam filters keep giving you little presents in your mailbox every so often. Dogs and spam filters also chew on things you don't want them to chew on. With dogs, you usually know when they've chewed on something. Not so with spam filters.

When it comes right down to it, most people find this concept of living for a piece of software repugnant. They want to come into work, get the job done and that does not involve satisfying the attentional demands of spam filters. Most people would also agree that any system which loses information silently or forces you to go through all the spam anyway to find what was lost is flawed.

Given the choice, I, like most people, prefer to live with a dog because you get something worthwhile back from that relationship.

