- body
- the part of a message that comes after the
headers. Separated from the headers by a blank line.
- Bubblegum proxypot
- the name of a specific program written by me, Alan Curry, the owner of
proxypot.org. It is an example of a proxypot.
- cygwin
- a package that is capable of providing a POSIX-like environment on an
unmentionable operating system. It enables the running of
Bubblegum proxypot.
- DCC
- Distributed Checksum Clearinghouse. A system that tries to identify bulk
mail, so that people can reject bulk mail from unknown sources.
Bubblegum proxypot can submit messages to a
DCC server, which provides DCC users with more raw information to help them
decide what messages should be rejected.
- envelope
- the sender and recipient addresses in an SMTP
session. These are separate from the addresses in the
message headers and bodyforged. The envelope
recipient(s) may or may not match the contents of the To: and Cc: headers. In
a normal SMTP server, the envelope recipients are the addresses where the
message will actually be delivered. In a
Bubblegum proxypot mbox
or maildir, the envelope sender is recorded in the
Return-Path header and the envelope recipients are recorded in Delivered-To
headers. They will be above the proxypot
Received header, which distinguishes them from
the spammer's forged headers below it.
- forgery
- false addresses that are usually placed in the
headers of spam. Sometimes forged addresses are random
garbage followed by a well-known domain name. Other times the forged
addresses are real and belong to the spammer's enemy. Almost everything you
see in the headers of a spam message is forged. The only headers that can be
trusted are the Received headers, and only the
ones that were added after the spammer sent the message. The most
trustworthy Received header is the one closest to the top. See also
spoof.
- headers
- the top part of a message, containing information about the sender of the
message, the recipient(s), the subject, and the method by which it was
transmitted. In a spam message, almost all the headers are
forged. Most mailreaders do not display all the
headers of a message unless specifically told to do so.
- honeypot
- a system designed to appear vulnerable to attack. When an attacker
accesses a honeypot, the honeypot tries to create the illusion that the
attacker is successfully evading detection, while actually he is being
watched.
- maildir
- a mail storage format in which every message is kept in a separate file,
and meta-information flags (like which messages have been seen by the user) is
manipulated by renaming the files, instead of altering the contents. This
format is safer than mbox when multiple programs are
accessing the mail store at the same time, and faster tham mbox when messages
are deleted or flags are changed. Maildir format is supported by many
mailreaders and POP/IMAP servers.
- mbox
- the traditional mail storage format of UNIX, where an entire mailbox is
in one file, and the messages in the file are separated by "From_" lines
(lines that begin with the word "From" followed by a space). It is inferior
to maildir in several ways, but it has the support of
some old-fashioned UNIX mailreaders, which makes it still occasionally
useful.
- obfuscate
- to write a message in a way that makes it hard to understand. In
particular, web site links found in spam messages are frequently obfuscated
to prevent spam victims from figuring out where the web
site is located, and complaining to the web site's ISP. Obfuscation is
sometimes done with a legitimate, but unnecessary, message encoding. Other
times it is done by exploiting bugs in commonly used mailreaders - the
spammer doesn't care if the link is technically invalid as long as it works
on the systems used by the people he's targeting.
- open proxy
- a proxy server that offers unrestricted, anonymous
access to anyone and everyone. Usually, in the context of spam, this term
refers to proxies that allow the user to create arbitrary
TCP connections. SOCKS proxies and HTTP proxies with SSL
support meet that requirement. Other types of proxies can be "open" too, but
they are less useful to spammers.
- open proxy honeypot
- See proxypot.
- perl
- the programming language used to create
Bubblegum proxypot. To run it you need the
perl interpreter and modules, available from
perl.com. If your operating system comes
from Redmond, you also need cygwin.
- proxy
- something or someone who does something on behalf of someone else.
Proxies are used on the Internet for several legitimate reasons: some
increase speed of web access by caching web pages, some provide an indirect
method of Internet access for computers that aren't be connected directly,
and some are used to impose restrictions on what Internet resources may be
accessed from a particular location. A good proxy only fulfills requests from
people it can identify, usually those "inside" a specific home or business
network, and keeps track of who did what. The bad proxies are
open proxies.
- proxypot
- a honeypot that disguises itself as an
open proxy.
- Received header
- the header that traces the path from a message
sender to the recipient. When a message is first created, it should have no
Received headers (but sometimes a spammer will forge
some). When the message is injected into SMTP, the first
server adds a Received header to the top. That server passes the message to
another server, which adds another Received header over the first one, and so
on until the recipient's mail server delivers it. The legitimate Received
headers are always above the forged ones.
Bubblegum proxypot adds a Received header
when it delivers messages to an mbox or
maildir. It doesn't look exactly like a normal
Received header because a proxypot is hardly a normal
SMTP server, but servers the same purpose of providing information on the
method and path of transmission of the message.
- response
- an indication of success or failure given by an SMTP
server to a client that is attempting to send a message. Each piece of
envelope information sent by the client is followed
by a response from the server. Imitating these responses accurately without
bothering the real SMTP server is one of the biggest challenges in designing
a proxypot.
- session
- a series of actions beginning when a client connects to a server, and
ending with a disconnection. An SMTP session can consist
of a series of transactions and can last a long
time, but a typical spam SMTP session lasts only a few seconds and contains
a single transaction.
- SMTP
- Simple Mail Transport Protocol. Every domain that wants to receive mail
from the Internet must have an SMTP server. Mail senders must connect to the
destination domain's SMTP server to send messages there. An SMTP server
records the IP address of the sender in a
Received header.
- spam
- a worthless message that is repeated often enough and loudly enough to
inhibit normal conversation. This general concept has been applied to many
media, including message boards, chat rooms, and of course e-mail. Spammers
are thieves, stealing space in your mailbox, that you paid for so you could
use it for private discussions, and reselling it without your knowledge or
consent.
- spoof
- equivalent to forge, but more commonly used to
describe the forging of IP addresses, which is generally harder to do than
forging headers.
- TCP
- Transmission Control Protocol, the protocol underneath
SMTP and many other Internet applications. TCP
provides the mechanism for a client to connect to a server and send an
ordered stream of data. How that data is interpreted depends on the upper
level protocol - TCP only sees a sequence of bytes. The most dangerous type
of open proxy is one that allows clients to make
any TCP connection and send any data it wants.
- transaction
- in SMTP, a series of commands beginning with MAIL
FROM (which specifies the envelope sender), followed
by RCPT TO (once for each envelope recipient), then
DATA (which introduces the message headers and
body). A single transaction transfers a single message,
which may have multiple recipients.
- victim
- spam recipient. If it's in your mailbox without your permission, it's a
theft of a resource that belongs to you. It's not just a minor disagreement,
it's an intrusion, and that's why spam recipients are properly called victims.