mirror of https://github.com/nodemailer/wildduck.git synced 2024-09-20 15:26:03 +08:00

Opinionated email server

imap-server lmtp mongodb pop3-server starred-nodemailer-repo starred-repo

Go to file

Andris Reinman 3882be5f00 updated api		2017-05-23 20:12:11 +03:00
config	Allow setting default retention to all mailboxes	2017-05-22 10:03:30 +03:00
examples	Added message filtering for LMTP	2017-04-17 15:58:46 +03:00
imap-core	use async hashing	2017-05-23 19:11:42 +03:00
lib	ignore common headers	2017-05-23 17:17:54 +03:00
.eslintrc	Initial preview	2017-03-05 23:45:50 +02:00
.gitignore	Working POP3 implementation	2017-04-09 12:33:10 +03:00
api.js	updated api	2017-05-23 20:12:11 +03:00
Gruntfile.js	Initial preview	2017-03-05 23:45:50 +02:00
imap.js	v1.0.37	2017-05-22 10:35:58 +03:00
indexes.json	ignore common headers	2017-05-23 17:17:54 +03:00
LICENSE	test	2017-05-23 19:42:53 +03:00
lmtp.js	deduplicate attachments	2017-05-15 16:09:08 +03:00
logger.js	SUpport logging to syslog and multiple processes	2017-03-11 17:21:08 +02:00
package.json	v1.0.39	2017-05-23 19:17:38 +03:00
pop3.js	v1.0.26. Delete messages from Trash and Junk after 30 days	2017-05-02 16:17:37 +03:00
README.md	Allow setting default retention to all mailboxes	2017-05-22 10:03:30 +03:00
server.js	updated thread pool size	2017-05-23 20:05:16 +03:00
worker.js	Replaced SMTP with LMTP	2017-04-12 14:52:29 +03:00

README.md

Agent Wild Duck

Wild Duck is a distributed IMAP/POP3 server built with Node.js, MongoDB and Redis. Node.js runs the application, MongoDB is used as the mail store and Redis is used for ephemeral actions like publish/subscribe, locking and caching.

NB! Wild Duck is currently in beta. You should not use it in production.

Goals of the Project

Build a scalable and distributed IMAP/POP3 server that uses clustered database instead of single machine file system as mail store
Allow using internationalized email addresses
Provide Gmail-like features like pushing sent messages automatically to Sent Mail folder or notifying about messages moved to Junk folder so these could be marked as spam
Provide parsed mailbox and message data over HTTP. This should make creating webmail interfaces super easy, no need to parse RFC822 messages to get text content or attachments

Alternatives

Here's a list of email/IMAP servers that use database for storing email messages

DBMail (IMAP)
Archiveopteryx (IMAP)
ElasticInbox (POP3)

Supported features

Wild Duck IMAP server supports the following IMAP standards:

The entire IMAP4rev1 suite with some minor differences from the spec. See below for IMAP Protocol Differences for a complete list
IDLE (RFC2177) – notfies about new and deleted messages and also about flag updates
CONDSTORE (RFC4551) and ENABLE (RFC5161) – supports most of the spec, except metadata stuff which is ignored
STARTTLS (RFC2595)
NAMESPACE (RFC2342) – minimal support, just lists the single user namespace with hierarchy separator
UNSELECT (RFC3691)
UIDPLUS (RFC4315)
SPECIAL-USE (RFC6154)
ID (RFC2971)
MOVE (RFC6851)
AUTHENTICATE PLAIN (RFC4959) and SASL-IR
APPENDLIMIT (RFC7889) – maximum global allowed message size is advertised in CAPABILITY listing
UTF8=ACCEPT (RFC6855) – this also means that Wild Duck natively supports unicode email usernames. For example <андрис@уайлддак.орг> is a valid email address that is hosted by a test instance of Wild Duck
QUOTA (RFC2087) – Quota size is global for an account, using a single quota root. Be aware that quota size does not mean actual byte storage in disk, it is calculated as the sum of the RFC822 sources of stored messages. Actual disk usage is larger as there are database overhead per every message.
COMPRESS=DEFLATE (RFC4978) – Compress traffic between the client and the server

Wild Duck more or less passes the ImapTest. Common errors that arise in the test are unknown labels (Wild Duck doesn't send unsolicited FLAGS updates even though it does send unsolicited FETCH FLAGS updates) and sometimes NO for STORE (messages deleted in one session can not be updated in another).

POP3 Support

In addition to the required POP3 commands (RFC1939) Wild Duck supports the following extensions:

UIDL
USER
PASS
SASL PLAIN
PIPELINING
TOP

POP3 command behaviors

All changes to messages like deleting messages or marking messages as seen are stored in storage only in the UPDATE stage (eg. after calling QUIT). Until then the changes are preserved in memory only. This also means that if a message is downloaded but QUIT is not issued then the message does not get marked as Seen.

LIST

POP3 listing displays the newest 250 messages in INBOX (configurable)

UIDL

Wild Duck uses message _id value (24 byte hex) as the unique ID. If a message is moved from one mailbox to another then it might re-appear in the listing.

RETR

If a messages is downloaded by a client this message gets marked as Seen

DELE

If a messages is deleted by a client this message gets marked as Seen and moved to Trash folder

FAQ

Does it work?

Yes, it does. You can run the server and get working IMAP and POP3 servers for mail store, LMTP server for pushing messages to the mail store and HTTP API server to create new users. All handled by Node.js, MongoDB and Redis, no additional dependencies needed. Provided services can be disabled and enabled one by one so, for example you could process just IMAP in one host and LMTP in another.

What are the killer features?

Start as many instances as you want. You can start multiple Wild Duck instances in different machines and as long as they share the same MongoDB and Redis settings, users can connect to any instances. This is very different from the traditional IMAP servers where a single user always needs to connect (or be proxied) to the same IMAP server. Wild Duck keeps all required state information in MongoDB, so it does not matter which IMAP instance you use.
Super easy to tweak. The entire codebase is pure JavaScript, so there's nothing to compile or anything platform specific. If you need to tweak something then change the code, restart the app and you're ready to go. If it works on one machine then most probably it works in every other machine as well.
Works almost on any OS including Windows. At least if you get MongoDB and Redis (Windows fork) running first.
Focus on internationalization, ie. supporting email addresses with non-ascii characters
+-labels: андрис+ööö@уайлддак.орг is delivered to андрис@уайлддак.орг
Access messages both using IMAP and HTTP API. The latter serves parsed data, so no need to fetch RFC822 messages and parse out html, plaintext content or attachments. It is super easy to create a webmail interface on top of this.
Deduplication of attachments. If the same attachment is referenced by different messages then only a single copy of the attachment is stored. Attachment is stored in the encoded form (eg. encoded in base64) to not break any signatures so the resulting encoding must match as well.

Isn't it bad to use a database as a mail store?

Yes, historically it has been considered a bad practice to store emails in a database. And for a good reason. The data model of relational databases like MySQL does not work well with tree like structures (email mime tree) or large blobs (email source).

Notice the word "relational"? In fact document stores like MongoDB work very well with emails. Document store is great for storing tree-like structures and while GridFS is not as good as "real" object storage, it is good enough for storing the raw parts of the message. Additionally there's nothing too GridFS specific, so (at least in theory) it could be replaced with any object store.

You can see an example mail entry here. Lines 184-217 demonstrate a node that has its body missing as it was big enough to be moved to GridStore and not be included with the main entry.

Is the server scalable?

Somewhat yes. Even though on some parts Wild Duck is already fast (Wild Duck is successfully tested with mailboxes up to 200K messages), there are still some important improvements that need to be done:

Optimize FETCH queries to load only partial data for BODY subparts
Parse incoming message into the mime tree as a stream. Currently the entire message is buffered in memory before being parsed.
CPU usage seems a bit too high, there is probably a ton of profiling to do

How does it work?

Whenever a message is received Wild Duck parses it into a tree-like structure based on the MIME tree and stores this tree to MongoDB. Attachments are removed from the tree and stored separately in GridStore. If a message needs to be loaded then Wild Duck fetches the tree structure first and, if needed, loads attachments from GridStore and then compiles it back into the original RFC822 message. The result should be identical to the original messages unless the original message used unix newlines, these might be partially replaced with windows newlines.

Wild Duck tries to keep minimal state for sessions (basically just a list of currently known UIDs and latest MODSEQ value) to be able to distribute sessions between different hosts. Whenever a mailbox is opened the entire message list is loaded as an array of UID values. The first UID in the array element points to the message #1 in IMAP, second one points to message #2 etc.

Actual update data (information about new and deleted messages, flag updates and such) is stored to a journal log and an update beacon is propagated through Redis pub/sub whenever something happens. If a session detects that there have been some changes in the current mailbox and it is possible to notify the user about it (eg. a NOOP call was made), journaled log is loaded from the database and applied to the UID array one action at a time. Once all journaled updates have applied then the result should match the latest state. If it is not possible to notify the user (eg a FETCH call was made), then journal log is not loaded and the user continues to see the old state.

Future considerations

Add interoperability with current servers, for example by fetching authentication data from MySQL
Maybe allow some kind of message manipulation through plugins? This would allow to turn Wild Duck for example into an encrypted mail server – mail data would be encrypted using users public key before storing it to DB and decrypted with users private key whenever the user logs in and FETCHes or SEARCHes messages. Private key would be protected by users password. For the user the encryption layer would be invisible while guaranteeing that if the user is currently not logged in then there would be no way to read the messages as the private key is locked.

Usage

Assuming you have MongoDB and Redis running somewhere.

Step 1. Get the code from github

$ git clone git://github.com/wildduck-email/wildduck.git
$ cd wildduck

Step 2. Install dependencies

Install dependencies from npm

$ npm install --production

Step 3. Modify config

You can either modify the default config file or alternatively generate an environment related config file that gets merged with the default values. Read about the config module here

Step 4. Run the server

To use the default config file, run the following

npm start

Or if you want to use environment related config file, eg from production.js, run the following

NODE_ENV=production npm start

Step 5. Create an user account

See see below for details about creating new user accounts

HTTP API

Users, mailboxes and messages can be managed with HTTP requests against Wild Duck API

TODO:

Expose counters (seen/unseen messages, message count in mailbox etc.)
Search messages
Expose journal updates through WebSocket or similar

POST /user/create

Creates a new user.

Arguments

username is the username of the user. This is not an email address but authentication username, use only letters and numbers
password is the password for the user
quota (optional) is the maximum storage in bytes allowed for this user. If not set then the default value is used
retention (optional) is the default retention time in ms for mailboxes. Messages in Trash and Junk folders have a maximum retention time of 30 days.

Example

curl -XPOST "http://localhost:8080/user/create" -H 'content-type: application/json' -d '{
  "username": "testuser",
  "password": "secretpass"
}'

The response for successful operation should look like this:

{
  "success": true,
  "username": "testuser"
}

After you have created an user you can use these credentials to log in to the IMAP server. To be able to receive mail for that user you need to register an email address.

POST /user/address/create

Creates a new email address alias for an existing user. You can use internationalized email addresses like андрис@уайлддак.орг.

Arguments

username is the username
address is the email address to use as an alias for this user
main (either true or false, defaults to false) indicates that this is the default address for that user

First added address becomes main by default

Example

curl -XPOST "http://localhost:8080/user/address/create" -H 'content-type: application/json' -d '{
  "username": "testuser",
  "address": "user@example.com"
}'

The response for successful operation should look like this:

{
  "success": true,
  "username": "testuser",
  "address": "user@example.com"
}

After you have registered a new address then LMTP maildrop server starts accepting mail for it and store the messages to the users mailbox.

POST /user/quota

Updates maximum allowed quota for an user

Arguments

username is the username of the user to modify
quota (optional) is the maximum storage in bytes allowed for this user
recipients (optional) is the maximum sending recipients per 24h allowed for this user. Assumes ZoneMTA with zonemta-wildduck plugin
forwards (optional) is the maximum forwarded recipients per 24h allowed for this user.

At least one limit value must be set

Example

curl -XPOST "http://localhost:8080/user/quota" -H 'content-type: application/json' -d '{
  "username": "testuser",
  "quota": 1234567,
  "recipients": 500
}'

The response for successful operation should look like this:

{
  "success": true,
  "username": "testuser",
  "quota": 1234567,
  "recipients": 500
}

Quota changes apply immediately.

POST /user/quota/reset

Recalculates used storage for an user. Use this when it seems that quota counters for an user do not match with reality.

Arguments

username is the username of the user to check

Example

curl -XPOST "http://localhost:8080/user/quota/reset" -H 'content-type: application/json' -d '{
  "username": "testuser"
}'

The response for successful operation should look like this:

{
  "success": true,
  "username": "testuser",
  "previousStorageUsed": 1000,
  "storageUsed": 800
}

Be aware though that this method is not atomic and should be done only if quota counters are way off.

POST /user/password

Updates password for an user

Arguments

username is the username of the user to modify
password is the new password for the user

Example

curl -XPOST "http://localhost:8080/user/password" -H 'content-type: application/json' -d '{
  "username": "testuser",
  "password": "newpass"
}'

The response for successful operation should look like this:

{
  "success": true,
  "username": "testuser"
}

Password change applies immediately.

GET /user

Returns user information including quota usage and registered addresses

Arguments

username is the username of the user to modify

Example

curl "http://localhost:8080/user?username=testuser"

The response for successful operation should look like this:

{
  "success": true,
  "username": "testuser",
  "quota": 1234567,
  "storageUsed": 1822,
  "recipients": 500,
  "recipientsLimited": false,
  "recipientsSent": 47,
  "recipientsTtl": 3392,
  "addresses": [
    {
      "id": "58d8fccb645b0deb23d6c37d",
      "address": "user@example.com",
      "main": true,
      "created": "2017-03-27T11:51:39.639Z"
    }
  ]
}

Where

recipients – is the count of maximum recipients for 24 hour period (starts with the first message)
recipientsLimited – if true then sending is currently disabled as recipient limit has been reached
recipientsSent – how many recipients has been used in the current 24 hour period
recipientsTtl – seconds until the end of current period

Recipient limits assume that messages are sent using ZoneMTA with zonemta-wildduck plugin, otherwise the counters are not updated.

GET /user/mailboxes

Returns all mailbox names for the user

Arguments

username is the username of the user to modify

Example

curl "http://localhost:8080/user/mailboxes?username=testuser"

The response for successful operation should look like this:

{
  "success": true,
  "username": "testuser",
  "mailboxes": [
    {
      "id": "58d8f2ae240366dfd5d8049c",
      "path": "INBOX",
      "special": "Inbox",
      "messages": 100
    },
    {
      "id": "58d8f2ae240366dfd5d8049d",
      "path": "Sent Mail",
      "special": "Sent",
      "messages": 45
    },
    {
      "id": "58d8f2ae240366dfd5d8049f",
      "path": "Junk",
      "special": "Junk",
      "messages": 10
    },
    {
      "id": "58d8f2ae240366dfd5d8049e",
      "path": "Trash",
      "special": "Trash",
      "messages": 11
    }
  ]
}

GET /mailbox/:id

List messages in a mailbox.

Parameters

id is the mailbox ID
size is optional number to limit the length of the messages array (defaults to 20)
before is an optional paging number (see next in response)
after is an optional paging number (see prev in response)

Response includes the following fields

mailbox is an object that lists some metadata about the current mailbox
- id is the mailbox ID
- path is the folder path
next is an URL fragment for retrieving the next page (or false if there are no more pages)
prev is an URL fragment for retrieving the previous page (or false if it is the first page)
messages is an array of messages in the mailbox
- id is the message ID
- date is the date when this message was received
- ha is a boolean that indicates if this messages has attachments or not
- intro includes the first 256 characters from the message
- subject is the message title
- from is the From: field
- to is the To: field
- cc is the Cc: field
- bcc is the Bcc: field

The response for successful listing should look like this:

{
  "success": true,
  "mailbox": {
    "id": "58dbf87fcff690a8c30470c7",
    "path": "INBOX"
  },
  "next": "/mailbox/58dbf87fcff690a8c30470c7?before=34&size=20",
  "prev": false,
  "messages": [
    {
      "id": "58e25243ab71621c3890417e",
      "date": "2017-04-03T13:46:44.226Z",
      "ha": true,
      "intro": "Welcome to Ryan Finnie's MIME torture test. This message was designed to introduce a couple of the newer features of MIME-aware MUAs, features that have come around since the days of the original MIME torture test. Just to be clear, this message SUPPLEMENT…",
      "subject": "ryan finnie's mime torture test v1.0",
      "from": "ryan finnie <rfinnie@domain.dom>",
      "to": "bob@domain.dom"
    }
  ]
}

GET /message/:id

Retrieves message information

Parameters

id is the MongoDB _id as a string for a message
mailbox is optional Mailbox id. Use this to verify that the message is located at this mailbox

Example

curl "http://localhost:8080/message/58d8299c5195c38e77c2daa5"

Response message includes the following fields

id is the id of the message
headers is an array that lists all headers of the message. A header is an object:
- key is the lowercase key of the header
- value is the header value in unicode (all encoded values are decoded to utf-8). The value is capped at around 800 characters.
date is the receive date (not header Date: field)
mailbox is the id of the mailbox this messages belongs to
flags is an array of IMAP flags for this message
text is the plaintext version of the message (derived from html if not present in message source)
html is the HTML version of the message (derived from plaintext if not present in message source). It is an array of strings, each array element corresponds to different MIME node and might have its own html header
attachments is an array of attachment objects. Attachments can be shared between messages.
- id is the id of the attachment in the form of "ATT00001"
- fileName is the name of the attachment. Autogenerated from Content-Type if not set in source
- contentType is the MIME type of the message
- disposition defines Content-Disposition and is either 'inline', 'attachment' or false
- transferEncoding defines Content-Transfer-Encoding
- related is a boolean value that states if the attachment should be hidden (true) or not. Related attachments are usually embedded images
- sizeKb is the approximate size of the attachment in kilobytes

Embedded images

HTML content has embedded images linked with the following URL structure:

attachment:MESSAGE_ID/ATTACHMENT_ID

For example:

<img src="attachment:aaaaaa/bbbbbb">

To fetch the actual attachment contents for this image, use the following url:

http://localhost:8080/message/aaaaaa/attachment/bbbbbb

Example response

The response for successful operation should look like this:

{
  "success": true,
  "message": {
    "id": "58d8299c5195c38e77c2daa5",
    "mailbox": "58dbf87fcff690a8c30470c7",
    "headers": [
      {
        "key": "delivered-to",
        "value": "andris@addrgw.com"
      }
    ],
    "date": "2017-04-03T10:34:43.007Z",
    "flags": ["\\Seen"],
    "text": "Hello world!",
    "html": ["<p>Hello world!</p>"],
    "attachments": [
      {
        "id": "ATT00001",
        "fileName": "image.png",
        "contentType": "image/png",
        "disposition": "attachment",
        "transferEncoding": "base64",
        "related": true,
        "sizeKb": 1
      }
    ]
  }
}

GET /message/:mid/attachment/:aid

Retrieves an attachment of the message

Parameters

mid is the message ID
aid is the attachment ID

Example

curl "http://localhost:8080/message/58d8299c5195c38e77c2daa5/attachment/ATT00001"

GET /message/:id/raw

Retrieves RFC822 source of the message

Parameters

id is the MongoDB _id as a string for a message
mailbox is optional Mailbox id. Use this to verify that the message is located at this mailbox

Example

curl "http://localhost:8080/message/58d8299c5195c38e77c2daa5/raw"

DELETE /message/:id

Deletes a message from a mailbox.

Parameters

id is the MongoDB _id as a string for a message
mailbox is an optional Mailbox id. Use this to verify that the message to be deleted is located at this mailbox

Example

curl -XDELETE "http://localhost:8080/message/58d8299c5195c38e77c2daa5"

The response for successful operation should look like this:

{
  "success": true,
  "message":{
    "id": "58d8299c5195c38e77c2daa5"
  }
}

Message filtering

Wild Duck has built-in message filtering in LMTP server. This is somewhat similar to Sieve even though the filters are not scripts.

Filters are configuration objects stored in the filters array of the users object.

Example filter

{
    // identifier for this filter
    id: ObjectId('abcdefghij...'),

    // query to check messages against
    query: {
        // message must match all filter rules for the filter actions to apply
        // all values are case insensitive
        headers: {
            // partial string match against decoded From: header
            from: 'sender@example.com',
            // partial string match against decoded To: header
            to: 'recipient@example.com',
            // partial string match against decoded Subject: header
            subject: 'Väga tõrges'
        },

        // partial string match (case insensitive) against decoded plaintext message
        text: 'Mõigu ristis oli mis?',

        // positive: must have attachments, negative: no attachments
        ha: 1,

        // positive: larger than size, negative: smaller than abs(size)
        size: 10
    },
    // what to do if the filter query matches the message
    action: {

        // mark message as seen
        seen: true,

        // mark message as flagged
        flag: true,

        // set mailbox ID
        mailbox: 'aaaaa', // must be ObjectID!

        // positive spam, negative ham
        spam: 1,

        // if true, delete message
        delete: false
    }
}

NB! If you do not care about an action field then do not set it, otherwise matches from other filters do not apply

IMAP Protocol Differences

This is a list of known differences from the IMAP specification. Listed differences are either intentional or are bugs that became features.

\Recent flags is not implemented and most probably never will be (RFC3501 2.3.2.)
RENAME does not touch subfolders which is against the spec (RFC3501 6.3.5. If the name has inferior hierarchical names, then the inferior hierarchical names MUST also be renamed.). Wild Duck stores all folders using flat hierarchy, the "/" separator is fake and only used for listing mailboxes
Unsolicited FLAGS responses (RFC3501 7.2.6.) and PERMANENTFLAGS are not sent (except for as part of SELECT and EXAMINE responses). Wild Duck notifies about flag updates only with unsolicited FETCH updates.
Wild Duck responds with NO for STORE if matching messages were deleted in another session
CHARSET argument for the SEARCH command is ignored (RFC3501 6.4.4.)
Metadata arguments for SEARCH MODSEQ are ignored (RFC7162 3.1.5.). You can define <entry-name> and <entry-type-req> values but these are not used for anything
SEARCH TEXT and SEARCH BODY both use MongoDB $text index against decoded plaintext version of the message. RFC3501 assumes that it should be a string match either against full message (TEXT) or body section (BODY).
What happens when FETCH is called for messages that were deleted in another session? Not sure, need to check

Any other differences are most probably real bugs and unintentional.

Future considerations for IMAP extensions

Wild Duck does not plan to be the most feature-rich IMAP client in the world. Most IMAP extensions are useless because there aren't too many clients that are able to benefit from these extensions. There are a few extensions though that would make sense to be added to Wild Duck

IMAP4 non-synchronizing literals, LITERAL- (RFC7888). Synchronized literals are needed for APPEND to check mailbox quota, small values could go with the non-synchronizing version.
LIST-STATUS (RFC5819)
What else? (definitely not NOTIFY nor QRESYNC)

Testing

Create an email account and use your IMAP client to connect to it. To send mail to this account, run the example script:

node examples/push-mail.js username@example.com

This should "deliver" a new message to the INBOX of username@example.com by using the built-in LMTP maildrop interface. If your email client is connected then you should promptly see the new message.

Outbound SMTP

Use ZoneMTA with the ZoneMTA-WildDuck plugin. This gives you an outbound SMTP server that uses Wild Duck accounts for authentication.

Outbound SMTP

Use Haraka with queue/lmtp plugin. Wild Duck specific recipient processing plugin coming soon!

License

Wild Duck Mail Agent is licensed under the European Union Public License 1.1.

README.md Unescape Escape

Agent Wild Duck

Goals of the Project

Alternatives

Supported features

POP3 Support

POP3 command behaviors

LIST

UIDL

RETR

DELE

FAQ

Does it work?

What are the killer features?

Isn't it bad to use a database as a mail store?

Is the server scalable?

How does it work?

Future considerations

Usage

Step 1. Get the code from github

Step 2. Install dependencies

Step 3. Modify config

Step 4. Run the server

Step 5. Create an user account

HTTP API

POST /user/create

POST /user/address/create

POST /user/quota

POST /user/quota/reset

POST /user/password

GET /user

GET /user/mailboxes

GET /mailbox/:id

GET /message/:id

Embedded images

Example response

GET /message/:mid/attachment/:aid

GET /message/:id/raw

DELETE /message/:id

Message filtering

IMAP Protocol Differences

Future considerations for IMAP extensions

Testing

Outbound SMTP

Outbound SMTP

License

README.md