Requirements of TinyURL's Design
Requirements for URL Shortening Design
Let’s look at the functional and non-functional requirements for the service we’ll be designing:
Functional requirements
- Short URL generation: Our service should be able to generate a unique shorter alias of the given URL.
- Redirection: Given a short link, our system should be able to redirect the user to the original URL.
- Custom short links: Users should be able to generate custom short links for their URLs using our system.
- Deletion: Users should be able to delete a short link generated by our system, given the rights.
- Update: Users should be able to update the long URL associated with the short link, given the proper rights.
- Expiry time: There must be a default expiration time for the short links, but users should be able to set the expiration time based on their requirements.
Question
As a design choice, we don’t reuse the expired short URLs. Since we don’t reuse them, why do we need to delete them from our system?
So far, we’ve kept the default expiration time to five years. If we relax that limitation and start saving the records forever, our datastore’s search index will grow without bound, and querying time from it can add noticeable latency.
----------------
Non-functional requirements
- Availability: Our system should be highly available, because even a fraction of the second downtime would result in URL redirection failures. Since our system’s domain is in URLs, we don’t have the leverage of downtime, and our design must have fault-tolerance conditions instilled in it.
- Scalability: Our system should be horizontally scalable with increasing demand.
- Readability: The short links generated by our system should be easily readable, distinguishable, and typeable.
- Latency: The system should perform at low latency to provide the user with a smooth experience.
- Unpredictability: From a security standpoint, the short links generated by our system should be highly unpredictable. This ensures that the next-in-line short URL is not serially produced, eliminating the possibility of someone guessing all the short URLs that our system has ever produced or will produce.
Question
Why is producing unpredictable short URLs mandatory for our system?
The following two problems highlight the necessity of producing non-serial, unpredictable short URLs:
- Attackers can have access to the system-level information of the short URLs’ total count, giving them a defined range to plan out their attacks. This type of internal information shouldn’t be available outside the system.
- Our users might have used our service for generating short URLs for secret URLs. With the information above, the attackers can try to list all the short URLs, access them and gain insights about the associated long URLs, risking the secrecy of the private URLs. It will compromise the privacy of a user’s data, making our system less secure.
Hence, randomly assigning unique IDs deprives the attackers of such system insights, which are needed for enumerating and compromising the user’s private data.
----------------

Resource estimation
It’s better to have realistic estimations at the start. For instance, we might need to change them in the future based on the design modifications. Let’s make some assumptions to complete our estimation.
Assumptions
- We assume that the shortening:redirection request ratio is 1:1001:100.
- There are 200 million new URL shortening requests per month.
- A URL shortening entry requires 500 Bytes of database storage.
- Each entry will have a maximum of five years of expiry time, unless explicitly deleted.
- There are 100 million Daily Active Users (DAU).
Storage estimation
Since entries are saved for a time period of 5 years and there are a total of 200 million entries per month, the total entries will be approximately 12 billion.

URL Shortening Service Storage Estimation Calculator
URL shortening per month | 200 | Million |
---|---|---|
Expiration time | 5 | Years |
URL object size | 500 | Bytes |
Total number of requests | f12 | Billion |
Total storage | f6 | TB |

Query rate estimation
Based on the estimations above, we can expect 20 billion redirection requests per month.

Bandwidth estimation

Memory estimation
We need memory estimates in case we want to cache some of the frequently accessed URL redirection requests. Let’s assume a split of 80-20 in the incoming requests. 20 percent of redirection requests generate 80 percent of the traffic.

URL Shortening Service Estimates Calculator
URL shortening per month | 200 | Million |
---|---|---|
URL redirection per month | f20 | Billion |
Query rate for URL shortening | f76 | URLs / s |
Query rate for URL redirection | f7600 | URLs / s |
Single entry storage size | 500 | Bytes |
Incoming data | f304 | Kbps |
Outgoing data | f30.4 | Mbps |
Cache memory | f66 | GB |
Number of servers estimation
We adopt the same approximation discussed in the back-of-the-envelope calculations to calculate the number of servers needed: the number of daily active users and the daily user handling limit of a server are the two main factors in depicting the total number of servers required. According to the approximation, we need to divide the Daily Active Users (DAU) by 8000 to calculate the approximated number of servers.

Summarizing estimation
Based on the assumption above, the following table summarizes our estimations:
Type of operation | Time estimates |
---|---|
New URLs | 76/s |
URL redirections | 7.6 K/s |
Incoming data | 304 Kbps |
Outgoing data | 30.4 Mbps |
Storage for 5 years | 6 TB |
Memory for cache | 66 GB |
Servers | 12500 |
Building blocks we will use
With the estimations done, we can identify the key building blocks in our design. Such a list is given below:

- Database(s) will be needed to store the mapping of long URLs and the corresponding short URLs.
- Sequencer will provide unique IDs that will serve as a starting point for each short URL generation.
- Load balancers at various layers will ensure smooth requests distribution among available servers.
- Caches will be utilized to store the most frequent short URLs related requests.
- Rate limiters will be used to avoid system exploitation.
Besides these building blocks, we'll also need the following additional components to achieve the desired service:
- Servers to handle and navigate the service requests along with running the application logic.
- A Base-58 encoder to transform the sequencer’s numeric output to a more readable and usable alphanumeric form.