Amazon S3 (Simple Storage Service)
Amazon S3 is object storage built to store and retrieve any amount of data from anywhere on the Internet.
It’s a simple storage service that offers an extremely durable, highly available, and infinitely scalable data storage infrastructure at very low costs. Now in this statement you need to understand the meaning of durable, highly available and scalable:
· Durable: With respect to an object stored in S3, durability means the probability that the object will remain intact and accessible after a period of one year. 100% durability would mean that there’s no possible way for the object to be lost, 90% durability would mean that there’s a 1-in-10 chance, and so forth.
- Using the definition that I stated above, the durability of an object stored in Amazon S3 is 99.999999999 %( IMP). If you store 10,000 objects with AWS-S3, on average you may lose one of them every 10 million years or so which is practically no loss at all. This storage is designed in such a way that it can sustain the concurrent loss of data in two separate storage facilities.
· Highly available: Availability refers to the fact that even a data center fails whtether your data is available to users or not. Though S3 availability depends on kind of storage you chose but the least availability is 99.9%.
· Scalable: this refers to the fact that how much data you can store in S3 which is practically unlimited with one file size upto 5 terabytes. Individual Amazon S3 objects can range in size from a minimum of 0 bytes to a maximum of 5 terabytes
Amazon S3 is a distributed architecture and objects are redundantly stored on
multiple devices across multiple facilities (Availability zones) in an Amazon
S3 region.By object I mean things like flat files, basically. So Word documents,
pictures, movies etc.So S3 is basically a safe place
to store your files.
There is unlimited storage and files are stored in these things called buckets. Now, when I first heard about buckets, it sounded a bit strange. But a bucket is basically just a folder, that's all it is. So it's a folder in which to put your files.
S3 is a universal namespace. Well, all that means is that the names must be unique globally. So when you create a bucket, it can't just be named my-bucket because someone might have already taken that name.So when you create a bucket, the name has to be unique, reason being it's actually creating a web address. So if you were going to create a bucket in Northern Virginia, which is the default AWS region, and we call it a say crackthatcode, it will create a web address, crackthatcode.s3.amazonaws.com.
Uploading to S3: Now when you upload a file to S3,
you're will receive an HTTP 200 code, if the upload was successful, and that's
going to come back to your browser.
- you can use S3 to Amazon S3 console
- or you can use it via REST apis programmatically. Obviously there are security features which determine access, we will discuss them later.
Versioning in S3: S3 allows you to have multiple versions of your file. So you can have version one, which says “hi, I am going to crackthatcode”, and then you could go in and upload a file with the same name and change the file so it says “hi, I am going to crackthatcode version two”.then you'll have both versions of the object in S3 and you'll be able to do version control.So you could go back to a previous version.
Metadata: it just simply means data about
data. So it's data about the data that you're storing.So you may say, "This object
belongs to the finance department”, "this is our payroll spreadsheet."
Basically like a comment to describe the data that you are storing in the
bucket,
Now since you understand S3 . We will move forward and cover it in more details and its sub resources like:
- S3-Data Consistency Model
- Storage classes
- Access control lists
- Bucket policies
- Data consistency model
- Lifecycle Management
No comments