If I want to populate a DynamoDB table with a huge list of domain names, there are two parts to every domain name (example.com), the host name (example) and the top-level domain (com).
I am trying to determine how best to store this using a hash and range primary key. According to the documentation, you want to spread out the hash part of the key so I am thinking that should use the full domain name (example.com) and the range can be the updated date of the domain. This will allow me to query all domains updated after a specific date. Does this sound like a good design?
To get the most out of DynamoDB throughput, create tables where the hash key element has a large number of distinct values, and values are requested fairly uniformly, as randomly as possible.
Aucun commentaire:
Enregistrer un commentaire