일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | ||
6 | 7 | 8 | 9 | 10 | 11 | 12 |
13 | 14 | 15 | 16 | 17 | 18 | 19 |
20 | 21 | 22 | 23 | 24 | 25 | 26 |
27 | 28 | 29 | 30 |
- U플러스샵
- 네셔널지오그래픽 백팩
- 아이폰6S플러스
- 아우디A5
- 애플
- 지얄오크
- 경량패딩
- 코오롱 헤스티아
- 스타벅스 다이어리
- 영어 감사표현
- 아우디 A5 스포트 백
- iPhone 12 mini
- AUDI A5 SPORTBACK
- 아이폰12 미니
- 맥미니 M1
- MSI #MSI Prestige #MSI 프레스티지 #MSI 노트북 #노트북
- 아우디 A6 40TDI
- M1칩
- iphone6SPlus
- 아이패드
- GA-2100
- 루틴 다이어리
- 남자 클러치백
- 구찌 클러치백
- 비츠 솔로3
- mysql
- 45TFSI
- 아이폰6s
- 아이패드용 키보드
- 맥미니 2020
- Today
- Total
누눕's blog
Hbase Data Model 본문
Data Model
In HBase, data is stored in tables, which have rows and columns. This is a terminology overlap with relational databases (RDBMSs), but this is not a helpful analogy. Instead, it can be helpful to think of an HBase table as a multi-dimensional map.
HBase Data Model Terminology
Table
An HBase table consists of multiple rows.
Row
A row in HBase consists of a row key and one or more columns with values associated with them. Rows are sorted alphabetically by the row key as they are stored. For this reason, the design of the row key is very important. The goal is to store data in such a way that related rows are near each other. A common row key pattern is a website domain. If your row keys are domains, you should probably store them in reverse (org.apache.www, org.apache.mail, org.apache.jira). This way, all of the Apache domains are near each other in the table, rather than being spread out based on the first letter of the subdomain.
Column
A column in HBase consists of a column family and a column qualifier, which are delimited by a : (colon) character.
Column Family
Column families physically colocate a set of columns and their values, often for performance reasons. Each column family has a set of storage properties, such as whether its values should be cached in memory, how its data is compressed or its row keys are encoded, and others. Each row in a table has the same column families, though a given row might not store anything in a given column family.
Column Qualifier
A column qualifier is added to a column family to provide the index for a given piece of data. Given a column family content, a column qualifier might be content:html, and another might be content:pdf. Though column families are fixed at table creation, column qualifiers are mutable and may differ greatly between rows.
Cell
A cell is a combination of row, column family, and column qualifier, and contains a value and a timestamp, which represents the value’s version.
Timestamp
A timestamp is written alongside each value, and is the identifier for a given version of a value. By default, the timestamp represents the time on the RegionServer when the data was written, but you can specify a different timestamp value when you put data into the cell.
참조
http://hbase.apache.org/book.html#datamodel
Apache HBase ™ Reference Guide
Data comes in many sizes, and saving all of your data in HBase, including binary data such as images and documents, is ideal. While HBase can technically handle binary objects with cells that are larger than 100 KB in size, HBase’s normal read and write pa
hbase.apache.org
'IT 기록 > Hadoop' 카테고리의 다른 글
하둡(Hadoop) 설치 (0) | 2020.03.31 |
---|---|
HBase 명령어 모음 (0) | 2020.03.26 |