Contents contributed and discussions participated by 張旭

Databases and Collections - MongoDB Manual

docs.mongodb.com/...databases-and-collections

database mongodb

shared by 張旭 on 20 Apr 21

MongoDB stores data records as documents (specifically BSON documents) which are gathered together in collections.
A database stores one or more collections of documents.
In MongoDB, databases hold one or more collections of documents.
If a database does not exist, MongoDB creates the database when you first store data for that database.
The insertOne() operation creates both the database myNewDB and the collection myNewCollection1 if they do not already exist.
MongoDB stores documents in collections.
If a collection does not exist, MongoDB creates the collection when you first store data for that collection.
MongoDB provides the db.createCollection() method to explicitly create a collection with various options, such as setting the maximum size or the document validation rules.
By default, a collection does not require its documents to have the same schema;
To change the structure of the documents in a collection, such as add new fields, remove existing fields, or change the field values to a new type, update the documents to the new structure.
Collections are assigned an immutable UUID.
To retrieve the UUID for a collection, run either the listCollections command or the db.getCollectionInfos() method.

Introduction to MongoDB - MongoDB Manual

docs.mongodb.com/...introduction

database mongodb

shared by 張旭 on 20 Apr 21

MongoDB is a document database designed for ease of development and scaling
MongoDB offers both a Community and an Enterprise version
A record in MongoDB is a document, which is a data structure composed of field and value pairs.
MongoDB documents are similar to JSON objects.
The values of fields may include other documents, arrays, and arrays of documents.
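
As a rough illustration of the field-and-value structure described in the annotations above, a document can be written as a Python dict. Every field name and value below is made up for the example; nothing here comes from the MongoDB manual.

```python
# Hypothetical document: all field names and values are illustrative.
document = {
    "name": "sue",                                # plain field: value pair
    "age": 26,
    "groups": ["news", "sports"],                 # array value
    "address": {"city": "Taipei", "zip": "100"},  # embedded document
    "orders": [                                   # array of documents
        {"item": "book", "qty": 2},
        {"item": "pen", "qty": 10},
    ],
}


def total_quantity(doc):
    """Sum qty over the embedded order documents; no join is involved."""
    return sum(order["qty"] for order in doc["orders"])


print(total_quantity(document))
```

Because the orders live inside the document itself, reading them takes a single lookup rather than a join against a second table.
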
reduce need for expensive joins
MongoDB stores documents in collections.
Collections are analogous to tables in relational databases.
Read-only Views
Indexes support faster queries and can include keys from embedded documents and arrays.
MongoDB's replication facility, called replica set
A replica set is a group of MongoDB servers that maintain the same data set, providing redundancy and increasing data availability.
Sharding distributes data across a cluster of machines.
MongoDB supports creating zones of data based on the shard key.
MongoDB provides a pluggable storage engine API

MongoDB Performance - MongoDB Manual

docs.mongodb.com/...analyzing-mongodb-performance

database mongodb performance

shared by 張旭 on 20 Apr 21

MongoDB uses a locking system to ensure data set consistency. If certain operations are long-running or a queue forms, performance will degrade as requests and operations wait for the lock.
performance limitations as a result of inadequate or inappropriate indexing strategies, or as a consequence of poor schema design patterns.
performance issues may be temporary and related to abnormal traffic load.
Lock-related slowdowns can be intermittent.
If globalLock.currentQueue.total is consistently high, then there is a chance that a large number of requests are waiting for a lock.
If globalLock.totalTime is high relative to uptime, the database has existed in a lock state for a significant amount of time.
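
The "consistently high" wording for globalLock.currentQueue.total suggests looking at several samples rather than one reading. A minimal sketch, assuming the samples are serverStatus-like dicts collected elsewhere; the function name and the threshold of 10 are hypothetical choices, not values from the manual.

```python
def queue_consistently_high(samples, threshold=10):
    """Return True when every sample shows at least `threshold` queued operations.

    `samples` is a list of dicts shaped like the globalLock part of
    serverStatus output. The threshold is an illustrative assumption.
    """
    totals = [s["globalLock"]["currentQueue"]["total"] for s in samples]
    return bool(totals) and all(t >= threshold for t in totals)


# Hand-written sample data for the example.
steady = [{"globalLock": {"currentQueue": {"total": n}}} for n in (14, 22, 31)]
spiky = [{"globalLock": {"currentQueue": {"total": n}}} for n in (0, 40, 1)]

print(queue_consistently_high(steady))
print(queue_consistently_high(spiky))
```

A single spike does not trip the check, which matches the point that lock-related slowdowns can be intermittent.
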
For write-heavy applications, deploy sharding and add one or more shards to a sharded cluster to distribute load among mongod instances.
Unless constrained by system-wide limits, the maximum number of incoming connections supported by MongoDB is configured with the maxIncomingConnections setting.
When logLevel is set to 0, MongoDB records slow operations to the diagnostic log at a rate determined by slowOpSampleRate.
At higher logLevel settings, all operations appear in the diagnostic log regardless of their latency with the following exception
Full Time Diagnostic Data Collection (FTDC) mechanism. FTDC data files are compressed, are not human-readable, and inherit the same file access permissions as the MongoDB data files.
mongod processes store FTDC data files in a diagnostic.data directory under the instance's storage.dbPath.

張旭 on 20 Apr 21

"MongoDB uses a locking system to ensure data set consistency. If certain operations are long-running or a queue forms, performance will degrade as requests and operations wait for the lock."

[Elasticsearch] Distributed Characteristics & How Distributed Search Works | 小信豬的原始部落

godleon.github.io/...icsearch-distributed-mechanism

elasticsearch database

shared by 張旭 on 17 Apr 21

Scale storage capacity horizontally
Data HA: if a node goes down, no data is lost
To check the status of the nodes in a cluster, use the GET /_cat/nodes API
Decides which data node each shard is allocated to
Configure multiple master nodes for the cluster
As soon as the elected master node is found to have a problem, a new master node is elected
Every node starts as a master-eligible node by default; set node.master: false to turn this default off
The node that handles a request is called the Coordinating Node; its job is to forward the request to the appropriate nodes
Every node is a Coordinating Node by default
A coordinating node can receive and handle a search request directly; the request does not need to be relayed through the master node
A node that can store data; every node is a data node by default after startup, and setting node.data: false disables the data node role
The master node decides how shards are distributed across the data nodes
Every node keeps a copy of the cluster state
Only the master can modify the cluster state, and it is responsible for syncing it to the other nodes
Every node records detailed information about its own state
The shard is the foundation of Elasticsearch's distributed storage; there are primary shards and replica shards
Every shard is a Lucene instance
Primary shards spread an index's data across multiple data nodes, which is how storage scales horizontally
The number of primary shards is fixed when the index is created and cannot be changed afterwards; changing it requires a reindex
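
One way to see why the primary shard count is fixed: a document's shard is derived from a hash of its routing value modulo the number of primary shards. The sketch below illustrates that idea only. It uses zlib.crc32 as a stand-in hash, which is an assumption for the example and not the hash Elasticsearch uses, and the document IDs are invented.

```python
import zlib


def shard_for(doc_id, num_primary_shards):
    """Pick a shard as hash(routing value) modulo the primary shard count.

    crc32 is only a stand-in hash for this illustration.
    """
    return zlib.crc32(doc_id.encode("utf-8")) % num_primary_shards


# Hypothetical document IDs.
ids = [f"doc-{i}" for i in range(1000)]

# Count how many documents would land on a different shard if the
# primary shard count changed from 5 to 6.
moved = sum(1 for d in ids if shard_for(d, 5) != shard_for(d, 6))
print(f"{moved} of {len(ids)} documents would map to a different shard")
```

Since most documents would be looked up on the wrong shard after such a change, the data has to be reindexed instead.
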
When a primary shard is lost, a replica shard can be promoted to primary to keep the data intact
The number of replica shards can be adjusted dynamically, so that every data node can hold a complete copy of the data
Starting with ES 7.0, the default is 1 primary shard and 1 replica shard per index
Configuring too many replica shards lowers the cluster's overall write performance
A replica shard must be allocated to a different data node from its primary shard
All primary shards may sit on the same data node
GET _cluster/health/<target> returns the current health status of the cluster
Yellow: primary shards are allocated normally, but there is a problem allocating replica shards
GET /_cat/shards/<target> returns the current shard status
The replica shards cannot be allocated, so the cluster health is yellow
If you are worried that rebooting a machine will trigger failover, you can delay replication for a while (by adjusting the index.unassigned.node_left.delayed_timeout setting) to avoid needless data copying; this feature is called delayed allocation
A red cluster means a primary shard has been lost, which affects reads and writes.
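
The yellow and red rules in the annotations above can be sketched as a small function. The shard records are hand-written stand-ins loosely modelled on what GET /_cat/shards reports; the field names, and the green case for a fully allocated cluster, are assumptions added for the example.

```python
def cluster_health(shards):
    """Derive a health colour from shard allocation state.

    Each shard is a dict with 'primary' (bool) and 'state'
    ('STARTED' or 'UNASSIGNED'); this shape is an illustrative assumption.
    """
    if any(s["primary"] and s["state"] != "STARTED" for s in shards):
        return "red"     # at least one primary shard is missing
    if any(not s["primary"] and s["state"] != "STARTED" for s in shards):
        return "yellow"  # primaries are fine, some replica is not allocated
    return "green"


# A single-node cluster: the replica cannot share a node with its primary.
single_node = [
    {"index": "logs", "primary": True, "state": "STARTED"},
    {"index": "logs", "primary": False, "state": "UNASSIGNED"},
]
print(cluster_health(single_node))
```
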
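If the node comes back, it recovers the data that had not yet been written from the translog
Once the index settings are in place, the number of primary shards cannot be changed freely
Sending requests directly to the master node is not recommended; it works, but a large volume of requests sent to the master can cause performance problems
The shard is the smallest unit of work in ES
A shard is a Lucene index
Writing the contents of the Index Buffer into a Segment is called a Refresh
Once a document has been refreshed into a segment, it can be found by searches
During a refresh, the segment is first written to cache so that it can be queried
When a document is indexed it is also written to the transaction log, which is written to disk by default
Every shard has its own transaction log
Because the transaction log is written to disk, a node recovering from a failure reads the transaction log first to restore its data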
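
The relationship between the index buffer, segments, refresh, and the transaction log can be modelled with a toy class. This is a simplified sketch for intuition only: ToyShard and its methods are hypothetical names, everything lives in memory, and real Lucene segments and translog files work very differently.

```python
class ToyShard:
    """In-memory model of the indexing path described above."""

    def __init__(self):
        self.index_buffer = []  # indexed but not yet searchable
        self.segments = []      # searchable after a refresh
        self.translog = []      # stand-in for the on-disk transaction log

    def index(self, doc):
        # The document goes to the translog and the index buffer together.
        self.translog.append(doc)
        self.index_buffer.append(doc)

    def refresh(self):
        # Move buffered documents into a new segment, making them searchable.
        if self.index_buffer:
            self.segments.append(list(self.index_buffer))
            self.index_buffer.clear()

    def search(self, predicate):
        return [d for seg in self.segments for d in seg if predicate(d)]

    @classmethod
    def recover(cls, translog):
        # Rebuild a shard after a failure by replaying the translog.
        shard = cls()
        for doc in translog:
            shard.index(doc)
        shard.refresh()
        return shard


shard = ToyShard()
shard.index({"id": 1})
before = shard.search(lambda d: True)  # nothing is searchable before refresh
shard.refresh()
after = shard.search(lambda d: True)
print(before, after)
```
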

Database Profiler - MongoDB Manual

docs.mongodb.com/...manage-the-database-profiler

database mongodb

shared by 張旭 on 16 Apr 21

The database profiler collects detailed information about Database Commands executed against a running mongod instance.
The profiler writes all the data it collects to a system.profile collection, a capped collection in each profiled database.
db.setProfilingLevel(2)
The slowms and sampleRate profiling settings are global. When set, these settings affect all databases in your process.
db.setProfilingLevel(1, { slowms: 20 })
db.setProfilingLevel(0, { slowms: 20 })
show profile
The system.profile collection is a capped collection with a default size of 1 megabyte.
By default, sampleRate is set to 1.0, meaning all slow operations are profiled.
When logLevel is set to 0, MongoDB records slow operations to the diagnostic log at a rate determined by slowOpSampleRate.
The slowms field indicates the operation time threshold, in milliseconds, beyond which operations are considered slow.
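
How the profiling level, slowms, and sampleRate interact can be sketched as a decision function. This is a simplified model under stated assumptions, not the server's actual logic: should_profile is a hypothetical name, the 100 ms default for slowms is taken as an assumption, and sampling is modelled as a plain random draw.

```python
import random


def should_profile(level, duration_ms, slowms=100, sample_rate=1.0, rng=random.random):
    """Decide whether an operation would be written to system.profile.

    level 0: profiler off; level 1: only operations slower than slowms,
    sampled at sample_rate; level 2: every operation.
    """
    if level == 0:
        return False
    if level == 2:
        return True
    if level != 1:
        raise ValueError("profiling level must be 0, 1, or 2")
    if duration_ms <= slowms:
        return False
    # rng() returns a float in [0.0, 1.0), so a rate of 1.0 keeps every slow op.
    return rng() < sample_rate


print(should_profile(1, duration_ms=50, slowms=20))
print(should_profile(1, duration_ms=10, slowms=20))
```

With level 1 and slowms set to 20, as in db.setProfilingLevel(1, { slowms: 20 }), a 50 ms operation is profiled and a 10 ms operation is not.
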
You cannot enable profiling on a mongos instance.
profiler logs information about database operations in the system.profile collection.

MongoDB Performance Tuning: Everything You Need to Know - Stackify

stackify.com/mongodb-performance-tuning

database mongodb

shared by 張旭 on 15 Apr 21

db.serverStatus().globalLock
db.serverStatus().locks
globalLock.currentQueue.total: This number can indicate a possible concurrency issue if it’s consistently high. This can happen if a lot of requests are waiting for a lock to be released.
globalLock.totalTime: If this is high relative to the total database uptime, the database has been in a lock state for too long.
Unlike relational databases such as MySQL or PostgreSQL, MongoDB uses JSON-like documents for storing data.
Databases operate in an environment that consists of numerous reads, writes, and updates.
When a lock occurs, no other operation can read or modify the data until the operation that initiated the lock is finished.
locks.deadlockCount: Number of times the lock acquisitions have encountered deadlocks
Is the database frequently locking from queries? This might indicate issues with the schema design, query structure, or system architecture.
From version 3.2 on, WiredTiger is the default storage engine.
MMAPv1 locks whole collections, not individual documents.
WiredTiger performs locking at the document level.
When the MMAPv1 storage engine is in use, MongoDB will use memory-mapped files to store data.
All available memory will be allocated for this usage if the data set is large enough.
db.serverStatus().mem
mem.resident: Roughly equivalent to the amount of RAM in megabytes that the database process uses
If mem.resident exceeds the value of system memory and there’s a large amount of unmapped data on disk, we’ve most likely exceeded system capacity.
If the value of mem.mapped is greater than the amount of system memory, some operations will experience page faults.
The WiredTiger storage engine is a significant improvement over MMAPv1 in performance and concurrency.
By default, MongoDB will reserve the larger of 50 percent of (RAM minus 1 GB) or 256 MB for the WiredTiger data cache.
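
For a feel of the numbers, the default WiredTiger cache size documented for MongoDB 3.4 and later is the larger of 50 percent of (RAM minus 1 GB) or 256 MB. A minimal sketch of that arithmetic; the function name is hypothetical, and treating 1 GB as 1024^3 bytes is an assumption of this example.

```python
GIB = 1024 ** 3
MIB = 1024 ** 2


def default_wiredtiger_cache_bytes(ram_bytes):
    """Larger of 50% of (RAM - 1 GiB) and 256 MiB, in bytes."""
    return max((ram_bytes - GIB) // 2, 256 * MIB)


for ram_gib in (1, 4, 16):
    cache = default_wiredtiger_cache_bytes(ram_gib * GIB)
    print(f"{ram_gib} GiB RAM -> {cache / GIB:.2f} GiB cache")
```

On small machines the 256 MB floor applies, so the cache is well under half of the installed memory.
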
wiredTiger.cache.bytes currently in the cache – This is the size of the data currently in the cache.
wiredTiger.cache.tracked dirty bytes in the cache – This is the size of the dirty data in the cache.
we can look at the wiredTiger.cache.bytes read into cache value for read-heavy applications. If this value is consistently high, increasing the cache size may improve overall read performance.
check whether the application is read-heavy. If it is, increase the size of the replica set and distribute the read operations to secondary members of the set.
write-heavy, use sharding within a sharded cluster to distribute the load.
Replication is the propagation of data from one node to another
Replica sets handle this replication.
Sometimes, data isn’t replicated as quickly as we’d like.
a particularly thorny problem if the lag between a primary and secondary node is high and the secondary becomes the primary
use the db.printSlaveReplicationInfo() or the rs.printSlaveReplicationInfo() command to see the status of a replica set from the perspective of the secondary member of the set.
shows how far behind the secondary members are from the primary. This number should be as low as possible.
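
The lag figure described above is, in essence, the gap between the time of the last operation applied on the primary and on each secondary. A minimal sketch of that calculation, assuming the timestamps have already been collected; the member names and times are invented for the example.

```python
from datetime import datetime, timezone


def replication_lag_seconds(primary_optime, secondary_optimes):
    """Return each secondary's lag behind the primary, in seconds.

    A secondary that appears ahead (clock skew) is reported as 0.
    """
    return {
        name: max(0.0, (primary_optime - optime).total_seconds())
        for name, optime in secondary_optimes.items()
    }


# Hypothetical timestamps.
primary = datetime(2021, 4, 15, 12, 0, 30, tzinfo=timezone.utc)
secondaries = {
    "node-b": datetime(2021, 4, 15, 12, 0, 28, tzinfo=timezone.utc),
    "node-c": datetime(2021, 4, 15, 11, 59, 0, tzinfo=timezone.utc),
}
lags = replication_lag_seconds(primary, secondaries)
print(lags)
```
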
monitor this metric closely.
watch for any spikes in replication delay.
Always investigate these issues to understand the reasons for the lag.
One member of the replica set is primary. All others are secondary.
it’s not normal for nodes to change back and forth between primary and secondary.
use the profiler to gain a deeper understanding of the database’s behavior.
Enabling the profiler can affect system performance, due to the additional activity.

張旭 on 15 Apr 21

"globalLock.currentQueue.total: This number can indicate a possible concurrency issue if it's consistently high. This can happen if a lot of requests are waiting for a lock to be released."

GitLab Auto DevOps Explained: Automatic Deployment Without Even a Config File?! | 五倍紅寶石・專業程式教育

5xruby.tw/...gitlab-auto-devops

devops gitlab auto

shared by 張旭 on 15 Apr 21

A K8S cluster; Auto DevOps will deploy the website to this cluster
A wildcard DNS record is needed so that sites deployed to this environment get a domain name
A GitLab Runner that can run Docker; it is what executes the CI/CD process.
Auto DevOps is really just an official, ready-made .gitlab-ci.yml. In a project with Auto DevOps enabled, if no .gitlab-ci.yml file is found, the official .gitlab-ci.yml is used to run the CI/CD process.
A Pod is the smallest deployable unit in K8S. A Pod consists of one or more Containers, and the Containers in the same Pod share network resources.
Every Pod has its own YAML file describing the Image the Pod uses, the Ports it connects to, and similar information.
Nodes come in two kinds: Worker Nodes and Master Nodes
Helm uses parameters and templates, so a template can be reused by changing only the parameters.
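
The parameter-plus-template idea can be illustrated without Helm itself. The sketch below uses Python's string.Template as a stand-in; Helm actually uses Go templates and values files, and the keys, registry host, and tag shown here are invented for the example.

```python
from string import Template

# One template, reused for every environment.
MANIFEST = Template(
    "image: $repository:$tag\n"
    "replicas: $replicas\n"
)


def render(values):
    """Fill the shared template with one environment's parameters."""
    return MANIFEST.substitute(values)


# Hypothetical parameter sets; only the values differ.
staging = render({"repository": "registry.example.com/app", "tag": "abc123", "replicas": 1})
production = render({"repository": "registry.example.com/app", "tag": "abc123", "replicas": 3})
print(staging)
print(production)
```
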
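To get CI/CD we put .gitlab-ci.yml in the project's root directory. GitLab generates a CI/CD Pipeline according to the settings in .gitlab-ci.yml. Each Pipeline may contain multiple Jobs, so a GitLab Runner is needed to execute those Jobs and report the results back to GitLab, letting it know whether each Job ran successfully.
Packaging the project into a Docker Image and running helm operations both happen inside a Container
A CI/CD Pipeline is made up of stages and jobs. Stages are ordered: the next stage starts only after the previous one has finished.
Each stage contains one or more Jobs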
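
The ordering rule for stages can be sketched as a small loop. This is a toy model, not GitLab's scheduler: run_pipeline and the stage and job names are hypothetical, jobs within a stage run one after another here for simplicity, and a failed stage stops the pipeline.

```python
def run_pipeline(stages, jobs, run_job):
    """Run stages in order; stop before the next stage if any job failed.

    stages: ordered list of stage names
    jobs: dict mapping stage name -> list of job names
    run_job: callable taking a job name and returning True on success
    """
    executed = []
    for stage in stages:
        results = []
        for job in jobs.get(stage, []):
            executed.append((stage, job))
            results.append(run_job(job))
        if not all(results):
            return executed, f"failed at stage {stage!r}"
    return executed, "passed"


# Hypothetical pipeline in which the lint job fails.
stages = ["build", "test", "deploy"]
jobs = {"build": ["docker-build"], "test": ["rspec", "lint"], "deploy": ["helm-upgrade"]}
executed, status = run_pipeline(stages, jobs, run_job=lambda job: job != "lint")
print(status)
print(executed)
```
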
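Auto DevOps also makes heavy use of this kind of job that runs inside a specified Container.
can pass health checks
If the project is private, also watch out for permissions on the Container Registry
The wildcard DNS record you applied for
Auto DevOps also offers options that allow a degree of customization just by setting environment variables
Pay particular attention to whether the namespace is set correctly; otherwise the data will not be found
With Auto DevOps, if you want to customize further, beyond what changing GitLab environment variables can achieve, you still have to go back to the .gitlab-ci.yml configuration file
Build the Image from a Dockerfile in a Docker-in-Docker environment
Deploy the chart to K8S with helm upgrade
GitLab CI environment variables come from three main sources. From highest to lowest priority: variables defined in the Settings > CI/CD interface, environment variables defined in .gitlab-ci.yml, and GitLab's predefined environment variables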
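
The three-level precedence listed above can be modelled with a ChainMap, which looks a key up in its first mapping before falling through to later ones. This sketch covers only the three sources named in the annotation; effective_variables and all of the values are hypothetical.

```python
from collections import ChainMap


def effective_variables(settings_vars, yaml_vars, predefined_vars):
    """Merge variables so that Settings > CI/CD beats .gitlab-ci.yml,
    which in turn beats the predefined variables."""
    return dict(ChainMap(settings_vars, yaml_vars, predefined_vars))


# Hypothetical values for illustration.
merged = effective_variables(
    settings_vars={"REPLICAS": "3"},
    yaml_vars={"REPLICAS": "1", "POSTGRES_ENABLED": "false"},
    predefined_vars={"CI_COMMIT_REF_SLUG": "main", "POSTGRES_ENABLED": "true"},
)
print(merged["REPLICAS"], merged["POSTGRES_ENABLED"], merged["CI_COMMIT_REF_SLUG"])
```
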
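To package the project into a Docker Image, first add a Dockerfile to the project
The Auto DevOps approach: package the project with the Image provided by herokuish
The docker command is not available in the Runner's environment, so here a Docker Container is started and the docker command is run inside it.
$CI_COMMIT_SHA and $CI_COMMIT_BEFORE_SHA are both GitLab predefined environment variables, holding the SHA of this commit and of the previous commit.
dind starts the docker daemon directly; in addition, dind automatically generates TLS certificates
To run Docker inside a Docker Container, the Host's Docker API is shared with the Container.
docker:stable has the executables needed to run docker, and it also includes the program that starts docker (the docker daemon), but the Container's entrypoint is sh
docker:dind inherits from docker:stable, and its entrypoint is the script that starts docker; it also takes care of generating the TLS certificates
The Container needs to connect to the Docker API on the Host, but the failing connection was looking for http://docker:2375. dind is no longer being used as a service here; Docker runs directly inside it, so the connection should go through unix:///var/run/docker.sock. Changing the DOCKER_HOST environment variable from tcp://docker:2375 to an empty string lets docker use its default connection, and then it works!
auto-deploy preparation: run helm init to set up the helm project, set tiller to run in the background, and set the cluster's namespace
auto-deploy deploy: use helm upgrade to deploy the chart to K8S, and use --set to set the parameters injected into the template
set -x makes the shell print each command before executing it.
Use helm repo list to see which Chart Repositories are currently registered
helm fetch gitlab/auto-deploy-app --untar
nohup lets a job keep running after you disconnect or log out of the system
When CI_APPLICATION_REPOSITORY is not explicitly set, image_repository takes the value of the predefined environment variables CI_REGISTRY_IMAGE/CI_COMMIT_REF_SLUG
A:-B (as in ${A:-B}) means: use A if it is set and non-empty, otherwise use B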
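
The fallback behaviour of the shell's ${A:-B} expansion, applied to the image repository default described above, can be mimicked as follows. The helper names and the registry values are hypothetical; only the variable names CI_APPLICATION_REPOSITORY, CI_REGISTRY_IMAGE, and CI_COMMIT_REF_SLUG come from the annotations.

```python
def expand_default(env, name, default):
    """Mimic ${name:-default}: use env[name] when set and non-empty."""
    value = env.get(name)
    return value if value else default


def image_repository(env):
    """Fall back to CI_REGISTRY_IMAGE/CI_COMMIT_REF_SLUG when
    CI_APPLICATION_REPOSITORY is not provided."""
    fallback = f"{env['CI_REGISTRY_IMAGE']}/{env['CI_COMMIT_REF_SLUG']}"
    return expand_default(env, "CI_APPLICATION_REPOSITORY", fallback)


# Hypothetical environment for illustration.
env = {"CI_REGISTRY_IMAGE": "registry.example.com/group/app", "CI_COMMIT_REF_SLUG": "main"}
print(image_repository(env))
print(image_repository({**env, "CI_APPLICATION_REPOSITORY": "registry.example.com/custom"}))
```

An empty string counts as unset here, which matches the colon form of the shell expansion.
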
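The hardest part of studying Auto DevOps is that so many tools are integrated together; it is hard to tell how they relate to each other, and when something breaks you do not know where to start looking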