Group items tagged

4 system monitoring tools for Linux | Opensource.com - 0 views

opensource.com/...source-tools-system-monitoring

linux monitoring sysadmin top

shared by crazylion lee on 01 Mar 16 - No Cached

crazylion lee on 01 Mar 16

"4 open source tools for Linux system monitoring"

Auto DevOps | GitLab - 0 views

docs.gitlab.com/...index.html

gitlab auto devops CI deploy

shared by 張旭 on 02 Aug 19 - No Cached

Auto DevOps provides pre-defined CI/CD configuration which allows you to automatically detect, build, test, deploy, and monitor your applications
Just push your code and GitLab takes care of everything else.
Auto DevOps will be automatically disabled on the first pipeline failure.
Your project will continue to use an alternative CI/CD configuration file if one is found
Auto DevOps works with any Kubernetes cluster;
using the Docker or Kubernetes executor, with privileged mode enabled.
Base domain (needed for Auto Review Apps and Auto Deploy)
Kubernetes (needed for Auto Review Apps, Auto Deploy, and Auto Monitoring)
Prometheus (needed for Auto Monitoring)
scrape your Kubernetes cluster.
project level as a variable: KUBE_INGRESS_BASE_DOMAIN
A wildcard DNS A record matching the base domain(s) is required
Once set up, all requests will hit the load balancer, which in turn will route them to the Kubernetes pods that run your application(s).
review/ (every environment starting with review/)
staging
production
need to define a separate KUBE_INGRESS_BASE_DOMAIN variable for all the above based on the environment.
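As a rough illustration of per-environment scoping, the sketch below picks a base domain for an environment name. The domains and the SCOPED_BASE_DOMAINS table are hypothetical, and the first-match rule is an assumption made for this example; GitLab's own scope resolution may differ.

```python
from fnmatch import fnmatchcase
from typing import Optional

# Hypothetical KUBE_INGRESS_BASE_DOMAIN values, one per environment scope.
SCOPED_BASE_DOMAINS = {
    "review/*": "review.example.com",
    "staging": "staging.example.com",
    "production": "example.com",
}


def base_domain_for(environment: str) -> Optional[str]:
    """Return the base domain of the first scope matching the environment name."""
    for scope, domain in SCOPED_BASE_DOMAINS.items():
        if fnmatchcase(environment, scope):
            return domain
    return None  # no scoped variable applies to this environment


if __name__ == "__main__":
    for env in ("review/feature-login", "staging", "production"):
        print(env, "->", base_domain_for(env))
```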
Continuous deployment to production: Enables Auto Deploy with master branch directly deployed to production.
Continuous deployment to production using timed incremental rollout
Automatic deployment to staging, manual deployment to production
Auto Build creates a build of the application using an existing Dockerfile or Heroku buildpacks.
If a project’s repository contains a Dockerfile, Auto Build will use docker build to create a Docker image.
Each buildpack requires certain files to be in your project’s repository for Auto Build to successfully build your application.
Auto Test automatically runs the appropriate tests for your application using Herokuish and Heroku buildpacks by analyzing your project to detect the language and framework.
Auto Code Quality uses the Code Quality image to run static analysis and other code checks on the current code.
Static Application Security Testing (SAST) uses the SAST Docker image to run static analysis on the current code and checks for potential security issues.
Dependency Scanning uses the Dependency Scanning Docker image to run analysis on the project dependencies and checks for potential security issues.
License Management uses the License Management Docker image to search the project dependencies for their license.
Vulnerability Static Analysis for containers uses Clair to run static analysis on a Docker image and checks for potential security issues.
Review Apps are temporary application environments based on the branch’s code so developers, designers, QA, product managers, and other reviewers can actually see and interact with code changes as part of the review process. Auto Review Apps create a Review App for each branch. Auto Review Apps will deploy your app to your Kubernetes cluster only. When no cluster is available, no deployment will occur.
The Review App will have a unique URL based on the project ID, the branch or tag name, and a unique number, combined with the Auto DevOps base domain.
Review apps are deployed using the auto-deploy-app chart with Helm, which can be customized.
Your apps should not be manipulated outside of Helm (using Kubernetes directly).
Dynamic Application Security Testing (DAST) uses the popular open source tool OWASP ZAProxy to perform an analysis on the current code and checks for potential security issues.
Auto Browser Performance Testing utilizes the Sitespeed.io container to measure the performance of a web page.
add the paths to a file named .gitlab-urls.txt in the root directory, one per line.
After a branch or merge request is merged into the project’s default branch (usually master), Auto Deploy deploys the application to a production environment in the Kubernetes cluster, with a namespace based on the project name and unique project ID
Auto Deploy doesn’t include deployments to staging or canary by default, but the Auto DevOps template contains job definitions for these tasks if you want to enable them.
Apps are deployed using the auto-deploy-app chart with Helm.
For internal and private projects a GitLab Deploy Token will be automatically created, when Auto DevOps is enabled and the Auto DevOps settings are saved.
If the GitLab Deploy Token cannot be found, CI_REGISTRY_PASSWORD is used. Note that CI_REGISTRY_PASSWORD is only valid during deployment.
If present, DB_INITIALIZE will be run as a shell command within an application pod as a helm post-install hook.
a post-install hook means that if any deploy succeeds, DB_INITIALIZE will not be processed thereafter.
DB_MIGRATE will be run as a shell command within an application pod as a helm pre-upgrade hook.
- 張旭 on 02 Aug 19
  
  If the project type is different, you have to look up how the buildpack invokes that command, for example a Laravel migration.
- 張旭 on 02 Aug 19
  
  If the image is built from your own Dockerfile, it looks like you can ignore the buildpacks approach.
Once your application is deployed, Auto Monitoring makes it possible to monitor your application’s server and response metrics right out of the box.
annotate the NGINX Ingress deployment to be scraped by Prometheus using prometheus.io/scrape: "true" and prometheus.io/port: "10254"
If you are also using Auto Review Apps and Auto Deploy and choose to provide your own Dockerfile, make sure you expose your application to port 5000 as this is the port assumed by the default Helm chart.
While Auto DevOps provides great defaults to get you started, you can customize almost everything to fit your needs; from custom buildpacks, to Dockerfiles, Helm charts, or even copying the complete CI/CD configuration into your project to enable staging and canary deployments, and more.
If your project has a Dockerfile in the root of the project repo, Auto DevOps will build a Docker image based on the Dockerfile rather than using buildpacks.
Auto DevOps uses Helm to deploy your application to Kubernetes.
Bundled chart - If your project has a ./chart directory with a Chart.yaml file in it, Auto DevOps will detect the chart and use it instead of the default one.
Create a project variable AUTO_DEVOPS_CHART with the URL of a custom chart to use or create two project variables AUTO_DEVOPS_CHART_REPOSITORY with the URL of a custom chart repository and AUTO_DEVOPS_CHART with the path to the chart.
make use of the HELM_UPGRADE_EXTRA_ARGS environment variable to override the default values in the values.yaml file in the default Helm chart.
specify the use of a custom Helm chart per environment by scoping the environment variable to the desired environment.
- 張旭 on 02 Aug 19
  
  Auto DevOps is essentially a ready-made .gitlab-ci.yml that someone else has already written for you.
Your additions will be merged with the Auto DevOps template using the behaviour described for include
copy and paste the contents of the Auto DevOps template into your project and edit this as needed.
In order to support applications that require a database, PostgreSQL is provisioned by default.
Set up the replica variables using a project variable and scale your application by just redeploying it!
You should not scale your application using Kubernetes directly.
Some applications need to define secret variables that are accessible by the deployed application.
Auto DevOps detects variables where the key starts with K8S_SECRET_ and makes these prefixed variables available to the deployed application as environment variables.
Auto DevOps pipelines will take your application secret variables to populate a Kubernetes secret.
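A minimal sketch of the prefix convention, not GitLab's actual implementation. It assumes the K8S_SECRET_ prefix is stripped before the value reaches the application, which is how the documentation's example reads; the variable names and values are hypothetical.

```python
from typing import Dict

PREFIX = "K8S_SECRET_"


def application_secrets(ci_variables: Dict[str, str]) -> Dict[str, str]:
    """Keep only prefixed CI/CD variables and strip the prefix (assumed behaviour)."""
    return {
        key[len(PREFIX):]: value
        for key, value in ci_variables.items()
        if key.startswith(PREFIX) and len(key) > len(PREFIX)
    }


if __name__ == "__main__":
    # Hypothetical project variables.
    variables = {
        "K8S_SECRET_RAILS_MASTER_KEY": "s3cr3t",
        "STAGING_ENABLED": "1",
    }
    print(application_secrets(variables))
```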
Environment variables are generally considered immutable in a Kubernetes pod.
if you update an application secret without changing any code then manually create a new pipeline, you will find that any running application pods will not have the updated secrets.
Variables with multiline values are not currently supported
The normal behavior of Auto DevOps is to use Continuous Deployment, pushing automatically to the production environment every time a new pipeline is run on the default branch.
If STAGING_ENABLED is defined in your project (e.g., set STAGING_ENABLED to 1 as a CI/CD variable), then the application will be automatically deployed to a staging environment, and a production_manual job will be created for you when you’re ready to manually deploy to production.
If CANARY_ENABLED is defined in your project (e.g., set CANARY_ENABLED to 1 as a CI/CD variable), then two manual jobs will be created: canary, which deploys the application to the canary environment, and production_manual, which you use when you’re ready to manually deploy to production.
If INCREMENTAL_ROLLOUT_MODE is set to manual in your project, then instead of the standard production job, 4 different manual jobs will be created: rollout 10%, rollout 25%, rollout 50%, and rollout 100%.
The percentage is based on the REPLICAS variable and defines the number of pods you want to have for your deployment.
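To make the percentage arithmetic concrete, here is a small sketch that maps each rollout step to a pod count for a given REPLICAS value. Rounding up and keeping at least one pod are assumptions of this example, not documented GitLab behaviour.

```python
from typing import Dict

ROLLOUT_STEPS = (10, 25, 50, 100)  # the four manual rollout jobs


def pods_for_step(replicas: int, percentage: int) -> int:
    """Pods on the new version at one step (assumed: round up, minimum 1)."""
    return max(1, (replicas * percentage + 99) // 100)


def rollout_plan(replicas: int) -> Dict[str, int]:
    """Map each rollout job name to the number of pods it would run."""
    return {f"rollout {step}%": pods_for_step(replicas, step) for step in ROLLOUT_STEPS}


if __name__ == "__main__":
    print(rollout_plan(10))
```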
To start a job, click on the play icon next to the job’s name.
Once you get to 100%, you cannot scale down, and you’d have to roll back by redeploying the old version using the rollback button in the environment page.
With INCREMENTAL_ROLLOUT_MODE set to manual and with STAGING_ENABLED
not all buildpacks support Auto Test yet
When a project has been marked as private, GitLab’s Container Registry requires authentication when downloading containers.
Authentication credentials will be valid while the pipeline is running, allowing for a successful initial deployment.
After the pipeline completes, Kubernetes will no longer be able to access the Container Registry.
We strongly advise using GitLab Container Registry with Auto DevOps in order to simplify configuration and prevent any unforeseen issues.

Monitor Node Health | Kubernetes - 0 views

kubernetes.io/...monitor-node-health

kubernetes monitor system

shared by 張旭 on 21 Jul 21 - No Cached

Node Problem Detector is a daemon for monitoring and reporting about a node's health
Node Problem Detector collects information about node problems from various daemons and reports these conditions to the API server as NodeCondition and Event.
Node Problem Detector only supports file based kernel log. Log tools such as journald are not supported.
kubectl provides the most flexible management of Node Problem Detector.
run the Node Problem Detector in your cluster to monitor node health.

GitHub - google/cloudprober: An active monitoring software to detect failures before yo... - 1 views

github.com/cloudprober

google probe monitor ping

shared by crazylion lee on 16 Apr 18 - No Cached

crazylion lee on 16 Apr 18

"An active monitoring software to detect failures before your customers do. https://cloudprober.org"

GitHub - SigNoz/signoz: SigNoz helps developers monitor their applications & troublesho... - 0 views

github.com/signoz

datadog monitor

shared by crazylion lee on 05 Jul 21 - No Cached

crazylion lee on 05 Jul 21

"Monitor your applications and troubleshoot problems in your deployed applications, an open-source alternative to DataDog, New Relic, etc."

Improving Kubernetes reliability: quicker detection of a Node down | Fatal failure - 0 views

fatalfailure.wordpress.com/...icker-detection-of-a-node-down

kubernetes HA

shared by 張旭 on 21 Jul 21 - No Cached

When a node goes down, the pods of the broken node keep running for some time and still receive requests, and those requests will fail.
1- The Kubelet posts its status to the masters using --node-status-update-frequency=10s
2- A node dies
3- The kube controller manager is the one monitoring the nodes; using --node-monitor-period=5s it checks, in the masters, the node status reported by the Kubelet.
4- The kube controller manager will see the node is unresponsive, and has the grace period --node-monitor-grace-period=40s until it considers the node unhealthy.
node-status-update-frequency x (N-1) != node-monitor-grace-period
5- Once the node is marked as unhealthy, the kube controller manager will remove its pods based on --pod-eviction-timeout=5m0s
6- Kube proxy has a watcher over the API, so the very first moment the pods are evicted the proxy will notice and update the iptables of the node, removing the endpoints from the services so the failing pods won’t be accessible anymore.
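The steps above can be turned into a back-of-the-envelope estimate of how long failing pods may keep receiving traffic. This is only a sketch using the default flag values quoted in the steps; adding one monitor period to the grace period is a simplification of this example, and kube-proxy propagation time is ignored.

```python
from typing import Dict


def node_failure_timeline(
    grace_period_s: int = 40,      # --node-monitor-grace-period
    monitor_period_s: int = 5,     # --node-monitor-period
    eviction_timeout_s: int = 300, # --pod-eviction-timeout
) -> Dict[str, int]:
    """Rough upper estimate, in seconds after the last status update."""
    # The controller only notices an expired grace period at its next check.
    marked_unhealthy = grace_period_s + monitor_period_s
    pods_evicted = marked_unhealthy + eviction_timeout_s
    return {"marked_unhealthy_s": marked_unhealthy, "pods_evicted_s": pods_evicted}


if __name__ == "__main__":
    print(node_failure_timeline())
```

With the defaults, eviction comes well over five minutes after the node stops reporting, which is why the article looks at lowering these values.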

Supervisor: A Process Control System - supervisor 3.1a1-dev documentation - 1 views

supervisord.org

monitoring sysadmin python linux unix supervisor

shared by crazylion lee on 13 Feb 14 - Cached

crazylion lee on 13 Feb 14

Supervisor is a client/server system that allows its users to monitor and control a number of processes on UNIX-like operating systems. It shares some of the same goals of programs like launchd, daemontools, and runit. Unlike some of these programs, it is not meant to be run as a substitute for init as "process id 1". Instead it is meant to be used to control processes related to a project or a customer, and is meant to start like any other program at boot time.

Best practices for building Kubernetes Operators and stateful apps | Google Cloud Blog - 0 views

cloud.google.com/...es-operators-and-stateful-apps

kubernetes system operator

shared by 張旭 on 17 May 21 - No Cached

use the StatefulSet workload controller to maintain identity for each of the pods, and to use Persistent Volumes to persist data so it can survive a service restart.
a way to extend Kubernetes functionality with application specific logic using custom resources and custom controllers.
An Operator can automate various features of an application, but it should be specific to a single application
Kubebuilder is a comprehensive development kit for building and publishing Kubernetes APIs and Controllers using CRDs
Design declarative APIs for operators, not imperative APIs. This aligns well with Kubernetes APIs that are declarative in nature.
With declarative APIs, users only need to express their desired cluster state, while letting the operator perform all necessary steps to achieve it.
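A toy sketch of the declarative idea: the user states only the desired replica count, and a reconcile step works out the actions needed to get there. The action names are hypothetical and no Kubernetes client is involved.

```python
from typing import List


def reconcile(desired_replicas: int, actual_replicas: int) -> List[str]:
    """Return the actions that move the actual state toward the desired state."""
    if actual_replicas < desired_replicas:
        return ["create_instance"] * (desired_replicas - actual_replicas)
    if actual_replicas > desired_replicas:
        return ["delete_instance"] * (actual_replicas - desired_replicas)
    return []  # already converged, nothing to do


if __name__ == "__main__":
    print(reconcile(desired_replicas=3, actual_replicas=1))
```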
scaling, backup, restore, and monitoring. An operator should be made up of multiple controllers that specifically handle each of those features.
the operator can have a main controller to spawn and manage application instances, a backup controller to handle backup operations, and a restore controller to handle restore operations.
each controller should correspond to a specific CRD so that the domain of each controller's responsibility is clear.
If you keep a log for every container, you will likely end up with an unmanageable amount of logs.
integrate application-specific details to the log messages such as adding a prefix for the application name.
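One way to add an application-name prefix, sketched with Python's standard logging module. The application name "my-db" and the message format are assumptions chosen for illustration.

```python
import io
import logging
from typing import TextIO


def build_logger(app_name: str, stream: TextIO) -> logging.Logger:
    """Create a logger whose every message is prefixed with the application name."""
    logger = logging.getLogger(app_name)
    logger.setLevel(logging.INFO)
    logger.propagate = False  # avoid duplicate output through the root logger
    handler = logging.StreamHandler(stream)
    handler.setFormatter(logging.Formatter("[%(name)s] %(levelname)s %(message)s"))
    logger.handlers = [handler]
    return logger


if __name__ == "__main__":
    buffer = io.StringIO()
    build_logger("my-db", buffer).info("backup started")
    print(buffer.getvalue(), end="")
```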
you may have to use external logging tools such as Google Stackdriver, Elasticsearch, Fluentd, or Kibana to perform the aggregations.
adding labels to metrics to facilitate aggregation and analysis by monitoring systems.
a more viable option is for application pods to expose a metrics HTTP endpoint for monitoring tools to scrape.
A good way to achieve this is to use open-source application-specific exporters for exposing Prometheus-style metrics.
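A minimal sketch of a metrics HTTP endpoint using only the Python standard library; a real exporter would normally use a Prometheus client library instead. The metric names, the app label value, and port 8000 are all hypothetical.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer
from typing import Dict

APP_NAME = "demo-app"  # hypothetical label value
METRICS: Dict[str, float] = {"demo_queue_depth": 3, "demo_active_sessions": 12}


def render_metrics(app: str, values: Dict[str, float]) -> str:
    """Render gauges in the Prometheus text format, labelled with the app name."""
    lines = []
    for name in sorted(values):
        lines.append(f"# TYPE {name} gauge")
        lines.append(f'{name}{{app="{app}"}} {values[name]}')
    return "\n".join(lines) + "\n"


class MetricsHandler(BaseHTTPRequestHandler):
    def do_GET(self) -> None:
        if self.path != "/metrics":
            self.send_error(404)
            return
        body = render_metrics(APP_NAME, METRICS).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; version=0.0.4")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(host: str = "127.0.0.1", port: int = 8000) -> None:
    """Block forever, serving /metrics for a monitoring tool to scrape."""
    HTTPServer((host, port), MetricsHandler).serve_forever()
```

Calling serve() starts the endpoint; the label on each sample is what lets a monitoring system aggregate across applications.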

Using NGINX Logging for Application Performance Monitoring - 0 views

www.nginx.com/...ication-performance-monitoring

nginx performance log

shared by 張旭 on 22 Sep 21 - No Cached

taking advantage of the flexibility of NGINX access logging is application performance monitoring (APM).
it’s simple to get detailed visibility into the performance of your applications by adding timing values to your code and passing them as response headers for inclusion in the NGINX access log.
$request_time – Full request time, starting when NGINX reads the first byte from the client and ending when NGINX sends the last byte of the response body
$upstream_response_time – Time between establishing a connection to an upstream server and receiving the last byte of the response body
capture timings in the application itself and include them as response headers, which NGINX then captures in its access log.
$upstream_header_time – Time between establishing a connection to an upstream server and receiving the first byte of the response header
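To show how these timing variables can be used once they are in the access log, the sketch below parses them from a log line. It assumes a hypothetical log format that appends rt=$request_time uht=$upstream_header_time urt=$upstream_response_time to each line; the key names are this example's choice, not an NGINX convention.

```python
import re
from typing import Dict, Optional

# Matches the assumed key=value timing fields; "-" is logged when there is no upstream.
TIMING_FIELD = re.compile(r"\b(rt|uht|urt)=(-|\d+(?:\.\d+)?)")


def parse_timings(line: str) -> Dict[str, Optional[float]]:
    """Extract timing fields (in seconds) from one access-log line."""
    return {
        key: (None if raw == "-" else float(raw))
        for key, raw in TIMING_FIELD.findall(line)
    }


if __name__ == "__main__":
    sample = "GET /api/items 200 rt=0.250 uht=0.100 urt=0.200"
    timings = parse_timings(sample)
    print(timings)
    # Time not accounted for by the upstream response.
    print(round(timings["rt"] - timings["urt"], 3))
```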

crontab.guru - the cron schedule expression editor - 0 views

crontab.guru

cron linux guru

shared by crazylion lee on 12 Nov 16 - No Cached

crazylion lee on 12 Nov 16

"By WDT.io - the reliable monitor for your cronjobs."

The Netflix Tech Blog: Introducing Vector: Netflix's On-Host Performance Monitoring Tool - 0 views

techblog.netflix.com/...g-vector-netflixs-on-host.html

netflix monitoring performance linux sysadmin

shared by crazylion lee on 30 Jan 17 - No Cached

Riemann - A network monitoring system - 0 views

riemann.io/index.html

monitoring dashboard sysadmin

shared by crazylion lee on 17 Apr 16 - No Cached

crazylion lee on 17 Apr 16

"Riemann aggregates events from your servers and applications with a powerful stream processing language. Send an email for every exception in your app. Track the latency distribution of your web app. See the top processes on any host, by memory and CPU. Combine statistics from every Riak node in your cluster and forward to Graphite. Track user activity from second to second."

Open Source Monitoring › Icinga - 0 views

www.icinga.org

monitoring nagios opensource network linux

shared by crazylion lee on 15 Aug 14 - No Cached

Security Onion - 0 views

security-onion-solutions.github.io/security-onion

security onion linux distribution

shared by crazylion lee on 18 Nov 15 - No Cached

crazylion lee on 18 Nov 15

"Security Onion is a Linux distro for intrusion detection, network security monitoring, and log management. It's based on Ubuntu and contains Snort, Suricata, Bro, OSSEC, Sguil, Squert, ELSA, Xplico, NetworkMiner, and many other security tools. The easy-to-use Setup wizard allows you to build an army of distributed sensors for your enterprise in minutes!"

Nmap: the Network Mapper - Free Security Scanner - 1 views

nmap.org

security network

shared by crazylion lee on 22 Nov 15 - No Cached

crazylion lee on 22 Nov 15

"Nmap ("Network Mapper") is a free and open source (license) utility for network discovery and security auditing. Many systems and network administrators also find it useful for tasks such as network inventory, managing service upgrade schedules, and monitoring host or service uptime. Nmap uses raw IP packets in novel ways to determine what hosts are available on the network, what services (application name and version) those hosts are offering, what operating systems (and OS versions) they are running, what type of packet filters/firewalls are in use, and dozens of other characteristics. It was designed to rapidly scan large networks, but works fine against single hosts. Nmap runs on all major computer operating systems, and official binary packages are available for Linux, Windows, and Mac OS X. In addition to the classic command-line Nmap executable, the Nmap suite includes an advanced GUI and results viewer (Zenmap), a flexible data transfer, redirection, and debugging tool (Ncat), a utility for comparing scan results (Ndiff), and a packet generation and response analysis tool (Nping)."

Intro to deployment strategies: blue-green, canary, and more - DEV Community - 0 views

dev.to/...blue-green-canary-and-more-3a3

devops deploy system micro service

shared by 張旭 on 23 Nov 18 - No Cached

using a service-oriented architecture and microservices approach, developers can design a code base to be modular.
Modern applications are often distributed and cloud-based
different release cycles for different components
the abstraction of the infrastructure layer, which is now considered code. Deployment of a new application may require the deployment of new infrastructure code as well.
"big bang" deployments update whole or large parts of an application in one fell swoop.
Big bang deployments required the business to conduct extensive development and testing before release, often associated with the "waterfall model" of large sequential releases.
Rollbacks are often costly, time-consuming, or even impossible.
In a rolling deployment, an application’s new version gradually replaces the old one.
new and old versions will coexist without affecting functionality or user experience.
Each container is modified to download the latest image from the app vendor’s site.
two identical production environments work in parallel.
Once the testing results are successful, application traffic is routed from blue to green.
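The blue-to-green switch can be pictured with a tiny in-memory router. This is a sketch only: the class and method names are hypothetical, and a real setup would change a load balancer or DNS record instead.

```python
class BlueGreenRouter:
    """Tracks which of the two identical environments receives live traffic."""

    def __init__(self) -> None:
        self.live = "blue"

    @property
    def idle(self) -> str:
        return "green" if self.live == "blue" else "blue"

    def promote(self, tests_passed: bool) -> str:
        """Route traffic to the idle environment only if its tests succeeded."""
        if tests_passed:
            self.live = self.idle
        return self.live


if __name__ == "__main__":
    router = BlueGreenRouter()
    print(router.promote(tests_passed=True))
```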
In a blue-green deployment, both systems use the same persistence layer or database back end.
Blue can use the primary database for write operations, while green uses the secondary for read operations.
Blue-green deployments rely on traffic routing.
long TTL values can delay these changes.
The main challenge of canary deployment is to devise a way to route some users to the new application.
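One common way to route a stable subset of users to the canary is to hash a user identifier into a bucket. The sketch below assumes string user IDs and a percentage knob; both are illustrative choices, not something the article prescribes.

```python
import hashlib


def routes_to_canary(user_id: str, canary_percent: int) -> bool:
    """Deterministically place a user in the canary group by hashing the ID."""
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:4], "big") % 100  # bucket in 0..99
    return bucket < canary_percent


if __name__ == "__main__":
    users = [f"user-{n}" for n in range(1000)]
    share = sum(routes_to_canary(u, 10) for u in users) / len(users)
    print(f"share routed to canary: {share:.1%}")
```

Because the hash is deterministic, a given user keeps seeing the same version for as long as the percentage is unchanged.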
Using application logic to unlock new features to specific users and groups.
With CD, the CI-built code artifact is packaged and always ready to be deployed in one or more environments.
Use Build Automation tools to automate environment builds
Use configuration management tools
Enable automated rollbacks for deployments
An application performance monitoring (APM) tool can help your team monitor critical performance metrics including server response times after deployments.

What's the difference between Prometheus and Zabbix? - Stack Overflow - 0 views

stackoverflow.com/...-between-prometheus-and-zabbix

system monitor php go

shared by 張旭 on 23 Apr 18 - No Cached

Zabbix has a core written in C and a web UI based on PHP
Zabbix stores data in RDBMS (MySQL, PostgreSQL, Oracle, sqlite) of user's choice.
Prometheus uses its own database embedded into the backend process
Zabbix by default uses a "pull" model in which the server connects to agents on each monitored machine; the agents periodically gather the info and send it to the server.
Prometheus prefers a "pull" model in which the server gathers info from client machines.
Prometheus requires an application to be instrumented with a Prometheus client library (available in different programming languages) for preparing metrics.
expose metrics for Prometheus (similar to "agents" for Zabbix)
Zabbix uses its own tcp-based communication protocol between agents and a server.
Prometheus uses HTTP with protocol buffers (+ text format for ease of use with curl).
Prometheus offers a basic tool for exploring gathered data and visualizing it in simple graphs on its native server, and also offers a minimal dashboard builder, PromDash. But Prometheus is designed to be supported by modern visualization tools like Grafana.
Prometheus offers a solution for alerting that is separated from its core into the Alertmanager application.

GitHub - tgraf/bmon: bandwidth monitor and rate estimator - 0 views

github.com/bmon

monitor network bandwidth

shared by crazylion lee on 21 Apr 17 - No Cached

MongoDB Performance Tuning: Everything You Need to Know - Stackify - 0 views

stackify.com/mongodb-performance-tuning

database mongodb

shared by 張旭 on 15 Apr 21 - No Cached

db.serverStatus().globalLock
db.serverStatus().locks
globalLock.currentQueue.total: This number can indicate a possible concurrency issue if it’s consistently high. This can happen if a lot of requests are waiting for a lock to be released.
globalLock.totalTime: If this is higher than the total database uptime, the database has been in a lock state for too long.
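A small sketch of how the "consistently high" queue check might be automated. It works on plain dictionaries shaped like db.serverStatus() output, collected at intervals by whatever means you prefer; the threshold of 10 is an arbitrary assumption, not a MongoDB recommendation.

```python
from typing import Any, Dict, List


def queue_total(server_status: Dict[str, Any]) -> int:
    """Read globalLock.currentQueue.total from one serverStatus-like document."""
    return server_status.get("globalLock", {}).get("currentQueue", {}).get("total", 0)


def consistently_queued(samples: List[Dict[str, Any]], threshold: int = 10) -> bool:
    """True when every sample shows more queued operations than the threshold."""
    return bool(samples) and all(queue_total(s) > threshold for s in samples)


if __name__ == "__main__":
    # Hypothetical samples taken a few seconds apart.
    history = [
        {"globalLock": {"currentQueue": {"total": 14, "readers": 9, "writers": 5}}},
        {"globalLock": {"currentQueue": {"total": 22, "readers": 15, "writers": 7}}},
    ]
    print(consistently_queued(history))
```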
Unlike relational databases such as MySQL or PostgreSQL, MongoDB uses JSON-like documents for storing data.
Databases operate in an environment that consists of numerous reads, writes, and updates.
When a lock occurs, no other operation can read or modify the data until the operation that initiated the lock is finished.
locks.deadlockCount: Number of times the lock acquisitions have encountered deadlocks
Is the database frequently locking from queries? This might indicate issues with the schema design, query structure, or system architecture.
From version 3.2 on, WiredTiger is the default.
MMAPv1 locks whole collections, not individual documents.
WiredTiger performs locking at the document level.
When the MMAPv1 storage engine is in use, MongoDB will use memory-mapped files to store data.
All available memory will be allocated for this usage if the data set is large enough.
db.serverStatus().mem
mem.resident: Roughly equivalent to the amount of RAM in megabytes that the database process uses
If mem.resident exceeds the value of system memory and there’s a large amount of unmapped data on disk, we’ve most likely exceeded system capacity.
If the value of mem.mapped is greater than the amount of system memory, some operations will experience page faults.
The WiredTiger storage engine is a significant improvement over MMAPv1 in performance and concurrency.
By default, MongoDB will reserve 50 percent of the available memory for the WiredTiger data cache.
wiredTiger.cache.bytes currently in the cache – This is the size of the data currently in the cache.
wiredTiger.cache.tracked dirty bytes in the cache – This is the size of the dirty data in the cache.
we can look at the wiredTiger.cache.bytes read into cache value for read-heavy applications. If this value is consistently high, increasing the cache size may improve overall read performance.
check whether the application is read-heavy. If it is, increase the size of the replica set and distribute the read operations to secondary members of the set.
write-heavy, use sharding within a sharded cluster to distribute the load.
Replication is the propagation of data from one node to another
Replica sets handle this replication.
Sometimes, data isn’t replicated as quickly as we’d like.
a particularly thorny problem if the lag between a primary and secondary node is high and the secondary becomes the primary
use the db.printSlaveReplicationInfo() or the rs.printSlaveReplicationInfo() command to see the status of a replica set from the perspective of the secondary member of the set.
shows how far behind the secondary members are from the primary. This number should be as low as possible.
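The lag can also be computed by hand from a replica set status document. This sketch assumes a list of members shaped like the members array of rs.status(), using its name, stateStr and optimeDate fields; the host names and timestamps are made up.

```python
from datetime import datetime, timedelta
from typing import Any, Dict, List


def replication_lag(members: List[Dict[str, Any]]) -> Dict[str, timedelta]:
    """Lag of each secondary behind the primary, based on last applied op time."""
    primary = next(m for m in members if m["stateStr"] == "PRIMARY")
    return {
        m["name"]: primary["optimeDate"] - m["optimeDate"]
        for m in members
        if m["stateStr"] == "SECONDARY"
    }


if __name__ == "__main__":
    # Hypothetical members of a three-node replica set.
    status = [
        {"name": "db1:27017", "stateStr": "PRIMARY", "optimeDate": datetime(2021, 4, 15, 12, 0, 30)},
        {"name": "db2:27017", "stateStr": "SECONDARY", "optimeDate": datetime(2021, 4, 15, 12, 0, 28)},
        {"name": "db3:27017", "stateStr": "SECONDARY", "optimeDate": datetime(2021, 4, 15, 11, 59, 30)},
    ]
    for name, lag in replication_lag(status).items():
        print(name, lag.total_seconds(), "seconds behind")
```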
monitor this metric closely.
watch for any spikes in replication delay.
Always investigate these issues to understand the reasons for the lag.
One member of a replica set is primary. All others are secondary.
it’s not normal for nodes to change back and forth between primary and secondary.
use the profiler to gain a deeper understanding of the database’s behavior.
Enabling the profiler can affect system performance, due to the additional activity.

張旭 on 15 Apr 21

"globalLock.currentQueue.total: This number can indicate a possible concurrency issue if it's consistently high. This can happen if a lot of requests are waiting for a lock to be released."

Kubernetes Components | Kubernetes - 0 views

kubernetes.io/...components

kubernetes system