时序数据库InfluxDB的基本语法

2023年6月16日上午8:26 • 数据库 • 阅读 92

一了解InfluxDB的必要性

Time series data is a series of data points each associated with a specific time. Examples include:

Server performance metrics
Financial averages over time
Sensor data, such as temperature, barometric pressure, wind speeds, etc.

Relational databases can be used to store and analyze time series data, but depending on the precision of your data, a query can involve potentially millions of rows. InfluxDB is purpose-built to store and query data by time, providing out-of-the-box functionality that optionally downsamples data after a specific age and a query engine optimized for time-based data.

二基本概念

database

A logical container for users, retention policies, continuous queries, and time series data.

duration

The attribute of the retention policy that determines how long InfluxDB stores data. Data older than the duration are automatically dropped from the database.

The key-value pair in an InfluxDB data structure that records metadata and the actual data value. Fields are required in InfluxDB data structures and they are not indexed – queries on field values scan all points that match the specified time range and, as a result, are not performant relative to tags.

Field keys are strings and they store metadata.Field values are the actual data; they can be strings, floats, integers, or booleans. A field value is always associated with a timestamp.

Tags are optional. The key-value pair in the InfluxDB data structure that records metadata.You don’t need to have tags in your data structure, but it’s generally a good idea to make use of them because, unlike fields, tags are indexed. This means that queries on tags are faster and that tags are ideal for storing commonly-queried metadata.

Tags are indexed and fields are not indexed. This means that queries on tags are more performant than those on fields.

（1）Store commonly-queried meta data in tags

（2）Store data in tags if you plan to use them with the InfluxQL GROUP BY clause

（3）Store data in fields if you plan to use them with an InfluxQL function

（4）Store numeric values as fields (tag values only support string values)

The measurement acts as a container for tags, fields, and the time column, and the measurement name is the description of the data that are stored in the associated fields. Measurement names are strings, and, for any SQL users out there, a measurement is conceptually similar to a table.

In InfluxDB, a point represents a single data record, similar to a row in a SQL database table. Each point:

has a measurement, a tag set, a field key, a field value, and a timestamp;
is uniquely identified by its series and timestamp.

You cannot store more than one point with the same timestamp in a series. If you write a point to a series with a timestamp that matches an existing point, the field set becomes a union of the old and new field set, and any ties go to the new field set.

In InfluxDB, a series is a collection of points that share a measurement, tag set, and field key. A point represents a single data record that has four components: a measurement, tag set, field set, and a timestamp. A point is uniquely identified by its series and timestamp.

series key

A series key identifies a particular series by measurement, tag set, and field key.

三查询

1.实现查询以给定字段开始的数据

select fieldName from measurementName where fieldName=~/^给定字段/

2.实现查询以给定字段结束的数据

select fieldName from measurementName where fieldName=~/给定字段$/

3.实现查询包含给定字段数据

select fieldName from measurementName where fieldName=~/给定字段/

必须包含field key

A query requires at least one field key in the SELECT clause to return data. If the SELECT clause only includes a single tag key or several tag keys, the query returns an empty response. This behavior is a result of how the system stores data.

使用单引号，否则无数据返回或报错

（1）Single quote string field values in the WHERE clause. Queries with unquoted string field values or double quoted string field values will not return any data and, in most cases,will not return an error.

（2）Single quote tag values in the WHERE clause. Queries with unquoted tag values or double quoted tag values will not return any data and, in most cases, will not return an error.

（1）Note that the GROUP BY clause must come after the WHERE clause.

（2）The GROUP BY clause groups query results by: one or more specified tags ；specified time interval。

（3）You cannot use GROUP BY to group fields.

（4） fill() changes the value reported for time intervals that have no data.

By default, a GROUP BY time() interval with no data reports null as its value in the output column. fill() changes the value reported for time intervals that have no data. Note that fill() must go at the end of the GROUP BY clause if you’re GROUP(ing) BY several things (for example, both tags and a time interval).

By default, InfluxDB returns results in ascending time order; the first point returned has the oldest timestamp and the last point returned has the most recent timestamp. ORDER BY time DESC reverses that order such that InfluxDB returns the points with the most recent timestamps first.

注意： ORDER by time DESC must appear after the GROUP BY clause if the query includes a GROUP BY clause. ORDER by time DESC must appear after the WHERE clause if the query includes a WHERE clause and no GROUP BY clause.

四.SHOW CARDINALITY

是用于估计或精确计算measurement、序列、tag key、tag value和field key的基数的一组命令。

SHOW CARDINALITY命令有两种可用的版本：估计和精确。估计值使用草图进行计算，对于所有基数大小来说，这是一个安全默认值。精确值是直接对TSM（Time-Structured Merge Tree）数据进行计数，但是，对于基数大的数据来说，运行成本很高。

下面以tag key、tag value为例。

估计或精确计算tag key集的基数。

举例:

估计或精确计算指定tag key对应的tag value的基数。

例如，前面的分享，我们通过Telegraf 将server的监控数据保存到了InfluxDB中，其中CPU指标是必不可少的（telegraf.conf 设置）。假如有一天，我们需要统计telegraf一共部署了多少台。其实就可以通过SHOW TAG VALUES EXACT CARDINALITY 获得。

SQL 语句如下:

即查看cpu 中 host 的key值有多少个。因为通过telegraf.conf的设置，一台Server 对应一个唯一的host值，host值有多少个，就有多少台Server已部署了telegraf。

5 Drop 与 Delete

The DROP SERIES query deletes all points from a series in a database, and it drops the series from the index.

The query takes the following form, where you must specify either the FROM clause or the WHERE clause.

语法如下：

A successful DROP SERIES query returns an empty result.

Drop all points in the series that have a specific tag pair from all measurements in the database(即，如不指定from，将会把符合条件的所有表tag数据删除).

与Delete series 的区别是：

The DELETE query deletes all points from a series in a database. Unlike DROP SERIES, DELETE does not drop the series from the index.

只允许根据tag和时间来进行删除操作.

即先执行:

然后再执行:

六常用函数部分

常用函数汇总如下:

COUNT() Returns the number of non-null field values. 可以理解为统计某段时间段内这个值的次数/行数。 DISTINCT() Returns the list of unique field values.

often returns several results with the same timestamp; InfluxDB assumes points with the same series and timestamp are duplicate points and simply overwrites any duplicate point with the most recent point in the destination measurement. INTEGRAL() Returns the area under the curve for subsequent field values. InfluxDB calculates the area under the curve for subsequent field values and converts those results into the summed area per

. The

argument is an integer followed by a duration literal and it is optional. If the query does not specify the

, the unit defaults to one second (

). MEAN() Returns the arithmetic mean (average) of field values. MEDIAN() Returns the middle value from a sorted list of field values.

is nearly equivalent to

, except

returns the average of the two middle field values if the field contains an even number of values. MODE() Returns the most frequent value in a list of field values.

returns the field value with the earliest timestamp if there’s a tie between two or more values for the maximum number of occurrences. SPREAD() Returns the difference between the minimum and maximum field values. 有些指标值是递增的，这时候，如果向看某段时间端的的变化值/增长值，可以直接使用这个函数获取。 STDDEV() Returns the standard deviation of field values. SUM() Returns the sum of field values.

BOTTOM() Returns the smallest

field values.

returns the field value with the earliest timestamp if there’s a tie between two or more values for the smallest value. FIRST() Returns the field value with the oldest timestamp. LAST() Returns the field value with the most recent timestamp. MAX() Returns the greatest field value. MIN() Returns the lowest field value. PERCENTILE() Returns the

th percentile field value. SAMPLE() Returns a random sample of

field values.

uses reservoir sampling to generate the random points. TOP() Returns the greatest

field values.

returns the field value with the earliest timestamp if there’s a tie between two or more values for the greatest value.

ABS() Returns the absolute value of the field value. ACOS() Returns the arccosine (in radians) of the field value. Field values must be between -1 and 1. ASIN() Returns the arcsine (in radians) of the field value. Field values must be between -1 and 1. ATAN() Returns the arctangent (in radians) of the field value. Field values must be between -1 and 1. ATAN2() Returns the the arctangent of

in radians. CEIL() Returns the subsequent value rounded up to the nearest integer. COS() Returns the cosine of the field value. CUMULATIVE_SUM() Returns the running total of subsequent field values. DERIVATIVE() Returns the rate of change between subsequent field values. InfluxDB calculates the difference between subsequent field values and converts those results into the rate of change per

. The

argument is an integer followed by a duration literal and it is optional. If the query does not specify the

the unit defaults to one second (

). DIFFERENCE() Returns the result of subtraction between subsequent field values. ELAPSED() Returns the difference between subsequent field value’s timestamps. InfluxDB calculates the difference between subsequent timestamps. The

option is an integer followed by a duration literal and it determines the unit of the returned difference. If the query does not specify the

option the query returns the difference between timestamps in nanoseconds. EXP() Returns the exponential of the field value. FLOOR() Returns the subsequent value rounded down to the nearest integer. LN() Returns the natural logarithm of the field value. LOG() Returns the logarithm of the field value with base

. LOG2() Returns the logarithm of the field value to the base 2. LOG10() Returns the logarithm of the field value to the base 10. MOVING_AVERAGE() Returns the rolling average across a window of subsequent field values. POW() Returns the field value to the power of

ROUND() Returns the subsequent value rounded to the nearest integer. SIN() Returns the sine of the field value. SQRT() Returns the square root of field value. TAN() Returns the tangent of the field value.

HOLT_WINTERS() Returns N number of predicted field values

Predict when data values will cross a given threshold；

Compare predicted values with actual values to detect anomalies in your data.

CHANDE_MOMENTUM_OSCILLATOR() The Chande Momentum Oscillator (CMO) is a technical momentum indicator developed by Tushar Chande. The CMO indicator is created by calculating the difference between the sum of all recent higher data points and the sum of all recent lower data points, then dividing the result by the sum of all data movement over a given time period. The result is multiplied by 100 to give the -100 to +100 range. EXPONENTIAL_MOVING_AVERAGE() An exponential moving average (EMA) is a type of moving average that is similar to a simple moving average, except that more weight is given to the latest data. It’s also known as the “exponentially weighted moving average.” This type of moving average reacts faster to recent data changes than a simple moving average. DOUBLE_EXPONENTIAL_MOVING_AVERAGE() The Double Exponential Moving Average (DEMA) attempts to remove the inherent lag associated to Moving Averages by placing more weight on recent values. The name suggests this is achieved by applying a double exponential smoothing which is not the case. The name double comes from the fact that the value of an EMA is doubled. To keep it in line with the actual data and to remove the lag, the value “EMA of EMA” is subtracted from the previously doubled EMA. KAUFMANS_EFFICIENCY_RATIO() Kaufman’s Efficiency Ration, or simply “Efficiency Ratio” (ER), is calculated by dividing the data change over a period by the absolute sum of the data movements that occurred to achieve that change. The resulting ratio ranges between 0 and 1 with higher values representing a more efficient or trending market.

The ER is very similar to the Chande Momentum Oscillator (CMO). The difference is that the CMO takes market direction into account, but if you take the absolute CMO and divide by 100, you you get the Efficiency Ratio.

KAUFMANS_ADAPTIVE_MOVING_AVERAGE() Kaufman’s Adaptive Moving Average (KAMA) is a moving average designed to account for sample noise or volatility. KAMA will closely follow data points when the data swings are relatively small and noise is low. KAMA will adjust when the data swings widen and follow data from a greater distance. This trend-following indicator can be used to identify the overall trend, time turning points and filter data movements. TRIPLE_EXPONENTIAL_MOVING_AVERAGE() The triple exponential moving average (TEMA) was developed to filter out volatility from conventional moving averages. While the name implies that it’s a triple exponential smoothing, it’s actually a composite of a single exponential moving average, a double exponential moving average, and a triple exponential moving average. TRIPLE_EXPONENTIAL_DERIVATIVE() The triple exponential derivative indicator, commonly referred to as “TRIX,” is an oscillator used to identify oversold and overbought markets, and can also be used as a momentum indicator. TRIX calculates a triple exponential moving average of the log of the data input over the period of time. The previous value is subtracted from the previous value. This prevents cycles that are shorter than the defined period from being considered by the indicator.

Like many oscillators, TRIX oscillates around a zero line. When used as an oscillator, a positive value indicates an overbought market while a negative value indicates an oversold market. When used as a momentum indicator, a positive value suggests momentum is increasing while a negative value suggests momentum is decreasing. Many analysts believe that when the TRIX crosses above the zero line it gives a buy signal, and when it closes below the zero line, it gives a sell signal.

RELATIVE_STRENGTH_INDEX() The relative strength index (RSI) is a momentum indicator that compares the magnitude of recent increases and decreases over a specified time period to measure speed and change of data movements.

参考网址:

https://blog.csdn.net/xuxiannian/article/details/103559246

https://blog.csdn.net/funnyPython/article/details/89888972

Original: https://www.cnblogs.com/xuliuzai/p/14711334.html
Author: 东山絮柳仔
Title: 时序数据库InfluxDB的基本语法

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/622473/

转载文章受原作者版权保护。转载请注明原作者出处！

数据库

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

使用Docker安装FastDFS

1. 获取镜像可以利用已有的FastDFS Docker镜像来运行FastDFS。获取镜像可以通过下载： sudo docker image pull delron/fastd…

数据库 2023年6月14日
0091
Java学习-第一部分-第一阶段-第八节：项目-房屋出租系统

项目-房屋出租系统笔记目录：(https://www.cnblogs.com/wenjie2000/p/16378441.html) 房屋出租系统-需求 ●项目需求说明实现基于…

数据库 2023年6月11日
0070
Nginx负载均衡

Nginx负载均衡负载均衡概述早期的网站流量和业务功能都比较简单，单台服务器足以满足基本的需求，但是随着互联网的发展，业务流量越来越大并且业务逻辑也跟着越来越复杂，单台服务器的…

数据库 2023年6月6日
0078
linux系统安装MySQL数据库安装保姆级教程及1045错误和2058问题解决

使用Mysql的zip压缩包解压版，下载之后需进行一定的配置，才能使用它。下面对Mysql压缩包版的安装方法进行详细的描述，如有疑问或错误，望及时反馈。首先，mysql的官方下载…

数据库 2023年6月11日
0084
Java8Stream流

Stream流呢，以前我也有所了解，像一些面试题中也出现过，Java8的新特性，有一块就是这个Stream操作集合，而且在看一些项目中也使用的比较多。但总感觉自己学的一知半解，所以…

数据库 2023年6月11日
0074
PHP str_repeat()

str_repeat str_repeat() 函数把字符串重复指定的次数。示例： function strRepeat() { echo str_repeat("*&…

数据库 2023年6月14日
0069
解决 IDEA 2021.2.3 新建maven项目只有两个archetype项目模板的问题

最近把我的 IDEA 版本更新到 2021.2.3 了，发生了一个比较有意思的问题，做个小小的记录思路分析在新的 IDEA 中配置完Maven之后，想要创建Maven项目的时候…

数据库 2023年6月11日
0074
idea 自定义注释模板

自定义的格式： html;gutter:true; * * * @Description * $param$ * @Author $user$ * @Date $time$ $da…

数据库 2023年6月16日
0050
springcloud Alibaba 阿里组件 nacos注册中心 gateway网关 flowable流程引擎 vue.js前后分离

springcloud + springcloud Alibaba + flowable 流程引擎 1.代码生成器：正反双向freemaker模版技术 ,0个代码不用写,生成完整…

数据库 2023年6月6日
0085
[SWPU2019] Android3

先反编译出java代码查看，发现没有坑，直接调用了库文件里的check 把so文件直接丢到ida中查找check函数，看到这是说明flag是11位，刚好发现11个ascii码 &…

数据库 2023年6月11日
0098
mysql5.7初始化密码报错 ERROR 1820 (HY000): You must reset your password using ALTER USER statement before

mysql初始化密码常见报错问题1，mysql5.6是密码为空直接进入数据库的，但是mysql5.7就需要初始密码 cat /var/log/mysqld.log | grep p…

数据库 2023年6月16日
0054
弱隔离级别 & 事务并发问题

为什么要有弱隔离级别如果两个事务操作的是不同的数据，即不存在数据依赖关系，则它们可以安全地并行执行。但是当出现某个事务修改数据而另一个事务同时要读取该数据，或者两个事务同时…

数据库 2023年6月11日
0075
分布式数据库中的事务时序

在单机数据库领域，我们为每个事务都分配一个序列号，比如Oracle的SCN(SystemChangeNumber),MySQL的LSN(LogSequenceNumber)，这个序…

数据库 2023年6月9日
00103
mybatis 拦截器

1.mybatis拦截器介绍拦截器可在mybatis进行sql底层处理的时候执行额外的逻辑，最常见的就是分页逻辑、对结果集进行处理过滤敏感信息等。 public Paramete…

数据库 2023年6月16日
0070
实现线程的两种方式

实现Runnable接口如果当前类不仅要继承其他类（非Thread类），还要实现多线程，那么只能通过当前类实现 Runnable接口来创建Thread类对象。实现Run…

数据库 2023年6月16日
00106
fiddler的mock数据与二次开发示例

fiddler的使用记录 fiddler了解上官网下载工具，然后安装使用，https://www.telerik.com/fiddler，如果对该工具不熟悉，还有直白的教程，看过…

数据库 2023年6月6日
00111

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

时序数据库InfluxDB的基本语法

一 了解InfluxDB的必要性

二 基本概念

三 查询

四.SHOW CARDINALITY

5 Drop 与 Delete

六 常用函数部分

参考网址:

大家都在看

一了解InfluxDB的必要性

二基本概念

三查询

六常用函数部分