经验首页 前端设计 程序设计 Java相关 移动开发 数据库/运维 软件/图像 大数据/云计算 其他经验
当前位置:技术经验 » 数据库/运维 » MS SQL Server » 查看文章
通过Python实现对SQL Server 数据文件大小的监控告警
来源:cnblogs  作者:东山絮柳仔  时间:2021/5/6 17:52:21  对本文有异议

1.需求背景 

系统程序突然报错,报错信息如下:

  1. The transaction log for database '@dbname' is full. To find out why space in the log cannot be reused, see the log_reuse_wait_desc column in sys.databases

此时查看log文件,已达2T。

当时的紧急处理方案是,移除掉镜像,修改数据库恢复模式(由full修改为simple),收缩日志。

为了防止类似问题再次发生,需对log 文件的大小进行监控,当到达阈值后,触发告警。

2.主要基础组件(类)

配置文件 qqmssqltest_db_server_conf.ini

同过此配置文件获取DB Server信息、DB信息、UID信息、邮件服务器信息等。

  1. [sqlserver]
  2. db_user = XXXXXX
  3. db_pwd = XXXXXXX
  4. [sqlserver_qq]
  5. db_host = 110.119.120.114
  6. db_port = 1433
  7.  
  8.  
  9. [windows]
  10. user =
  11. pwd =
  12.  
  13.  
  14. [mail]
  15. host = zheshiceshidemail.qq.com
  16. port = 25
  17. user =
  18. pwd =
  19. sender = zhejiushiceshidebuyaodangzhen@qq.com

 获取连接串的组件mssql_get_db_connect.py

  1. # -*- coding: utf-8 -*-
  2.  
  3.  
  4. import sys
  5. import os
  6. import datetime
  7. import configparser
  8. import pymssql
  9. # pip3 install pymssql-2.1.4-cp37-cp37m-win_amd64.whl
  10. # pip3 install pymssql -i https://pypi.doubanio.com/simple
  11.  
  12.  
  13. # 获取连接串信息
  14. def mssql_get_db_connect(db_host, db_port):
  15. db_host = db_host
  16. db_port = db_port
  17. db_ps_file = os.path.join(sys.path[0], "qqmssqltest_db_server_conf.ini")
  18. config = configparser.ConfigParser()
  19. config.read(db_ps_file, encoding="utf-8")
  20. db_user = config.get('sqlserver', 'db_user')
  21. db_pwd = config.get('sqlserver', 'db_pwd')
  22. conn = pymssql.connect(host=db_host, port=db_port, user=db_user, password=db_pwd, charset="utf8", login_timeout=5, timeout=600, autocommit=True)
  23. return conn

 执行SQL语句的组件mysql_exec_sql.py

  1. # -*- coding: utf-8 -*-
  2.  
  3. import mysql_get_db_connect
  4. def mysql_exec_dml_sql(db_host, db_port, exec_sql):
  5. conn = mysql_get_db_connect.mysql_get_db_connect(db_host, db_port)
  6. with conn.cursor() as cursor_db:
  7. cursor_db.execute(exec_sql)
  8. conn.commit()
  9. def mysql_exec_select_sql(db_host, db_port, exec_sql):
  10. conn = mysql_get_db_connect.mysql_get_db_connect(db_host, db_port)
  11. with conn.cursor() as cursor_db:
  12. cursor_db.execute(exec_sql)
  13. sql_rst = cursor_db.fetchall()
  14. return sql_rst
  15. def mysql_exec_select_sql_include_colnames(db_host, db_port, exec_sql):
  16. conn = mysql_get_db_connect.mysql_get_db_connect(db_host, db_port)
  17. with conn.cursor() as cursor_db:
  18. cursor_db.execute(exec_sql)
  19. sql_rst = cursor_db.fetchall()
  20. col_names = cursor_db.description
  21. return sql_rst, col_names

 发邮件的功能send_monitor_mail.py

  1. # -*- coding: utf-8 -*-
  2.  
  3.  
  4. # pip3 install PyEmail
  5. import smtplib
  6. from email.mime.text import MIMEText
  7. import configparser
  8. import os
  9. import sys
  10. # 发送告警邮件
  11. def send_monitor_mail(mail_subject, mail_body, mail_receivers="testwukongbaigujing@qq.com"):
  12. db_ps_file = os.path.join(sys.path[0], "qqmssqltest_db_server_conf.ini")
  13. config = configparser.ConfigParser()
  14. config.read(db_ps_file, encoding="utf-8")
  15. mail_host = config.get('mail', 'host')
  16. mail_port = config.get('mail', 'port')
  17. # mail_user = config.get('mail', 'user')
  18. # mail_pwd = config.get('mail', 'pwd')
  19. sender = config.get('mail', 'sender')
  20. # receivers = config.get('mail', 'receivers')
  21.  
  22. # 发送HTML格式邮件
  23. message = MIMEText(mail_body, 'html', 'utf-8')
  24. # message = MIMEText(mail_body, 'plain', 'utf-8')
  25. message['subject'] = mail_subject
  26. message['From'] = sender
  27. message['To'] = mail_receivers
  28. try:
  29. smtpObj = smtplib.SMTP()
  30. smtpObj.connect(mail_host, mail_port) # 25 为 SMTP 端口号
  31. # SMTP AUTH extension not supported by server.
  32. # https://github.com/miguelgrinberg/microblog/issues/76
  33. # smtpObj.ehlo()
  34. # smtpObj.starttls()
  35. # smtpObj.login(mail_user, mail_pwd)
  36. smtpObj.sendmail(sender, mail_receivers, message.as_string())
  37. smtpObj.quit()
  38. print("邮件发送成功")
  39. except Exception as e:
  40. print(e)
  41. # except smtplib.SMTPException:
  42. # print("Error: 无法发送邮件")

3.主要功能代码

收集到的DB数据文件的信息保存到表mssql_dblogsize中,其建表的脚本如下:

  1. CREATE TABLE [dbo].[mssql_dblogsize](
  2. [id] [int] IDENTITY(1,1) NOT NULL,
  3. [createtime] [datetime] NULL,
  4. [vip] [nvarchar](100) NULL,
  5. [port] [nvarchar](100) NULL,
  6. [Environment] [nvarchar](200) NULL,
  7. [Dbname] [varchar](200) NULL,
  8. [Logical_Name] [varchar](200) NULL,
  9. [Physical_Name] [varchar](1500) NULL,
  10. [Size] [bigint] NULL,
  11. PRIMARY KEY CLUSTERED
  12. (
  13. [id] ASC
  14. )WITH (PAD_INDEX = OFF, STATISTICS_NORECOMPUTE = OFF, IGNORE_DUP_KEY = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON) ON [PRIMARY]
  15. ) ON [PRIMARY]
  16. GO
  17.  
  18. ALTER TABLE [dbo].[mssql_dblogsize] ADD DEFAULT (getdate()) FOR [createtime]
  19. GO

 为了方便对表mssql_dblogsize的数据进行管理和展示,在其基础上抽象加工出了一个视图v_mssql_dblogsize,注意Size大小的转换(Size/128/1024 as SizeGB)

创建视图的脚本如下:

  1. CREATE view [dbo].[v_mssql_dblogsize]
  2. as
  3. SELECT [id]
  4. ,[createtime]
  5. ,[vip]
  6. ,[port]
  7. ,[Environment]
  8. ,[Dbname]
  9. ,[Logical_Name]
  10. ,[Physical_Name]
  11. ,Size/128/1024 as SizeGB
  12. FROM [dbo].[mssql_dblogsize]
  13. where size >50*128*1024
  14. and Physical_Name like '%ldf%'
  15. GO

本测试实例使用的数据库为qqDB,监控的各个DB Server保存在了表QQDBServer中,注意Port 不一定为标准端口1433.

collect_mssql_dblogsize_info.py

  1. # -*- coding: utf-8 -*-
  2.  
  3.  
  4. import sys
  5. import os
  6. import configparser
  7. import pymssql
  8. import mssql_get_db_connect
  9. import mssql_exec_sql
  10. from datetime import datetime
  11. def collect_mssql_dblogsize_info():
  12. db_ps_file = os.path.join(sys.path[0], "qqmssqltest_db_server_conf.ini")
  13. config = configparser.ConfigParser()
  14. config.read(db_ps_file, encoding="utf-8")
  15. m_db_host = config.get('sqlserver_qq', 'db_host')
  16. m_db_port = config.getint('sqlserver_qq', 'db_port')
  17. # 获取需要遍历的DB列表
  18. exec_sql_1 = """
  19. SELECT IP, case Port when '1444,1433' then '1433' else Port end as Port, Environment
  20. FROM qqDB.dbo.QQDBServer
  21. where InUse =1 AND ServerType IN ('SQL')
  22. and IP=VIP ;
  23. """
  24. sql_rst_1 = mssql_exec_sql.mssql_exec_select_sql(m_db_host, m_db_port, exec_sql_1)
  25. for j in sql_rst_1:
  26. db_host_2 = j[0]
  27. db_port_2 = j[1]
  28. db_Environment = j[2]
  29. exec_sql_2 = """
  30. select '""" + db_host_2 + """' as vip, '""" + db_port_2 + """' as port, '""" + db_Environment + """' as Environment,DB_NAME(database_id) AS DatabaseName,
  31. Name AS Logical_Name,
  32. Physical_Name, size
  33. FROM master.sys.master_files;
  34. """
  35. try:
  36. sql_rst_2 = mssql_exec_sql.mssql_exec_select_sql(db_host_2, db_port_2, exec_sql_2)
  37. except Exception as e:
  38. print(e)
  39. for k in sql_rst_2:
  40. exec_sql_3 = """
  41. insert into qqDB..mssql_dblogsize([vip], [port], [Environment], [Dbname], [Logical_Name], [Physical_Name], [Size])
  42. values('%s', '%s', '%s', '%s', '%s', '%s', '%s');
  43. """
  44. conn = mssql_get_db_connect.mssql_get_db_connect(m_db_host, m_db_port)
  45. with conn.cursor() as cursor_db:
  46. cursor_db.execute(exec_sql_3 % (k[0], k[1], k[2], k[3], k[4], k[5], k[6] ))
  47. conn.commit()
  48. collect_mssql_dblogsize_info()

 告警邮件的功能实现为mssql_alert_dblogsize.py,此份代码的告警阈值设置的为50G,数据来自于视图v_mssql_dblogsize。

  1. # -*- coding: utf-8 -*-
  2.  
  3.  
  4. import sys
  5. import os
  6. import configparser
  7. import pymssql
  8. import mssql_get_db_connect
  9. import mssql_exec_sql
  10. import datetime
  11. import send_monitor_mail
  12. import pandas as pd
  13. def mssql_alert_dblogsize():
  14. mail_subject = "SQL Server DB Log Size Greater than 50G, please check!!! "
  15. mail_receivers = "testDBAgrp@qtiantianq.com"
  16. db_ps_file = os.path.join(sys.path[0], "qqmssqltest_db_server_conf.ini")
  17. config = configparser.ConfigParser()
  18. config.read(db_ps_file, encoding="utf-8")
  19. m_db_host = config.get('sqlserver_qq', 'db_host')
  20. m_db_port = config.getint('sqlserver_qq', 'db_port')
  21. # 获取需要遍历的DB列表
  22. exec_sql_4 = """
  23. SELECT [vip] as IP,[port],[Environment],[Dbname]
  24. ,[Logical_Name],[Physical_Name],[SizeGB],[createtime]
  25. FROM qqDB.[dbo].[v_mssql_dblogsize]
  26. order by VIP,Dbname;
  27. """
  28. sql_rst_4, col_name = mssql_exec_sql.mssql_exec_select_sql_include_colnames(m_db_host, m_db_port, exec_sql_4)
  29. # print(sql_rst_4)
  30.  
  31. if len(sql_rst_4):
  32. mail_time = datetime.datetime.now().strftime('%Y-%m-%d %H:%M:%S')
  33. columns = []
  34. for i in range(len(col_name)):
  35. columns.append(col_name[i][0])
  36. df = pd.DataFrame(columns=columns)
  37. for i in range(len(sql_rst_4)):
  38. df.loc[i] = list(sql_rst_4[i])
  39. mail_body = df.to_html(index=False, justify="left").replace('<th>', '<th style = "color:red; text-align:left; background-color: yellow">')
  40. mail_html = "<html><body><h4>" + "Deal All : " + "<br><h4>" + "以下数据库的db log文件,已大于50G.请及时检查,谢谢! " + "<br><h4>" + mail_body + "</body></html>"
  41. send_monitor_mail.send_monitor_mail(mail_subject=mail_subject, mail_body=mail_html, mail_receivers=mail_receivers)
  42. mssql_alert_dblogsize()

 4.实现

定时任务是通过windows的计划任务来实现的,在此不做过多的叙述。告警邮件的部分截图如下:

5.附录

1.报错定位,判断是不是log文件过大

https://blog.csdn.net/weixin_30785593/article/details/99912405

2.关于为什么数据库log文件过大,我们可以参考以下分享的文章

https://blog.csdn.net/chinadm123/article/details/44941275

原文链接:http://www.cnblogs.com/xuliuzai/p/14659567.html

 友情链接:直通硅谷  点职佳  北美留学生论坛

本站QQ群:前端 618073944 | Java 606181507 | Python 626812652 | C/C++ 612253063 | 微信 634508462 | 苹果 692586424 | C#/.net 182808419 | PHP 305140648 | 运维 608723728

W3xue 的所有内容仅供测试,对任何法律问题及风险不承担任何责任。通过使用本站内容随之而来的风险与本站无关。
关于我们  |  意见建议  |  捐助我们  |  报错有奖  |  广告合作、友情链接(目前9元/月)请联系QQ:27243702 沸活量
皖ICP备17017327号-2 皖公网安备34020702000426号