python+rsync精确同步指定格式文件
作者:gcogle 时间:2023-09-18 06:51:26
本文实例为大家分享了python+rsync精确同步指定格式文件的具体代码,供大家参考,具体内容如下
# coding: utf-8
#!/usr/bin/env python
'''
updatedb更新本地服务器指定目录/home/upload/de locate.dbz数据库
然后locate命令正则查找符合条件的文件并将文件名输出到locate.src
通过paramiko模块得到远程服务器符合条件的文件并将文件名输出到locate.dst
比较这两个文件得到locate.diff,最好rsync命令同步locate.diff列表中的文件
到远程服务器
'''
import paramiko
import commands
import os
import logging
logging.basicConfig(level=logging.DEBUG,
format='%(asctime)s %(filename)s[line:%(lineno)d] %(levelname)s %(message)s',
datefmt='%Y-%m-%d %H:%M:%S',
filename='/var/log/upload_to_chengdu.log',
filemode='a')
logging.info("###################################################################")
hostname = '8.8.8.8'
username = 'username'
password = 'password'
port = 22
locate_lists = ['/tmp/locate.src', '/tmp/locate.dst', '/tmp/locate.diff']
dict_path = {'locate_lists': locate_lists, 'dst_db': '/tmp/locate.db',
'src_db': '/tmp/locate.db', 'dst_path': '/logs/rsync_log/',
'src_path': '/home/upload/', 'passwd_file': '/etc/rsync.pas'}
def check_file():
for l in dict_path['locate_lists']:
if os.path.exists(l):
commands.getstatusoutput('sudo rm %s' % l)
def get_dst_list():
s = paramiko.SSHClient()
s.set_missing_host_key_policy(paramiko.AutoAddPolicy())
s.connect(hostname=hostname, port=port, username=username, password=password)
cmd_create = "sudo updatedb -U %s -o %s && sudo locate -d %s --regex '.*[0-9]{4}([-]{0,1}[0-9]{2}){2}\..*gz$' >>%s" % (
dict_path['dst_path'], dict_path['dst_db'], dict_path['dst_db'], dict_path['locate_lists'][1])
cmd_delete = 'sudo rm ' + dict_path['locate_lists'][1]
stdin, stdout, stderr = s.exec_command(cmd_create)
# print stdout.read()
try:
t = paramiko.Transport((hostname, port))
t.connect(username=username, password=password)
sftp = paramiko.SFTPClient.from_transport(t)
sftp.get(dict_path['locate_lists'][1], dict_path['locate_lists'][1])
t.close()
except Exception, e:
print e
stdin, stdout, stderr = s.exec_command(cmd_delete)
s.close()
def get_src_list():
cmd = "sudo updatedb -U %s -o %s && sudo locate -d %s --regex '.*[0-9]{4}([-]{0,1}[0-9]{2}){2}\..*gz$' >>%s" % (
dict_path['src_path'], dict_path['src_db'], dict_path['src_db'], dict_path['locate_lists'][0])
commands.getstatusoutput(cmd)
def cmp_diff():
f1 = open(dict_path['locate_lists'][0], 'r')
f2 = open(dict_path['locate_lists'][1], 'r')
f3 = open(dict_path['locate_lists'][2], 'a')
x = f1.readlines()
y = f2.readlines()
f1.close()
f2.close()
for i in x:
j = dict_path['dst_path'] + i[13:]
if j not in y:
f3.writelines(i)
f3.close()
def rsync_file():
f = open(dict_path['locate_lists'][2], 'r')
for i in f.readlines():
cmd = 'cd %s && sudo rsync -Rvz %s upload@%s::aliyun_log --password-file=%s' % (
dict_path['src_path'], i.strip('\n')[13:], hostname, dict_path['passwd_file'])
logging.info(cmd)
commands.getstatusoutput(cmd)
f.close()
def del_locate_file():
cmd = 'sudo rm {%s,%s,%s}' % (dict_path['locate_lists'][1], dict_path['locate_lists'][0], dict_path['locate_lists'][2])
commands.getstatusoutput(cmd)
if __name__ == '__main__':
check_file()
get_dst_list()
get_src_list()
cmp_diff()
rsync_file()
del_locate_file()
来源:https://blog.csdn.net/gcogle/article/details/52767113
标签:python,rsync,同步文件
0
投稿
猜你喜欢
SQL Server 2012使用Offset/Fetch Next实现分页数据查询
2024-01-25 03:54:55
详解python调度框架APScheduler使用
2021-11-05 22:55:36
如何用idea数据库编写快递e站
2024-01-23 08:43:56
基于Python编写一个简单的端口扫描器
2021-09-19 09:21:22
详解js前端代码异常监控
2024-04-22 13:06:26
python设置代理和添加镜像源的方法
2022-12-03 10:26:39
详解Python Matplotlib解决绘图X轴值不按数组排序问题
2022-12-03 09:33:06
Python中用Spark模块的使用教程
2021-10-02 13:11:40
Joomla开启SEF的方法
2024-04-30 08:47:39
Javascript 构造函数 实例分析
2024-04-23 09:25:45
pycharm自动生成文件注释和函数注释
2023-10-07 04:17:18
python实现requests发送/上传多个文件的示例
2023-07-23 01:42:39
微软工程师讲解SQL server阻塞
2008-01-05 14:02:00
分享Pycharm中一些不为人知的技巧
2023-10-04 04:52:54
Python3+Pygame实现射击游戏完整代码
2022-10-14 00:15:02
Christopher Schmitt 谈学习CSS的益处
2008-07-13 14:15:00
python实现二叉树的遍历
2023-06-29 17:45:42
Python单链表的简单实现方法
2021-08-14 01:58:33
Oracle入侵常用操作命令整理
2009-03-04 11:11:00
TensorFlow中权重的随机初始化的方法
2023-01-02 06:11:32