armbian docker Chrome_Linux+chrome+selenium浏览器截图

编程

更新时间：2024-09-0952

admin管理员组
文章数量:1122849

软件安装

（1）根据操作系统安装对应的chrome浏览器软件和chrome driver驱动

（⚠️浏览器的版本和驱动的版本要对应上）驱动地址：https://chromedriver.chromium/downloads

selenium打开浏览器

from selenium import webdriver
#设置useragent
UA ='Mozilla/5.0 (Win; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/53.0.2785.116 Safari/537.36'
#添加设备的UA和浏览器的长宽，分辨率
mobile_Emulation = {"deviceMetrics": {"width": WIDTH, "height": HEIGHT, "pixelRatio": PIXEL_RATIO},"userAgent": UA}
#虚拟界面
options = webdriver.ChromeOptions()
#添加UA
options.add_experimental_option('mobileEmulation', mobile_Emulation)
#headless无界面打开浏览器，如果使用自定义插件则需要head
# options.add_argument('--headless')
#不使用gpu
options.add_argument('--disable-gpu')
#不启动沙盒
options.add_argument('--no-sandbox')
#docker中没办法启动较大的页面，
options.add_argument('--disable-dev-shm-usage')
#指定cache的存放路径
options.add_argument('--disk-cache-dir=./chrome_cache')
#添加自定义插件，讲platform转成win/mac（这里逻辑是UA='Win',platform='Win';UA='Mac',platform='mac',UA='iphone',platform='iphone'）
options.add_extension('./platform2Mac/platformChange.crx')
driver_path = './chromedriver83'
driver = webdriver.Chrome(executable_path=driver_path,chrome_options=options)
driver.get(url)
#打印浏览器的platform和页面的html
print(driver.execute_script("return navigator.platform"))
print(driver.page_source)
#将浏览器第一屏截图，保存为jpg
driver.get_screenshot_as_file('*.jpg')
driver.close()

tips：

options.add_argument('--headless')，无界面打开浏览器，在linux中会使用这种情况。

linux中也可以使用有界面，则需要给linux添加虚拟界面

下载安装xvfb虚拟界面 sudo apt-get install xvfb pip install pyvirtualdisplay

from pyvirtualdisplay import Display
display = Display(visible=0, size=(1600, 900))
display.start()
#进行以上打开浏览器截图等操作
display.stop()

但由于项目需要多线程操作打开浏览器时，display.stop()无法关闭Xvfb进程号，导致进程堆积。解决办法：启动脚本时打开虚拟界面，脚本结束虚拟界面关闭。

xvfb-run python3 **.py

2.options.add_argument('--no-sandbox')

如果出现chrome的版本对应不上的报错信息，检查是否没有添加沙盒关闭。

3.options.add_argument('--disk-cache-dir=./chrome_cache')

chrome打开会加载很多的cache，如果在docker中运行，会占满docker的内存，导致脚本无法启动，此时可以将chache挂载出来。

docker中打开的话，检查一下 /var/cache/fontconfig。会存放很多浏览器打开页面的字体缓存，及时清理，不然会影响脚本的启动。

docker中的/tmp也需要关注一下。

4.options.add_extension('./platform2Mac/platformChange.crx')

最头痛的就是这个给浏览器添加插件问题，此处解决的问题是，有些html会判断浏览器的platform是PC还是ios，区别展示html界面，这种就需要将linux上的浏览器模拟成手机端的浏览器，只改Useragent是无法解决问题的，需要在打开浏览器的时候加载插件。

chrome浏览器自定义插件-platform

1.需要两个文件

2.在chrome浏览器加载有这两个文件的目录，则会加载进浏览器这个插件

3.python代码中使用的时候需要打包成crk的文件，将刚才目录进行打包，生成相应的*.crk文件

4.文件代码

contentscript.js

// see https://stackoverflow/questions/23202136/changing-navigator-useragent-using-chrome-extension
var actualCode =  '(' + function() {
    // Get the correct platform for navigator.userAgent 
    var candidates = ["Mac","Win","Linux","Android", "iPhone", "iPod", "iPad"];
    var modifiedPlatform;
    for(var maybe of candidates) {
        if(navigator.userAgent.match(maybe)) {
            modifiedPlatform = maybe;
            break;
        }
    }

    if(!modifiedPlatform) {
        return;
    }

    // Reset platform
    Object.defineProperty(navigator, 'platform', {
        get: function(){
            return modifiedPlatform;
        }});
} + ')();';

// Inject custom js
var s = document.createElement('script');
s.textContent = actualCode;
document.documentElement.appendChild(s);
s.remove();

maniftest.json

{
    "name": "Simulate Device Hack",
    "version": "1.0",
    "description": "Modify the navigator.platform while simulating a mobile viewport",
    "manifest_version": 2,
    "content_scripts": [
        {
            "matches": ["<all_urls>"],
            "js": ["contentscript.js"],
            "run_at": "document_start"
        }
    ]
  }

本文标签：截图浏览器 docker Armbian ChromeLinux

版权声明：本文标题：armbian docker Chrome_Linux+chrome+selenium浏览器截图内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/biancheng/1725889778a1025513.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。