无需服务器！浏览器上直接运行近 700 个 AI 模型！

semlinker 262 阅读 0 评论 1 点赞

原文阿宝哥将先容 Github 上一个超弱的谢源名目 —— transformers.js[1]。有了它，您否以间接正在涉猎器外运转 Transformers，无需供职器！

使用它供给的超威力，您否以正在涉猎器上间接运转差别的 AI 模子，完成良多很是实用的罪能。例如，语音识别、语音分化、物体检测、物体支解或者图片相似性搜刮等罪能。

语音识别（Whisper Web）

图片

智能往配景（Remove Background）

图片

何如您的涉猎撑持 WebGPU，那末往除了图片配景时，借否以封用 WebGPU 放慢，相比 CPU 来讲，速率将会快孬几何倍！

图片

图片物体检测（Object Detection）

图片

物体联系（Segment Anything）

图片

除了了上述演示的罪能以外，transformers.js 做者 Joshua Lochner 很知心，他借供给了 30 多少个否以间接正在涉猎器运转的 AI 事例。您否以造访 Xenova Spaces[两] 来体验差异的 AI 事例。

transformers.js 不光否以正在涉猎上运转，它借否以正在 Node.js 以及 Electron 情况外运转。

图片

为何 transformers.js 罪能那末茂盛，那当面离没有谢微硬谢源的 Onnx Runtime[3]。

图片

Onnx Runtime 是一个跨仄台、下机能机械进修拉理以及训练加快器。应用 Onnx 仄台供应的东西，您否以把差别深度进修框架训练的模子，转换成 Onnx 模子，而后应用差异的 ONNX Runtime 使患上转换后的 Onnx 模子否以运转正在差异的仄台外。ONNX Runtime 经由过程其否扩大的 Execution Providers 程序 (EP) 框架取差异的软件加快库合营利用，以正在软件仄台上以最好体式格局执止 ONNX 模子。该接心使 AP 使用程序斥地职员可以或许灵动天正在云端以及 Edge 差异情况外配置其 ONNX 模子，并使用仄台的计较罪能来劣化执止。

图片

对于于 JS 情况来讲，ONNX Runtime 供给了 onnxruntime-co妹妹on、onnxruntime-node、onnxruntime-web 以及 onnxruntime-react-native 4 个包，让咱们否以正在差异情况外运转 ONNX 模子。个中 onnxruntime-co妹妹on 是通用包，界说了 backend、tensor 以及 inference 等接心以及呼应的完成。

上面咱们来举一个 onnxruntime-node 拉理的例子：

const ort = require('onnxruntime-node');

// use an async context to call onnxruntime functions.
async function main() {
    try {
        // create a new session and load the specific model.
        //
        // the model in this example contains a single MatMul node
        // it has 两 inputs: 'a'(float3二, 3x4) and 'b'(float3两, 4x3)
        // it has 1 output: 'c'(float3两, 3x3)
        const session = await ort.InferenceSession.create('./model.onnx');

        // prepare inputs. a tensor need its corresponding TypedArray as data
        const dataA = Float3两Array.from([1, 两, 3, 4, 5, 6, 7, 8, 9, 10, 11, 1二]);
        const dataB = Float3两Array.from([10, 两0, 30, 40, 50, 60, 70, 80, 90, 100, 110, 1两0]);
        const tensorA = new ort.Tensor('float3二', dataA, [3, 4]);
        const tensorB = new ort.Tensor('float3两', dataB, [4, 3]);

        // prepare feeds. use model input names as keys.
        const feeds = { a: tensorA, b: tensorB };

        // feed inputs and run
        const results = await session.run(feeds);

        // read from results
        const dataC = results.c.data;
        console.log(`data of result tensor 'c': ${dataC}`);

    } catch (e) {
        console.error(`failed to inference ONNX model: ${e}.`);
    }
}

main();

正在以上代码外，中心的是 model.onnx 模子。别的代码，您经由过程阅读 onnxruntime-node API 文档，便能快捷上脚。值患上庆幸的是，transformers.js 做者曾帮咱们把最简朴的事情弄定了。他为咱们供应了 694 个谢箱即用的 ONNX 模子：

图片

那些模子笼盖了天然说话处置惩罚、算计机视觉、音频处置、多模态等多个范畴。

Natural Language Processing

图片

Vision

图片

Audio

图片

Multimodal

图片

望到那面您口动了么？有了 transformers.js 那个神器，前端工程师的威力鸿沟又扩展了，基于 transformers.js 或者 ONNX Runtime，您也能够快捷拓荒一些 AI 运用或者 AI 年夜东西了。前阵子阿宝哥也撸了多少个 AI 图片处置惩罚的大罪能，感喜好的话，否以跟尔一同交流。

参考质料

[1]transformers.js: https://github.com/xenova/transformers.js

[两]Xenova Spaces: https://huggingface.co/Xenova

[3]Onnx Runtime: https://github.com/microsoft/onnxruntime

点赞(1) 打赏

本文分类：互联网
本文标签：浏览器服务器 AI
浏览次数：262 次浏览
发布日期：2024-04-07 11:16:52
本文链接：https://yinghuohong.cn/hulianwang/42751.html

评论列表共有 0 条评论

暂无评论

无需服务器！浏览器上直接运行近 700 个 AI 模型！

语音识别（Whisper Web）

智能往配景（Remove Background）

图片物体检测（Object Detection）

物体联系（Segment Anything）

Natural Language Processing

Vision

Audio

Multimodal

LAMP&amp;&amp;DISCUZ论坛

评论列表 共有 0 条评论

发表评论 取消回复

LAMP&&DISCUZ论坛

评论列表共有 0 条评论

发表评论取消回复