HuggingFace: Using pipeline directly for specific NLP tasks

May 28, 2023, 1:38 PM

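By default, pipeline selects a particular pretrained model that has been fine-tuned for English sentiment analysis. The model is downloaded and cached when the classifier object is created. If you rerun the command, the cached model is used and does not need to be downloaded again.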

from transformers import pipeline

classifier = pipeline("sentiment-analysis")
classifier("I've been waiting for a HuggingFace course my whole life.")

Output: [{'label': 'POSITIVE', 'score': 0.9598047137260437}]
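The classifier returns a list with one dict per input sentence, each holding a label and a score. As a minimal sketch of how that result might be consumed, the snippet below keeps a label only when its score clears a threshold. The helper name confident_labels and the threshold values are assumptions made for illustration; the sample data is copied from the output above, so no model is needed to run it.

```python
# Sample result copied from the output above. When a list of sentences is
# passed in, the pipeline returns one dict per sentence, in the same order.
results = [{"label": "POSITIVE", "score": 0.9598047137260437}]


def confident_labels(results, threshold=0.9):
    """Return each result's label, or None when its score is below threshold.

    Hypothetical helper for illustration; not part of transformers.
    """
    return [r["label"] if r["score"] >= threshold else None for r in results]


print(confident_labels(results))        # ['POSITIVE']
print(confident_labels(results, 0.99))  # [None]
```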

from transformers import pipeline

classifier = pipeline("zero-shot-classification")
classifier(
    "This is a course about the Transformers library",
    candidate_labels=["education", "politics", "business", "course"],
)

Output:
{'sequence': 'This is a course about the Transformers library',
 'labels': ['course', 'education', 'business', 'politics'],
 'scores': [0.9461037516593933,
  0.04552055522799492,
  0.0060350666753947735,
  0.002340571256354451]}
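The zero-shot result keeps labels and scores in two parallel lists, sorted from most to least likely. A small sketch of pairing them up, using the values printed above (the variable names are illustrative only):

```python
# Result dict copied from the output above.
result = {
    "sequence": "This is a course about the Transformers library",
    "labels": ["course", "education", "business", "politics"],
    "scores": [
        0.9461037516593933,
        0.04552055522799492,
        0.0060350666753947735,
        0.002340571256354451,
    ],
}

# Pair every candidate label with its score.
ranked = list(zip(result["labels"], result["scores"]))

# Pick the highest-scoring pair explicitly instead of relying on the ordering.
best_label, best_score = max(ranked, key=lambda pair: pair[1])
print(best_label)  # course

# In this output the scores form a distribution over the candidate labels,
# so their sum is very close to 1.
total = sum(result["scores"])
print(abs(total - 1.0) < 1e-6)  # True
```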

from transformers import pipeline

generator = pipeline("text-generation")
generator("In this course, we will teach you how to", max_length=20, num_return_sequences=3)

Output:
[{'generated_text': 'In this course, we will teach you how to manipulate some of the basics of working with and analyzing'},
 {'generated_text': 'In this course, we will teach you how to create a database structure and its parameters by writing a'},
 {'generated_text': 'In this course, we will teach you how to read and write with great care to make sure your'}]
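Each generated_text above repeats the prompt before the newly generated words. If only the continuation is wanted, one option is to strip the prompt afterwards, as in this sketch. The helper name continuation is hypothetical, and the data is copied from the output above. The text-generation pipeline also accepts return_full_text=False, which should have a similar effect at call time.

```python
prompt = "In this course, we will teach you how to"

# Output copied from the article.
outputs = [
    {"generated_text": "In this course, we will teach you how to manipulate some of the basics of working with and analyzing"},
    {"generated_text": "In this course, we will teach you how to create a database structure and its parameters by writing a"},
    {"generated_text": "In this course, we will teach you how to read and write with great care to make sure your"},
]


def continuation(prompt, generated_text):
    """Return only the text generated after the prompt (illustrative helper)."""
    if generated_text.startswith(prompt):
        return generated_text[len(prompt):].lstrip()
    # The prompt was not echoed back, so return the text unchanged.
    return generated_text


for item in outputs:
    print(continuation(prompt, item["generated_text"]))
```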

For the task you need, search for a suitable model on the Hugging Face website and pass its name to the model parameter.

from transformers import pipeline

generator = pipeline("text-generation", model="distilgpt2")
generator(
    "In this course, we will teach you how to",
    max_length=30,
    num_return_sequences=2,
)

Output:
[{'generated_text': 'In this course, we will teach you how to use a small amount of JavaScript and JavaScript in combination with more advanced Javascript and JavaScript.'},
 {'generated_text': 'In this course, we will teach you how to use the best approach of the class: use the principles of a basic programming language like C++ and'}]
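One way to avoid depending on whichever default checkpoint a task happens to use is to keep the chosen model names in one place. The sketch below is only an illustration: MODEL_FOR_TASK, pipeline_args, and make_pipeline are hypothetical names, and the two model IDs are the ones used in this article.

```python
# Checkpoints named in this article; any other model ID from the
# Hugging Face website could be listed here instead.
MODEL_FOR_TASK = {
    "text-generation": "distilgpt2",
    "translation": "Helsinki-NLP/opus-mt-zh-en",
}


def pipeline_args(task):
    """Build keyword arguments for pipeline(); add model= only when pinned."""
    args = {"task": task}
    if task in MODEL_FOR_TASK:
        args["model"] = MODEL_FOR_TASK[task]
    return args


def make_pipeline(task):
    """Create the pipeline. The import is deferred so the helpers above
    can be used without loading transformers or downloading a model."""
    from transformers import pipeline

    return pipeline(**pipeline_args(task))


print(pipeline_args("text-generation"))  # {'task': 'text-generation', 'model': 'distilgpt2'}
print(pipeline_args("fill-mask"))        # {'task': 'fill-mask'}
```

Calling make_pipeline("text-generation") would then be equivalent to pipeline("text-generation", model="distilgpt2").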

The top_k parameter controls how many candidate completions are returned.

from transformers import pipeline

unmasker = pipeline("fill-mask")
unmasker("This course will teach you all about <mask> models.", top_k=3)

Output:
[{'sequence': 'This course will teach you all about mathematical models.',
  'score': 0.1961982101202011,
  'token': 30412,
  'token_str': ' mathematical'},
 {'sequence': 'This course will teach you all about computational models.',
  'score': 0.04052715376019478,
  'token': 38163,
  'token_str': ' computational'},
 {'sequence': 'This course will teach you all about predictive models.',
  'score': 0.03301785886287689,
  'token': 27930,
  'token_str': ' predictive'}]
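With top_k=3 only the three most likely fillers come back, so the returned scores cover just part of the probability mass. A short sketch, using the predictions printed above, that lists the candidate words and adds up the scores they account for:

```python
# Predictions copied from the output above.
predictions = [
    {"sequence": "This course will teach you all about mathematical models.",
     "score": 0.1961982101202011, "token": 30412, "token_str": " mathematical"},
    {"sequence": "This course will teach you all about computational models.",
     "score": 0.04052715376019478, "token": 38163, "token_str": " computational"},
    {"sequence": "This course will teach you all about predictive models.",
     "score": 0.03301785886287689, "token": 27930, "token_str": " predictive"},
]

# token_str carries a leading space in this output, so strip it for display.
words = [p["token_str"].strip() for p in predictions]
print(words)  # ['mathematical', 'computational', 'predictive']

# Probability mass covered by the returned candidates.
covered = sum(p["score"] for p in predictions)
print(f"{covered:.3f}")  # 0.270
```

Raising top_k would return more candidates and cover more of the remaining mass.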

from transformers import pipeline

ner = pipeline("ner", grouped_entities=True)
ner("My name is Sylvain and I work at Hugging Face in Brooklyn.")

Output:
[{'entity_group': 'PER',
  'score': 0.9981693774461746,
  'word': 'Sylvain',
  'start': 11,
  'end': 18},
 {'entity_group': 'ORG',
  'score': 0.9796019991238912,
  'word': 'Hugging Face',
  'start': 33,
  'end': 45},
 {'entity_group': 'LOC',
  'score': 0.9932105541229248,
  'word': 'Brooklyn',
  'start': 49,
  'end': 57}]
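Because the entities are already grouped into whole words, they are easy to collect by type. A minimal sketch over the output above (the variable names are illustrative):

```python
from collections import defaultdict

# Entities copied from the output above.
entities = [
    {"entity_group": "PER", "score": 0.9981693774461746, "word": "Sylvain", "start": 11, "end": 18},
    {"entity_group": "ORG", "score": 0.9796019991238912, "word": "Hugging Face", "start": 33, "end": 45},
    {"entity_group": "LOC", "score": 0.9932105541229248, "word": "Brooklyn", "start": 49, "end": 57},
]

# Map each entity type (PER, ORG, LOC) to the words found for it.
by_type = defaultdict(list)
for entity in entities:
    by_type[entity["entity_group"]].append(entity["word"])

print(dict(by_type))
# {'PER': ['Sylvain'], 'ORG': ['Hugging Face'], 'LOC': ['Brooklyn']}
```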

Note that this pipeline works by extracting information from the provided context; it does not generate the answer.

from transformers import pipeline

question_answerer = pipeline("question-answering")
question_answerer(
    question="Where do I work?",
    context="My name is Sylvain and I work at Hugging Face in Brooklyn",
)

Output:
{'score': 0.6949757933616638, 'start': 33, 'end': 45, 'answer': 'Hugging Face'}
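The extractive behaviour can be checked directly: start and end are character offsets into the context, and slicing the context with them reproduces the answer. A short sketch using the values from the output above:

```python
context = "My name is Sylvain and I work at Hugging Face in Brooklyn"

# Result copied from the output above.
result = {"score": 0.6949757933616638, "start": 33, "end": 45, "answer": "Hugging Face"}

# The answer is a span of the context, not newly generated text.
span = context[result["start"]:result["end"]]
print(span)                      # Hugging Face
print(span == result["answer"])  # True
```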

As with text generation, you can specify max_length or min_length for the result.

from transformers import pipeline

summarizer = pipeline("summarization")
summarizer(
"""
    America has changed dramatically during recent years. Not only has the number of
    graduates in traditional engineering disciplines such as mechanical, civil,
    electrical, chemical, and aeronautical engineering declined, but in most of
    the premier American universities engineering curricula now concentrate on
    and encourage largely the study of engineering science. As a result, there
    are declining offerings in engineering subjects dealing with infrastructure,
    the environment, and related issues, and greater concentration on high
    technology subjects, largely supporting increasingly complex scientific
    developments. While the latter is important, it should not be at the expense
    of more traditional engineering.

    Rapidly developing economies such as China and India, as well as other
    industrial countries in Europe and Asia, continue to encourage and advance
    the teaching of engineering. Both China and India, respectively, graduate
    six and eight times as many traditional engineers as does the United States.

    Other industrial countries at minimum maintain their output, while America
    suffers an increasingly serious decline in the number of engineering graduates
    and a lack of well-educated engineers.

"""
)

Output:
[{'summary_text': ' America has changed dramatically during recent years . The number of engineering graduates in the U.S. has declined in traditional engineering disciplines such as mechanical, civil,    electrical, chemical, and aeronautical engineering . Rapidly developing economies such as China and India continue to encourage and advance the teaching of engineering .'}]
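The summary above carries some stray whitespace: a leading space, a space before each full stop, and a run of spaces inherited from the indented input. If clean text is needed, a small post-processing step is one option. The helper name tidy is hypothetical, and the sample is only the first two sentences of the summary above.

```python
import re

# First two sentences of the summary_text printed above, whitespace included.
raw = (
    " America has changed dramatically during recent years . "
    "The number of engineering graduates in the U.S. has declined in "
    "traditional engineering disciplines such as mechanical, civil,    "
    "electrical, chemical, and aeronautical engineering ."
)


def tidy(text):
    """Collapse whitespace runs and drop spaces before '.' or ',' (illustrative)."""
    text = " ".join(text.split())             # trim ends, collapse inner runs
    return re.sub(r"\s+([.,])", r"\1", text)  # "years ." becomes "years."


cleaned = tidy(raw)
print(cleaned)
```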

from transformers import pipeline

translator = pipeline("translation", model="Helsinki-NLP/opus-mt-zh-en")
translator("今天真是开心的一天！")

Output:
[{'translation_text': "It's been a happy day!"}]

The pipelines shown so far are mainly for demonstration purposes. Each one was built for a specific task and cannot perform other tasks.

Next time we will look at what is inside the pipeline() function and how to customize its behavior.

Original: https://blog.csdn.net/m0_50896529/article/details/121794421
Author: 郑不凡
Title: HuggingFace: Using pipeline directly for specific NLP tasks

