NLP - Pretrained Models - 2019 - NLU+NLG: T5 [A Large-Scale Exploration of Text-to-Text Pretraining] [Fine-tuning T5 for Text Summarization]

Original paper: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

In October 2019, Google introduced T5 (Text-To-Text Transfer Transformer) in the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer". Its largest variant has 11 billion parameters, far surpassing BERT-Large, and it reached state-of-the-art results on many NLP tasks. Some have described it as a model that probes the limits of transfer learning.

Admittedly, the most striking thing about it is the brute-force scale, bigger and bigger, but working through the 34-page paper, the analysis is clearly done in earnest (and at great expense). There have been a few large-scale empirical studies of this kind: first propose a general framework, then run extensive controlled comparisons, distill a set of recommended settings, and end up with a very strong baseline. Later experiments in this area can then simply start from those settings.

The T5 paper, Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer, is exactly this kind of work. Its significance lies neither in how much compute was burned nor in how many leaderboards it topped (money can buy that), and the core idea is not especially novel. Its most important contribution is giving the whole field of NLP pretraining a unified framework that casts every task in the same form, just as the paper puts it:

introducing a unified framework that converts every language problem into a text-to-text format.

In future NLP work, the job may no longer be fiddling with model architectures. Whatever the task, you take a huge pretrained model off the shelf, and the main effort becomes converting the task into a suitable text input and output format, which turns us into "data scientists" in quotation marks. A single model can then serve many tasks, and it distinguishes between them only by the way you phrase its inputs and outputs. This is reminiscent of a direction Jeff Dean has mentioned for Google: one super-model that can handle any task directly, possibly sparse internally, or locally distilled to serve individual tasks.
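A minimal sketch of this "everything is text-to-text" idea is shown below: the same t5-base checkpoint is steered to different tasks purely by the textual prefix of the input. The three prefixes are the ones listed in the T5 paper; the local model path is the one used later in this post, and greedy decoding with default settings is assumed.

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained(r'D:\Pretrained_Model\t5-base')
model = AutoModelForSeq2SeqLM.from_pretrained(r'D:\Pretrained_Model\t5-base')

# The task is selected only by the text prefix; the model and weights are unchanged.
prompts = [
    "translate English to German: The house is wonderful.",   # translation
    "cola sentence: The course is jumping well.",              # grammatical acceptability
    "summarize: state authorities dispatched emergency crews tuesday to survey the damage after an onslaught of severe weather in mississippi.",  # summarization
]

for prompt in prompts:
    input_ids = tokenizer(prompt, return_tensors='pt').input_ids
    output_ids = model.generate(input_ids)
    print(tokenizer.decode(output_ids[0], skip_special_tokens=True))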

2. Using the pretrained T5 model directly for text summarization

Option 1: from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# https://github.com/huggingface/transformers/blob/master/src/transformers/models/t5/modeling_t5.py
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained(r'D:\Pretrained_Model\t5-base')
model = AutoModelForSeq2SeqLM.from_pretrained(r'D:\Pretrained_Model\t5-base')

# Summarization with T5: prepend the "summarize:" task prefix to the input text
text = """
        summarize: (CNN)For the second time during his papacy, Pope Francis has announced a new group of bishops and archbishops set to become cardinals -- and they come from all over the world.
        Pope Francis said Sunday that he would hold a meeting of cardinals on February 14 "during which I will name 15 new Cardinals who, coming from 13 countries from every continent, manifest the indissoluble links between the Church of Rome and the particular Churches present in the world," according to Vatican Radio.
        New cardinals are always important because they set the tone in the church and also elect the next pope, CNN Senior Vatican Analyst John L. Allen said. They are sometimes referred to as the princes of the Catholic Church.
        The new cardinals come from countries such as Ethiopia, New Zealand and Myanmar.
        "This is a pope who very much wants to reach out to people on the margins, and you clearly see that in this set," Allen said. "You're talking about cardinals from typically overlooked places, like Cape Verde, the Pacific island of Tonga, Panama, Thailand, Uruguay."
        But for the second time since Francis' election, no Americans made the list.
        "Francis' pattern is very clear: He wants to go to the geographical peripheries rather than places that are already top-heavy with cardinals," Allen said.
        Christopher Bellitto, a professor of church history at Kean University in New Jersey, noted that Francis announced his new slate of cardinals on the Catholic Feast of the Epiphany, which commemorates the visit of the Magi to Jesus' birthplace in Bethlehem.
        "On feast of three wise men from far away, the Pope's choices for cardinal say that every local church deserves a place at the big table."
        In other words, Francis wants a more decentralized church and wants to hear reform ideas from small communities that sit far from Catholicism's power centers, Bellitto said.
        That doesn't mean Francis is the first pontiff to appoint cardinals from the developing world, though. Beginning in the 1920s, an increasing number of Latin American churchmen were named cardinals, and in the 1960s, St. John XXIII, whom Francis canonized last year, appointed the first cardinals from Japan, the Philippines and Africa.
        In addition to the 15 new cardinals Francis named on Sunday, five retired archbishops and bishops will also be honored as cardinals.
        Last year, Pope Francis appointed 19 new cardinals, including bishops from Haiti and Burkina Faso.
        CNN's Daniel Burke and Christabelle Fombu contributed to this report.
"""
# CNN/DM reference summary (highlights):
# @highlight
# The 15 new cardinals will be installed on February 14
# @highlight
# They come from countries such as Myanmar and Tonga
# @highlight
# No Americans made the list this time or the previous time in Francis' papacy

inputs = tokenizer(text, max_length=1024, truncation=True, return_tensors='pt')

print('inputs = ', inputs)

summary_ids = model.generate(inputs['input_ids'])

print('\nsummary_ids = ', summary_ids)

print([tokenizer.decode(g, skip_special_tokens=True, clean_up_tokenization_spaces=False) for g in summary_ids])
print(tokenizer.batch_decode(summary_ids, skip_special_tokens=True, clean_up_tokenization_spaces=False))

Output:

Ignored unknown kwarg option direction
inputs =  {'input_ids': tensor([[21603,    10,    41,   254, 17235,    61,  3809,     8,   511,    97,
           383,   112,     3, 16281,  4710,     6, 17384, 11065,    65,  2162,
             3,     9,   126,   563,    13, 25214,     7,    11, 11508, 11514,
         10776,     7,   356,    12,   582,   895, 10270,     7,  1636,    11,
            79,   369,    45,    66,   147,     8,   296,     5, 17384, 11065,
           243,  1771,    24,     3,    88,   133,  1520,     3,     9,  1338,
            13,   895, 10270,     7,    30,  2083,   968,    96,    26,  7920,
            84,    27,    56,   564,   627,   126, 21967,     7,   113,     6,
          1107,    45,  1179,  1440,    45,   334, 10829,     6,  6571,     8,
             3,  8482,     7, 26175,  2416,   344,     8,  2345,    13,  7332,
            11,     8,  1090,  2345,    15,     7,   915,    16,     8,   296,
           976,  1315,    12, 25770,  5061,     5,   368,   895, 10270,     7,
            33,   373,   359,   250,    79,   356,     8,  5739,    16,     8,
          2078,    11,    92, 11924,     8,   416,  2783,    15,     6, 19602,
          5523, 25770, 25224,  1079,   301,     5, 10618,   243,     5,   328,
            33,  1664,     3,  4822,    12,    38,     8, 22277,     7,    13,
             8,  6502,  2345,     5,    37,   126,   895, 10270,     7,   369,
            45,  1440,   224,    38, 22138,     6,   368,  5725,    11, 27274,
             5,    96,  3713,    19,     3,     9,  2783,    15,   113,   182,
           231,  2746,    12,  1535,    91,    12,   151,    30,     8,  6346,
             7,     6,    11,    25,  3133,   217,    24,    16,    48,   356,
           976, 10618,   243,     5,    96,  3774,    31,    60,  2508,    81,
           895, 10270,     7,    45,  3115, 20633,  1747,     6,   114,  9702,
           781,   221,     6,     8,  5824,  3368,    13,   304,  1725,     9,
             6, 21099,     6, 10508,     6, 30758,   535,   299,    21,     8,
           511,    97,   437, 11065,    31,  4356,     6,   150,  5452,   263,
             8,   570,     5,    96,   371,    52, 11389,     7,    31,  3275,
            19,   182,   964,    10,   216,  2746,    12,   281,    12,     8,
         20187,   158,  5082,    88,  2593,  1066,   145,  1747,    24,    33,
           641,   420,    18,    88, 19649,    28,   895, 10270,     7,   976,
         10618,   243,     5, 14702,  5377,   155,   235,     6,     3,     9,
          5812,    13,  2078,   892,    44,  2566,   152,   636,    16,   368,
          5092,     6,  4466,    24, 11065,  2162,   112,   126, 21079,    13,
           895, 10270,     7,    30,     8,  6502,   377, 11535,    13,     8,
         12741,  8237,    63,     6,    84, 18681,    15,     7,     8,   719,
            13,     8, 22673,    12,  1850,    31,  3879,  4687,    16, 15659,
           109,  6015,     5,    96,  7638, 18886,    13,   386,  7624,  1076,
            45,   623,   550,     6,     8, 17384,    31,     7,  3703,    21,
           895, 10270,   497,    24,   334,   415,  2078, 15314,     3,     9,
           286,    44,     8,   600,   953,   535,    86,   119,  1234,     6,
         11065,  2746,     3,     9,    72,    20, 21411,  2078,    11,  2746,
            12,  1616,  5139,   912,    45,   422,  2597,    24,  2561,   623,
            45,  6502,   159,    51,    31,     7,   579,  6881,     6,  5377,
           155,   235,   243,     5,   466,   744,    31,    17,  1243, 11065,
            19,     8,   166, 19068,  5982,    12,     3,     9,   102,  2700,
           895, 10270,     7,    45,     8,  2421,   296,     6,   713,     5,
         22738,    16,     8, 13978,     7,     6,    46,  3094,   381,    13,
          6271,   797,  2078,   904,   130,  2650,   895, 10270,     7,     6,
            11,    16,     8,  8754,     7,     6,   472,     5,  1079,     3,
             4,     4, 13671,     6,  4068, 11065,    54,   106,  1601,   336,
           215,     6,  7817,     8,   166,   895, 10270,     7,    45,  3411,
             6,     8, 12729,    11,  2648,     5,    86,   811,    12,     8,
           627,   126,   895, 10270,     7, 11065,  2650,    30,  1771,     6,
           874, 10611, 11508, 11514, 10776,     7,    11, 25214,     7,    56,
            92,    36, 13242,    38,   895, 10270,     7,     5,  2506,   215,
             6, 17384, 11065,  7817,   957,   126,   895, 10270,     7,     6,
           379, 25214,     7,    45, 22179,    11,  4152,  2917,     9,  1699,
             7,    32,     5, 19602,    31,     7,  4173, 27575,    11,  2144,
         10333,   109,   377,  8038,    76,  9859,    12,    48,   934,     5,
             1]]), 'attention_mask': tensor([[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]])}

summary_ids =  tensor([[    0,   126,   895, 10270,     7,   369,    45,  1179,  1440,    45,
           334, 10829,     3,     5,    79,    33,  1664,     3,  4822,    12]])

['new cardinals come from 13 countries from every continent . they are sometimes referred to']
['new cardinals come from 13 countries from every continent . they are sometimes referred to']

Process finished with exit code 0
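Note that the generated summary stops after 20 tokens ("... they are sometimes referred to"): nothing in the call above sets a generation length, so generate() falls back to its default max_length of 20. A hedged sketch of passing explicit decoding arguments is shown below; the particular values (beam size, length limits) are illustrative choices, not settings prescribed by the paper.

# Continuing from the code above: request a longer, beam-searched summary.
summary_ids = model.generate(
    inputs['input_ids'],
    num_beams=4,          # beam search instead of greedy decoding
    min_length=30,        # require more than the 20 tokens produced by default
    max_length=100,
    early_stopping=True,
)
print(tokenizer.batch_decode(summary_ids, skip_special_tokens=True))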

Option 2: from transformers import T5Tokenizer, T5ForConditionalGeneration

# https://github.com/huggingface/transformers/blob/master/src/transformers/models/t5/modeling_t5.py
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained(r'D:\Pretrained_Model\t5-base')
model = T5ForConditionalGeneration.from_pretrained(r'D:\Pretrained_Model\t5-base')

# Note: unlike Option 1, the "summarize:" prefix is omitted from the input here.
text = """
         (CNN)For the second time during his papacy, Pope Francis has announced a new group of bishops and archbishops set to become cardinals -- and they come from all over the world.
        Pope Francis said Sunday that he would hold a meeting of cardinals on February 14 "during which I will name 15 new Cardinals who, coming from 13 countries from every continent, manifest the indissoluble links between the Church of Rome and the particular Churches present in the world," according to Vatican Radio.
        New cardinals are always important because they set the tone in the church and also elect the next pope, CNN Senior Vatican Analyst John L. Allen said. They are sometimes referred to as the princes of the Catholic Church.
        The new cardinals come from countries such as Ethiopia, New Zealand and Myanmar.
        "This is a pope who very much wants to reach out to people on the margins, and you clearly see that in this set," Allen said. "You're talking about cardinals from typically overlooked places, like Cape Verde, the Pacific island of Tonga, Panama, Thailand, Uruguay."
        But for the second time since Francis' election, no Americans made the list.
        "Francis' pattern is very clear: He wants to go to the geographical peripheries rather than places that are already top-heavy with cardinals," Allen said.
        Christopher Bellitto, a professor of church history at Kean University in New Jersey, noted that Francis announced his new slate of cardinals on the Catholic Feast of the Epiphany, which commemorates the visit of the Magi to Jesus' birthplace in Bethlehem.
        "On feast of three wise men from far away, the Pope's choices for cardinal say that every local church deserves a place at the big table."
        In other words, Francis wants a more decentralized church and wants to hear reform ideas from small communities that sit far from Catholicism's power centers, Bellitto said.
        That doesn't mean Francis is the first pontiff to appoint cardinals from the developing world, though. Beginning in the 1920s, an increasing number of Latin American churchmen were named cardinals, and in the 1960s, St. John XXIII, whom Francis canonized last year, appointed the first cardinals from Japan, the Philippines and Africa.
        In addition to the 15 new cardinals Francis named on Sunday, five retired archbishops and bishops will also be honored as cardinals.
        Last year, Pope Francis appointed 19 new cardinals, including bishops from Haiti and Burkina Faso.
        CNN's Daniel Burke and Christabelle Fombu contributed to this report.
"""
# CNN/DM reference summary (highlights):
# @highlight
# The 15 new cardinals will be installed on February 14
# @highlight
# They come from countries such as Myanmar and Tonga
# @highlight
# No Americans made the list this time or the previous time in Francis' papacy

inputs = tokenizer(text, max_length=1024, truncation=True, return_tensors='pt')

print('inputs = ', inputs)

summary_ids = model.generate(inputs['input_ids'])

print('\nsummary_ids = ', summary_ids)

print([tokenizer.decode(g, skip_special_tokens=True, clean_up_tokenization_spaces=False) for g in summary_ids])
print(tokenizer.batch_decode(summary_ids, skip_special_tokens=True, clean_up_tokenization_spaces=False))

Output:

inputs =  {'input_ids': tensor([[   41,   254, 17235,    61,  3809,     8,   511,    97,   383,   112,
             3, 16281,  4710,     6, 17384, 11065,    65,  2162,     3,     9,
           126,   563,    13, 25214,     7,    11, 11508, 11514, 10776,     7,
           356,    12,   582,   895, 10270,     7,  1636,    11,    79,   369,
            45,    66,   147,     8,   296,     5, 17384, 11065,   243,  1771,
            24,     3,    88,   133,  1520,     3,     9,  1338,    13,   895,
         10270,     7,    30,  2083,   968,    96,    26,  7920,    84,    27,
            56,   564,   627,   126, 21967,     7,   113,     6,  1107,    45,
          1179,  1440,    45,   334, 10829,     6,  6571,     8,     3,  8482,
             7, 26175,  2416,   344,     8,  2345,    13,  7332,    11,     8,
          1090,  2345,    15,     7,   915,    16,     8,   296,   976,  1315,
            12, 25770,  5061,     5,   368,   895, 10270,     7,    33,   373,
           359,   250,    79,   356,     8,  5739,    16,     8,  2078,    11,
            92, 11924,     8,   416,  2783,    15,     6, 19602,  5523, 25770,
         25224,  1079,   301,     5, 10618,   243,     5,   328,    33,  1664,
             3,  4822,    12,    38,     8, 22277,     7,    13,     8,  6502,
          2345,     5,    37,   126,   895, 10270,     7,   369,    45,  1440,
           224,    38, 22138,     6,   368,  5725,    11, 27274,     5,    96,
          3713,    19,     3,     9,  2783,    15,   113,   182,   231,  2746,
            12,  1535,    91,    12,   151,    30,     8,  6346,     7,     6,
            11,    25,  3133,   217,    24,    16,    48,   356,   976, 10618,
           243,     5,    96,  3774,    31,    60,  2508,    81,   895, 10270,
             7,    45,  3115, 20633,  1747,     6,   114,  9702,   781,   221,
             6,     8,  5824,  3368,    13,   304,  1725,     9,     6, 21099,
             6, 10508,     6, 30758,   535,   299,    21,     8,   511,    97,
           437, 11065,    31,  4356,     6,   150,  5452,   263,     8,   570,
             5,    96,   371,    52, 11389,     7,    31,  3275,    19,   182,
           964,    10,   216,  2746,    12,   281,    12,     8, 20187,   158,
          5082,    88,  2593,  1066,   145,  1747,    24,    33,   641,   420,
            18,    88, 19649,    28,   895, 10270,     7,   976, 10618,   243,
             5, 14702,  5377,   155,   235,     6,     3,     9,  5812,    13,
          2078,   892,    44,  2566,   152,   636,    16,   368,  5092,     6,
          4466,    24, 11065,  2162,   112,   126, 21079,    13,   895, 10270,
             7,    30,     8,  6502,   377, 11535,    13,     8, 12741,  8237,
            63,     6,    84, 18681,    15,     7,     8,   719,    13,     8,
         22673,    12,  1850,    31,  3879,  4687,    16, 15659,   109,  6015,
             5,    96,  7638, 18886,    13,   386,  7624,  1076,    45,   623,
           550,     6,     8, 17384,    31,     7,  3703,    21,   895, 10270,
           497,    24,   334,   415,  2078, 15314,     3,     9,   286,    44,
             8,   600,   953,   535,    86,   119,  1234,     6, 11065,  2746,
             3,     9,    72,    20, 21411,  2078,    11,  2746,    12,  1616,
          5139,   912,    45,   422,  2597,    24,  2561,   623,    45,  6502,
           159,    51,    31,     7,   579,  6881,     6,  5377,   155,   235,
           243,     5,   466,   744,    31,    17,  1243, 11065,    19,     8,
           166, 19068,  5982,    12,     3,     9,   102,  2700,   895, 10270,
             7,    45,     8,  2421,   296,     6,   713,     5, 22738,    16,
             8, 13978,     7,     6,    46,  3094,   381,    13,  6271,   797,
          2078,   904,   130,  2650,   895, 10270,     7,     6,    11,    16,
             8,  8754,     7,     6,   472,     5,  1079,     3,     4,     4,
         13671,     6,  4068, 11065,    54,   106,  1601,   336,   215,     6,
          7817,     8,   166,   895, 10270,     7,    45,  3411,     6,     8,
         12729,    11,  2648,     5,    86,   811,    12,     8,   627,   126,
           895, 10270,     7, 11065,  2650,    30,  1771,     6,   874, 10611,
         11508, 11514, 10776,     7,    11, 25214,     7,    56,    92,    36,
         13242,    38,   895, 10270,     7,     5,  2506,   215,     6, 17384,
         11065,  7817,   957,   126,   895, 10270,     7,     6,   379, 25214,
             7,    45, 22179,    11,  4152,  2917,     9,  1699,     7,    32,
             5, 19602,    31,     7,  4173, 27575,    11,  2144, 10333,   109,
           377,  8038,    76,  9859,    12,    48,   934,     5,     1]]), 'attention_mask': tensor([[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]])}

summary_ids =  tensor([[    0,   126,   895, 10270,     7,   369,    45,  1179,  1440,    45,
           334, 10829,     3,     5,    79,    33,   557,     3,  4822,    12]])

['new cardinals come from 13 countries from every continent . they are often referred to']
['new cardinals come from 13 countries from every continent . they are often referred to']

Process finished with exit code 0
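An alternative to calling the tokenizer and generate() by hand is the summarization pipeline, which wraps both steps. A minimal sketch, assuming the same local t5-base directory as above with the stock t5-base config:

from transformers import pipeline

# Build a summarization pipeline around the local checkpoint; the tokenizer is loaded from the same path.
summarizer = pipeline('summarization', model=r'D:\Pretrained_Model\t5-base')

# For the stock t5-base config, the "summarize: " prefix is taken from task_specific_params,
# so it does not have to be added to the text by hand.
result = summarizer(text, max_length=100, min_length=30)
print(result[0]['summary_text'])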

3. Fine-tuning T5 for text summarization (on the XSum dataset)

# https://github.com/huggingface/notebooks/blob/master/examples/summarization.ipynb
import nltk
import numpy as np
from datasets import load_dataset, load_metric
from transformers import AutoTokenizer
from transformers import AutoModelForSeq2SeqLM, DataCollatorForSeq2Seq, Seq2SeqTrainingArguments, Seq2SeqTrainer

model_checkpoint = r"D:\Pretrained_Model\t5-base"
raw_datasets = load_dataset("xsum")
metric = load_metric("rouge")

print('raw_datasets = ', raw_datasets)
print("raw_datasets['train'][0] = ", raw_datasets['train'][0])

tokenizer = AutoTokenizer.from_pretrained(model_checkpoint)

prefix = "summarize: "


def preprocess_function(examples):
    inputs = [prefix + doc for doc in examples["document"]]
    model_inputs = tokenizer(inputs, max_length=1024, truncation=True)

    # Setup the tokenizer for targets
    with tokenizer.as_target_tokenizer():
        labels = tokenizer(examples["summary"], max_length=128, truncation=True)

    model_inputs["labels"] = labels["input_ids"]
    return model_inputs


def compute_metrics(eval_pred):
    predictions, labels = eval_pred
    decoded_preds = tokenizer.batch_decode(predictions, skip_special_tokens=True)
    # Replace -100 in the labels as we can't decode them.
    labels = np.where(labels != -100, labels, tokenizer.pad_token_id)
    decoded_labels = tokenizer.batch_decode(labels, skip_special_tokens=True)

    # Rouge expects a newline after each sentence
    decoded_preds = ["\n".join(nltk.sent_tokenize(pred.strip())) for pred in decoded_preds]
    decoded_labels = ["\n".join(nltk.sent_tokenize(label.strip())) for label in decoded_labels]

    result = metric.compute(predictions=decoded_preds, references=decoded_labels, use_stemmer=True)
    # Extract a few results
    result = {key: value.mid.fmeasure * 100 for key, value in result.items()}

    # Add mean generated length
    prediction_lens = [np.count_nonzero(pred != tokenizer.pad_token_id) for pred in predictions]
    result["gen_len"] = np.mean(prediction_lens)

    return {k: round(v, 4) for k, v in result.items()}


tokenized_datasets = raw_datasets.map(preprocess_function, batched=True)

# ----------------------------------- Fine-tuning the model -----------------------------------
model = AutoModelForSeq2SeqLM.from_pretrained(model_checkpoint)
batch_size = 1
model_name = model_checkpoint.split("/")[-1]
args = Seq2SeqTrainingArguments(
    "finetuned-xsum",
    evaluation_strategy="epoch",
    learning_rate=2e-5,
    per_device_train_batch_size=batch_size,
    per_device_eval_batch_size=batch_size,
    weight_decay=0.01,
    save_total_limit=3,
    num_train_epochs=1,
    predict_with_generate=True,
    fp16=True,
    push_to_hub=False,
)

data_collator = DataCollatorForSeq2Seq(tokenizer, model=model)

trainer = Seq2SeqTrainer(
    model,
    args,
    train_dataset=tokenized_datasets["test"],   # note: the smaller test split is used for training here, which keeps this demo run short
    eval_dataset=tokenized_datasets["validation"],
    data_collator=data_collator,
    tokenizer=tokenizer,
    compute_metrics=compute_metrics
)

trainer.train()
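Once trainer.train() finishes, it is worth persisting the result to a standalone directory and running an explicit evaluation. A minimal sketch of that follow-up is below; the output path mirrors the 't5-base-finetuning' directory loaded in section 4, but how that directory was actually produced is not stated in the original, so treat the path as an assumption.

# Save the fine-tuned model and tokenizer so they can be reloaded later with from_pretrained().
trainer.save_model(r"D:\Pretrained_Model\t5-base-finetuning")

# Run a standalone evaluation pass (same ROUGE metrics as reported during training).
print(trainer.evaluate())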

Output:

raw_datasets =  DatasetDict({
    train: Dataset({
        features: ['document', 'summary', 'id'],
        num_rows: 204045
    })
    validation: Dataset({
        features: ['document', 'summary', 'id'],
        num_rows: 11332
    })
    test: Dataset({
        features: ['document', 'summary', 'id'],
        num_rows: 11334
    })
})
raw_datasets['train'][0] =  {'document': 'Recent reports have linked some France-based players with returns to Wales.\n"I\'ve always felt - and this is with my rugby hat on now; this is not region or WRU - I\'d rather spend that money on keeping players in Wales," said Davies.\nThe WRU provides £2m to the fund and £1.3m comes from the regions.\nFormer Wales and British and Irish Lions fly-half Davies became WRU chairman on Tuesday 21 October, succeeding deposed David Pickering following governing body elections.\nHe is now serving a notice period to leave his role as Newport Gwent Dragons chief executive after being voted on to the WRU board in September.\nDavies was among the leading figures among Dragons, Ospreys, Scarlets and Cardiff Blues officials who were embroiled in a protracted dispute with the WRU that ended in a £60m deal in August this year.\nIn the wake of that deal being done, Davies said the £3.3m should be spent on ensuring current Wales-based stars remain there.\nIn recent weeks, Racing Metro flanker Dan Lydiate was linked with returning to Wales.\nLikewise the Paris club\'s scrum-half Mike Phillips and centre Jamie Roberts were also touted for possible returns.\nWales coach Warren Gatland has said: "We haven\'t instigated contact with the players.\n"But we are aware that one or two of them are keen to return to Wales sooner rather than later."\nSpeaking to Scrum V on BBC Radio Wales, Davies re-iterated his stance, saying keeping players such as Scarlets full-back Liam Williams and Ospreys flanker Justin Tipuric in Wales should take precedence.\n"It\'s obviously a limited amount of money [available]. The union are contributing 60% of that contract and the regions are putting £1.3m in.\n"So it\'s a total pot of just over £3m and if you look at the sorts of salaries that the... guys... have been tempted to go overseas for [are] significant amounts of money.\n"So if we were to bring the players back, we\'d probably get five or six players.\n"And I\'ve always felt - and this is with my rugby hat on now; this is not region or WRU - I\'d rather spend that money on keeping players in Wales.\n"There are players coming out of contract, perhaps in the next year or so… you\'re looking at your Liam Williams\' of the world; Justin Tipuric for example - we need to keep these guys in Wales.\n"We actually want them there. 
They are the ones who are going to impress the young kids, for example.\n"They are the sort of heroes that our young kids want to emulate.\n"So I would start off [by saying] with the limited pot of money, we have to retain players in Wales.\n"Now, if that can be done and there\'s some spare monies available at the end, yes, let\'s look to bring players back.\n"But it\'s a cruel world, isn\'t it?\n"It\'s fine to take the buck and go, but great if you can get them back as well, provided there\'s enough money."\nBritish and Irish Lions centre Roberts has insisted he will see out his Racing Metro contract.\nHe and Phillips also earlier dismissed the idea of leaving Paris.\nRoberts also admitted being hurt by comments in French Newspaper L\'Equipe attributed to Racing Coach Laurent Labit questioning their effectiveness.\nCentre Roberts and flanker Lydiate joined Racing ahead of the 2013-14 season while scrum-half Phillips moved there in December 2013 after being dismissed for disciplinary reasons by former club Bayonne.', 'id': '29750031', 'summary': 'New Welsh Rugby Union chairman Gareth Davies believes a joint £3.3m WRU-regions fund should be used to retain home-based talent such as Liam Williams, not bring back exiled stars.'}
Ignored unknown kwarg option direction
  0%|          | 0/205 [00:00<?, ?ba/s]Ignored unknown kwarg option direction
  0%|          | 1/205 [00:00<01:27,  2.33ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
  1%|          | 2/205 [00:00<01:23,  2.43ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
  1%|| 3/205 [00:01<01:21,  2.49ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
  2%|| 4/205 [00:01<01:17,  2.58ba/s]Ignored unknown kwarg option direction
...
...
...
 97%|█████████▋| 199/205 [01:25<00:02,  2.07ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 98%|█████████▊| 200/205 [01:26<00:02,  2.10ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 98%|█████████▊| 201/205 [01:26<00:01,  2.09ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 99%|█████████▊| 202/205 [01:27<00:01,  2.13ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 99%|█████████▉| 203/205 [01:27<00:00,  2.14ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
100%|██████████| 205/205 [01:28<00:00,  2.32ba/s]
Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
  0%|          | 0/12 [00:00<?, ?ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
  8%|| 1/12 [00:00<00:05,  2.16ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 17%|█▋        | 2/12 [00:00<00:04,  2.07ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 25%|██▌       | 3/12 [00:01<00:04,  2.13ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 33%|███▎      | 4/12 [00:01<00:03,  2.07ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 42%|████▏     | 5/12 [00:02<00:03,  2.09ba/s]Ignored unknown kwarg option direction
 50%|█████     | 6/12 [00:02<00:02,  2.12ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 58%|█████▊    | 7/12 [00:03<00:02,  2.15ba/s]Ignored unknown kwarg option direction
 67%|██████▋   | 8/12 [00:03<00:01,  2.19ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 75%|███████▌  | 9/12 [00:04<00:01,  2.20ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 83%|████████▎ | 10/12 [00:04<00:00,  2.20ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 92%|█████████▏| 11/12 [00:05<00:00,  2.18ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
100%|██████████| 12/12 [00:05<00:00,  2.26ba/s]
Ignored unknown kwarg option direction
  0%|          | 0/12 [00:00<?, ?ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
  8%|| 1/12 [00:00<00:05,  1.85ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 17%|█▋        | 2/12 [00:01<00:05,  1.92ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 25%|██▌       | 3/12 [00:01<00:04,  1.97ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 33%|███▎      | 4/12 [00:01<00:03,  2.02ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 42%|████▏     | 5/12 [00:02<00:03,  2.09ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 50%|█████     | 6/12 [00:02<00:02,  2.10ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 58%|█████▊    | 7/12 [00:03<00:02,  2.11ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 67%|██████▋   | 8/12 [00:03<00:01,  2.11ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 75%|███████▌  | 9/12 [00:04<00:01,  2.11ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 83%|████████▎ | 10/12 [00:04<00:00,  2.11ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
 92%|█████████▏| 11/12 [00:05<00:00,  2.13ba/s]Ignored unknown kwarg option direction
Ignored unknown kwarg option direction
100%|██████████| 12/12 [00:05<00:00,  2.22ba/s]
Using amp half precision backend
The following columns in the training set  don't have a corresponding argument in `T5ForConditionalGeneration.forward` and have been ignored: document, summary, id.
C:\Program_Files_AI\Anaconda3531\lib\site-packages\transformers\optimization.py:309: FutureWarning: This implementation of AdamW is deprecated and will be removed in a future version. Use thePyTorch implementation torch.optim.AdamW instead, or set `no_deprecation_warning=True` to disable this warning
  FutureWarning,
***** Running training *****
  Num examples = 11334
  Num Epochs = 1
  Instantaneous batch size per device = 1
  Total train batch size (w. parallel, distributed & accumulation) = 1
  Gradient Accumulation steps = 1
  Total optimization steps = 11334
  4%|| 500/11334 [02:30<1:12:22,  2.49it/s]Saving model checkpoint to finetuned-xsum\checkpoint-500
Configuration saved in finetuned-xsum\checkpoint-500\config.json
{'loss': 2.5572, 'learning_rate': 1.9124757367213695e-05, 'epoch': 0.04}
Model weights saved in finetuned-xsum\checkpoint-500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-500\spiece.model
  9%|| 1000/11334 [05:06<1:01:50,  2.78it/s]Saving model checkpoint to finetuned-xsum\checkpoint-1000
{'loss': 2.3531, 'learning_rate': 1.8244220928180694e-05, 'epoch': 0.09}
Configuration saved in finetuned-xsum\checkpoint-1000\config.json
Model weights saved in finetuned-xsum\checkpoint-1000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-1000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-1000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-1000\spiece.model
 13%|█▎        | 1500/11334 [07:38<48:28,  3.38it/s]Saving model checkpoint to finetuned-xsum\checkpoint-1500
Configuration saved in finetuned-xsum\checkpoint-1500\config.json
{'loss': 2.2812, 'learning_rate': 1.736191988706547e-05, 'epoch': 0.13}
Model weights saved in finetuned-xsum\checkpoint-1500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-1500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-1500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-1500\spiece.model
 18%|█▊        | 2000/11334 [10:15<45:42,  3.40it/s]Saving model checkpoint to finetuned-xsum\checkpoint-2000
Configuration saved in finetuned-xsum\checkpoint-2000\config.json
{'loss': 2.2919, 'learning_rate': 1.648138344803247e-05, 'epoch': 0.18}
Model weights saved in finetuned-xsum\checkpoint-2000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-2000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-2000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-2000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-500] due to args.save_total_limit
 22%|██▏       | 2500/11334 [12:53<42:04,  3.50it/s]Saving model checkpoint to finetuned-xsum\checkpoint-2500
Configuration saved in finetuned-xsum\checkpoint-2500\config.json
{'loss': 2.2519, 'learning_rate': 1.5602611611081703e-05, 'epoch': 0.22}
Model weights saved in finetuned-xsum\checkpoint-2500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-2500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-2500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-2500\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-1000] due to args.save_total_limit
 26%|██▋       | 3000/11334 [15:30<40:44,  3.41it/s]Saving model checkpoint to finetuned-xsum\checkpoint-3000
{'loss': 2.2395, 'learning_rate': 1.4720310569966474e-05, 'epoch': 0.26}
Configuration saved in finetuned-xsum\checkpoint-3000\config.json
Model weights saved in finetuned-xsum\checkpoint-3000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-3000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-3000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-3000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-1500] due to args.save_total_limit
 31%|███       | 3500/11334 [18:06<37:08,  3.52it/s]Saving model checkpoint to finetuned-xsum\checkpoint-3500
{'loss': 2.2298, 'learning_rate': 1.3839774130933477e-05, 'epoch': 0.31}
Configuration saved in finetuned-xsum\checkpoint-3500\config.json
Model weights saved in finetuned-xsum\checkpoint-3500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-3500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-3500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-3500\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-2000] due to args.save_total_limit
 35%|███▌      | 4000/11334 [20:38<37:43,  3.24it/s]Saving model checkpoint to finetuned-xsum\checkpoint-4000
{'loss': 2.224, 'learning_rate': 1.2959237691900476e-05, 'epoch': 0.35}
Configuration saved in finetuned-xsum\checkpoint-4000\config.json
Model weights saved in finetuned-xsum\checkpoint-4000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-4000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-4000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-4000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-2500] due to args.save_total_limit
 40%|███▉      | 4500/11334 [23:12<48:04,  2.37it/s]Saving model checkpoint to finetuned-xsum\checkpoint-4500
{'loss': 2.2665, 'learning_rate': 1.207870125286748e-05, 'epoch': 0.4}
Configuration saved in finetuned-xsum\checkpoint-4500\config.json
Model weights saved in finetuned-xsum\checkpoint-4500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-4500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-4500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-4500\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-3000] due to args.save_total_limit
 44%|████▍     | 5000/11334 [25:45<30:40,  3.44it/s]Saving model checkpoint to finetuned-xsum\checkpoint-5000
{'loss': 2.2154, 'learning_rate': 1.1196400211752252e-05, 'epoch': 0.44}
Configuration saved in finetuned-xsum\checkpoint-5000\config.json
Model weights saved in finetuned-xsum\checkpoint-5000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-5000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-5000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-5000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-3500] due to args.save_total_limit
 49%|████▊     | 5500/11334 [28:22<32:52,  2.96it/s]Saving model checkpoint to finetuned-xsum\checkpoint-5500
{'loss': 2.185, 'learning_rate': 1.0315863772719253e-05, 'epoch': 0.49}
Configuration saved in finetuned-xsum\checkpoint-5500\config.json
Model weights saved in finetuned-xsum\checkpoint-5500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-5500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-5500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-5500\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-4000] due to args.save_total_limit
 53%|█████▎    | 6000/11334 [31:02<27:55,  3.18it/s]Saving model checkpoint to finetuned-xsum\checkpoint-6000
Configuration saved in finetuned-xsum\checkpoint-6000\config.json
{'loss': 2.2635, 'learning_rate': 9.433562731604025e-06, 'epoch': 0.53}
Model weights saved in finetuned-xsum\checkpoint-6000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-6000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-6000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-6000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-4500] due to args.save_total_limit
 57%|█████▋    | 6500/11334 [33:37<27:12,  2.96it/s]Saving model checkpoint to finetuned-xsum\checkpoint-6500
Configuration saved in finetuned-xsum\checkpoint-6500\config.json
{'loss': 2.2082, 'learning_rate': 8.553026292571027e-06, 'epoch': 0.57}
Model weights saved in finetuned-xsum\checkpoint-6500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-6500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-6500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-6500\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-5000] due to args.save_total_limit
 62%|██████▏   | 7000/11334 [36:12<18:01,  4.01it/s]Saving model checkpoint to finetuned-xsum\checkpoint-7000
{'loss': 2.201, 'learning_rate': 7.670725251455797e-06, 'epoch': 0.62}
Configuration saved in finetuned-xsum\checkpoint-7000\config.json
Model weights saved in finetuned-xsum\checkpoint-7000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-7000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-7000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-7000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-5500] due to args.save_total_limit
 66%|██████▌   | 7500/11334 [38:52<16:44,  3.82it/s]Saving model checkpoint to finetuned-xsum\checkpoint-7500
{'loss': 2.1945, 'learning_rate': 6.791953414505029e-06, 'epoch': 0.66}
Configuration saved in finetuned-xsum\checkpoint-7500\config.json
Model weights saved in finetuned-xsum\checkpoint-7500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-7500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-7500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-7500\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-6000] due to args.save_total_limit
 71%|███████   | 8000/11334 [41:33<16:26,  3.38it/s]Saving model checkpoint to finetuned-xsum\checkpoint-8000
{'loss': 2.1742, 'learning_rate': 5.911416975472032e-06, 'epoch': 0.71}
Configuration saved in finetuned-xsum\checkpoint-8000\config.json
Model weights saved in finetuned-xsum\checkpoint-8000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-8000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-8000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-8000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-6500] due to args.save_total_limit
 75%|███████▍  | 8500/11334 [44:16<14:14,  3.32it/s]Saving model checkpoint to finetuned-xsum\checkpoint-8500
{'loss': 2.2351, 'learning_rate': 5.029115934356803e-06, 'epoch': 0.75}
Configuration saved in finetuned-xsum\checkpoint-8500\config.json
Model weights saved in finetuned-xsum\checkpoint-8500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-8500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-8500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-8500\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-7000] due to args.save_total_limit
 79%|███████▉  | 9000/11334 [47:02<13:46,  2.82it/s]Saving model checkpoint to finetuned-xsum\checkpoint-9000
{'loss': 2.2096, 'learning_rate': 4.146814893241574e-06, 'epoch': 0.79}
Configuration saved in finetuned-xsum\checkpoint-9000\config.json
Model weights saved in finetuned-xsum\checkpoint-9000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-9000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-9000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-9000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-7500] due to args.save_total_limit
 84%|████████▍ | 9500/11334 [49:46<08:49,  3.47it/s]Saving model checkpoint to finetuned-xsum\checkpoint-9500
{'loss': 2.1603, 'learning_rate': 3.2662784542085763e-06, 'epoch': 0.84}
Configuration saved in finetuned-xsum\checkpoint-9500\config.json
Model weights saved in finetuned-xsum\checkpoint-9500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-9500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-9500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-9500\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-8000] due to args.save_total_limit
 88%|████████▊ | 10000/11334 [52:28<07:06,  3.13it/s]Saving model checkpoint to finetuned-xsum\checkpoint-10000
{'loss': 2.161, 'learning_rate': 2.3839774130933478e-06, 'epoch': 0.88}
Configuration saved in finetuned-xsum\checkpoint-10000\config.json
Model weights saved in finetuned-xsum\checkpoint-10000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-10000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-10000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-10000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-8500] due to args.save_total_limit
 93%|█████████▎| 10500/11334 [55:12<03:52,  3.58it/s]Saving model checkpoint to finetuned-xsum\checkpoint-10500
{'loss': 2.1606, 'learning_rate': 1.501676371978119e-06, 'epoch': 0.93}
Configuration saved in finetuned-xsum\checkpoint-10500\config.json
Model weights saved in finetuned-xsum\checkpoint-10500\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-10500\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-10500\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-10500\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-9000] due to args.save_total_limit
 97%|█████████▋| 11000/11334 [57:53<01:48,  3.08it/s]Saving model checkpoint to finetuned-xsum\checkpoint-11000
Configuration saved in finetuned-xsum\checkpoint-11000\config.json
{'loss': 2.1629, 'learning_rate': 6.211399329451209e-07, 'epoch': 0.97}
Model weights saved in finetuned-xsum\checkpoint-11000\pytorch_model.bin
tokenizer config file saved in finetuned-xsum\checkpoint-11000\tokenizer_config.json
Special tokens file saved in finetuned-xsum\checkpoint-11000\special_tokens_map.json
Copy vocab file to finetuned-xsum\checkpoint-11000\spiece.model
Deleting older checkpoint [finetuned-xsum\checkpoint-9500] due to args.save_total_limit
100%|██████████| 11334/11334 [59:38<00:00,  3.15it/s]The following columns in the evaluation set  don't have a corresponding argument in `T5ForConditionalGeneration.forward` and have been ignored: document, summary, id.
***** Running Evaluation *****
  Num examples = 11332
  Batch size = 1

  0%|          | 0/11332 [00:00<?, ?it/s]
  0%|          | 2/11332 [00:00<43:52,  4.30it/s]
  0%|          | 3/11332 [00:00<48:16,  3.91it/s]
  0%|          | 4/11332 [00:01<48:45,  3.87it/s]
  0%|          | 5/11332 [00:01<54:24,  3.47it/s]
  0%|          | 6/11332 [00:01<1:03:02,  2.99it/s]
  0%|          | 7/11332 [00:02<1:00:54,  3.10it/s]
  0%|          | 8/11332 [00:02<1:08:59,  2.74it/s]
  0%|          | 9/11332 [00:02<1:06:21,  2.84it/s]
  0%|          | 10/11332 [00:03<1:07:00,  2.82it/s]
  0%|          | 11/11332 [00:03<1:12:58,  2.59it/s]
  0%|          | 12/11332 [00:04<1:08:00,  2.77it/s]
...
...
...
100%|█████████▉| 11327/11332 [1:07:16<00:01,  2.83it/s]
100%|█████████▉| 11328/11332 [1:07:16<00:01,  2.89it/s]
100%|█████████▉| 11329/11332 [1:07:16<00:00,  3.00it/s]
100%|█████████▉| 11330/11332 [1:07:16<00:00,  3.09it/s]
100%|█████████▉| 11331/11332 [1:07:17<00:00,  3.07it/s]
                                                     
{'eval_loss': 1.9903812408447266, 'eval_rouge1': 32.2647, 'eval_rouge2': 10.6523, 'eval_rougeL': 25.628, 'eval_rougeLsum': 25.6236, 'eval_gen_len': 18.713, 'eval_runtime': 4055.0046, 'eval_samples_per_second': 2.795, 'eval_steps_per_second': 2.795, 'epoch': 1.0}
{'train_runtime': 7633.6468, 'train_samples_per_second': 1.485, 'train_steps_per_second': 1.485, 'train_loss': 2.2352082908984445, 'epoch': 1.0}
100%|██████████| 11334/11334 [2:07:13<00:00,  3.15it/s]
100%|██████████| 11332/11332 [1:07:34<00:00,  3.16it/s]
                                                       

Training completed. Do not forget to share your model on huggingface.co/models =)


100%|██████████| 11334/11334 [2:07:13<00:00,  1.48it/s]

Process finished with exit code 0
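Each checkpoint-XXXX directory written during training contains config.json, pytorch_model.bin and the tokenizer files (see the log above), so it can be loaded directly with from_pretrained(). A minimal sketch; the specific checkpoint number is just an example:

from transformers import T5Tokenizer, T5ForConditionalGeneration

# Any saved checkpoint directory can be loaded the same way as a hub model id.
tokenizer = T5Tokenizer.from_pretrained(r'finetuned-xsum\checkpoint-11000')
model = T5ForConditionalGeneration.from_pretrained(r'finetuned-xsum\checkpoint-11000')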

4. Using the fine-tuned T5

# https://github.com/huggingface/transformers/blob/master/src/transformers/models/t5/modeling_t5.py
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained(r'D:\Pretrained_Model\t5-base-finetuning')
model = T5ForConditionalGeneration.from_pretrained(r'D:\Pretrained_Model\t5-base-finetuning')

# Note: fine-tuning used the "summarize: " prefix, but the input below omits it; adding the prefix would match the training format more closely.
text = """
         (CNN)For the second time during his papacy, Pope Francis has announced a new group of bishops and archbishops set to become cardinals -- and they come from all over the world.
        Pope Francis said Sunday that he would hold a meeting of cardinals on February 14 "during which I will name 15 new Cardinals who, coming from 13 countries from every continent, manifest the indissoluble links between the Church of Rome and the particular Churches present in the world," according to Vatican Radio.
        New cardinals are always important because they set the tone in the church and also elect the next pope, CNN Senior Vatican Analyst John L. Allen said. They are sometimes referred to as the princes of the Catholic Church.
        The new cardinals come from countries such as Ethiopia, New Zealand and Myanmar.
        "This is a pope who very much wants to reach out to people on the margins, and you clearly see that in this set," Allen said. "You're talking about cardinals from typically overlooked places, like Cape Verde, the Pacific island of Tonga, Panama, Thailand, Uruguay."
        But for the second time since Francis' election, no Americans made the list.
        "Francis' pattern is very clear: He wants to go to the geographical peripheries rather than places that are already top-heavy with cardinals," Allen said.
        Christopher Bellitto, a professor of church history at Kean University in New Jersey, noted that Francis announced his new slate of cardinals on the Catholic Feast of the Epiphany, which commemorates the visit of the Magi to Jesus' birthplace in Bethlehem.
        "On feast of three wise men from far away, the Pope's choices for cardinal say that every local church deserves a place at the big table."
        In other words, Francis wants a more decentralized church and wants to hear reform ideas from small communities that sit far from Catholicism's power centers, Bellitto said.
        That doesn't mean Francis is the first pontiff to appoint cardinals from the developing world, though. Beginning in the 1920s, an increasing number of Latin American churchmen were named cardinals, and in the 1960s, St. John XXIII, whom Francis canonized last year, appointed the first cardinals from Japan, the Philippines and Africa.
        In addition to the 15 new cardinals Francis named on Sunday, five retired archbishops and bishops will also be honored as cardinals.
        Last year, Pope Francis appointed 19 new cardinals, including bishops from Haiti and Burkina Faso.
        CNN's Daniel Burke and Christabelle Fombu contributed to this report.
"""
# CNN/DM reference summary (highlights):
# @highlight
# The 15 new cardinals will be installed on February 14
# @highlight
# They come from countries such as Myanmar and Tonga
# @highlight
# No Americans made the list this time or the previous time in Francis' papacy

inputs = tokenizer(text, max_length=1024, truncation=True, return_tensors='pt')

print('inputs = ', inputs)

summary_ids = model.generate(inputs['input_ids'])

print('\nsummary_ids = ', summary_ids)

print([tokenizer.decode(g, skip_special_tokens=True, clean_up_tokenization_spaces=False) for g in summary_ids])
print(tokenizer.batch_decode(summary_ids, skip_special_tokens=True, clean_up_tokenization_spaces=False))

Output:

inputs =  {'input_ids': tensor([[   41,   254, 17235,    61,  3809,     8,   511,    97,   383,   112,
             3, 16281,  4710,     6, 17384, 11065,    65,  2162,     3,     9,
           126,   563,    13, 25214,     7,    11, 11508, 11514, 10776,     7,
           356,    12,   582,   895, 10270,     7,  1636,    11,    79,   369,
            45,    66,   147,     8,   296,     5, 17384, 11065,   243,  1771,
            24,     3,    88,   133,  1520,     3,     9,  1338,    13,   895,
         10270,     7,    30,  2083,   968,    96,    26,  7920,    84,    27,
            56,   564,   627,   126, 21967,     7,   113,     6,  1107,    45,
          1179,  1440,    45,   334, 10829,     6,  6571,     8,     3,  8482,
             7, 26175,  2416,   344,     8,  2345,    13,  7332,    11,     8,
          1090,  2345,    15,     7,   915,    16,     8,   296,   976,  1315,
            12, 25770,  5061,     5,   368,   895, 10270,     7,    33,   373,
           359,   250,    79,   356,     8,  5739,    16,     8,  2078,    11,
            92, 11924,     8,   416,  2783,    15,     6, 19602,  5523, 25770,
         25224,  1079,   301,     5, 10618,   243,     5,   328,    33,  1664,
             3,  4822,    12,    38,     8, 22277,     7,    13,     8,  6502,
          2345,     5,    37,   126,   895, 10270,     7,   369,    45,  1440,
           224,    38, 22138,     6,   368,  5725,    11, 27274,     5,    96,
          3713,    19,     3,     9,  2783,    15,   113,   182,   231,  2746,
            12,  1535,    91,    12,   151,    30,     8,  6346,     7,     6,
            11,    25,  3133,   217,    24,    16,    48,   356,   976, 10618,
           243,     5,    96,  3774,    31,    60,  2508,    81,   895, 10270,
             7,    45,  3115, 20633,  1747,     6,   114,  9702,   781,   221,
             6,     8,  5824,  3368,    13,   304,  1725,     9,     6, 21099,
             6, 10508,     6, 30758,   535,   299,    21,     8,   511,    97,
           437, 11065,    31,  4356,     6,   150,  5452,   263,     8,   570,
             5,    96,   371,    52, 11389,     7,    31,  3275,    19,   182,
           964,    10,   216,  2746,    12,   281,    12,     8, 20187,   158,
          5082,    88,  2593,  1066,   145,  1747,    24,    33,   641,   420,
            18,    88, 19649,    28,   895, 10270,     7,   976, 10618,   243,
             5, 14702,  5377,   155,   235,     6,     3,     9,  5812,    13,
          2078,   892,    44,  2566,   152,   636,    16,   368,  5092,     6,
          4466,    24, 11065,  2162,   112,   126, 21079,    13,   895, 10270,
             7,    30,     8,  6502,   377, 11535,    13,     8, 12741,  8237,
            63,     6,    84, 18681,    15,     7,     8,   719,    13,     8,
         22673,    12,  1850,    31,  3879,  4687,    16, 15659,   109,  6015,
             5,    96,  7638, 18886,    13,   386,  7624,  1076,    45,   623,
           550,     6,     8, 17384,    31,     7,  3703,    21,   895, 10270,
           497,    24,   334,   415,  2078, 15314,     3,     9,   286,    44,
             8,   600,   953,   535,    86,   119,  1234,     6, 11065,  2746,
             3,     9,    72,    20, 21411,  2078,    11,  2746,    12,  1616,
          5139,   912,    45,   422,  2597,    24,  2561,   623,    45,  6502,
           159,    51,    31,     7,   579,  6881,     6,  5377,   155,   235,
           243,     5,   466,   744,    31,    17,  1243, 11065,    19,     8,
           166, 19068,  5982,    12,     3,     9,   102,  2700,   895, 10270,
             7,    45,     8,  2421,   296,     6,   713,     5, 22738,    16,
             8, 13978,     7,     6,    46,  3094,   381,    13,  6271,   797,
          2078,   904,   130,  2650,   895, 10270,     7,     6,    11,    16,
             8,  8754,     7,     6,   472,     5,  1079,     3,     4,     4,
         13671,     6,  4068, 11065,    54,   106,  1601,   336,   215,     6,
          7817,     8,   166,   895, 10270,     7,    45,  3411,     6,     8,
         12729,    11,  2648,     5,    86,   811,    12,     8,   627,   126,
           895, 10270,     7, 11065,  2650,    30,  1771,     6,   874, 10611,
         11508, 11514, 10776,     7,    11, 25214,     7,    56,    92,    36,
         13242,    38,   895, 10270,     7,     5,  2506,   215,     6, 17384,
         11065,  7817,   957,   126,   895, 10270,     7,     6,   379, 25214,
             7,    45, 22179,    11,  4152,  2917,     9,  1699,     7,    32,
             5, 19602,    31,     7,  4173, 27575,    11,  2144, 10333,   109,
           377,  8038,    76,  9859,    12,    48,   934,     5,     1]]), 'attention_mask': tensor([[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
         1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]])}

summary_ids =  tensor([[    0,    37, 17384,    65,  2650,   627,   126,   895, 10270,     7,
            45,  1179,  1440,     6,   379,     8, 12729,     6, 22179,    11]])

['The Pope has named 15 new cardinals from 13 countries, including the Philippines, Haiti and']
['The Pope has named 15 new cardinals from 13 countries, including the Philippines, Haiti and']

Process finished with exit code 0
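To put a number on the difference between the zero-shot and fine-tuned summaries, the same rouge metric used during fine-tuning can be applied to a single example. In the sketch below, the reference string is the CNN/DM highlights from the comment above joined into one text, which is an illustrative choice rather than the official evaluation protocol.

from datasets import load_metric

metric = load_metric("rouge")

prediction = "The Pope has named 15 new cardinals from 13 countries, including the Philippines, Haiti and"
reference = ("The 15 new cardinals will be installed on February 14. "
             "They come from countries such as Myanmar and Tonga. "
             "No Americans made the list this time or the previous time in Francis' papacy.")

result = metric.compute(predictions=[prediction], references=[reference], use_stemmer=True)
print({k: round(v.mid.fmeasure * 100, 2) for k, v in result.items()})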



References:
T5: a model that explores the limits of transfer learning
The T5 model: a large-scale exploration of Text-to-Text pretraining for NLP
Google's pretrained language model T5
Using Transformers pretrained models: text summarization (Summarization)
