无知的本科生

Day 5. Suicidal Ideation Detection: A Review of Machine Learning Methods and Applications综述

Title:
Suicidal Ideation Detection: A Review of Machine Learning Methods and Applications
自杀意念检测：机器学习方法及应用综述

Keywords:
Deep learning 深度学习
feature engineering 特征工程
social contents 社交内容
suicidal ideation detection (SID) 自杀意念检测

Abstract：
Suicide is a critical issue in modern society. Early detection and prevention of suicide attempts should be addressed to save people’s life. Current suicidal ideation detection (SID) methods include clinical methods based on the interaction between social workers or experts and the targeted individuals and machine learning techniques with feature engineering or deep learning for automatic detection based on online social contents. This article is the ﬁrst survey that comprehensively introduces and discusses the methods from these categories. Domain-speciﬁc applications of SID are reviewed according to their data sources, i.e., questionnaires, electronic health records, suicide notes, and online user content. Several speciﬁc tasks and data sets are introduced and summarized to facilitate further research. Finally, we summarize the limitations of current work and provide an outlook of further research directions.
自杀是一个严重的现代问题。早期发现和预防自杀未遂应该致力于拯救人们的生命。目前的自杀意念检测方法主要包括基于社会工作者或专家与目标个体互动的临床方法和基于特征工程或深度学习的机器学习技术，用于基于在线社交内容的自动检测。本文是第一次全面介绍和讨论这些分类方法的调查。根据SID的数据来源，即调查问卷、电子健康记录、自杀记录和在线用户内容，对SID的特定领域应用进行了审查。介绍和总结了几个具体的任务和数据集，以便于进一步的研究。最后，总结了目前工作的局限性，并对下一步的研究方向进行了展望。

1.INTRODUCTION
MENTAL health issues, such as anxiety and depression, are becoming increasingly concerned in modern society, as they turn out to be especially severe in developed countries and emerging markets. Severe mental disorders without effective treatment can turn to suicidal ideation or even suicide attempts. Some online posts contain much negative information and generate problematic phenomena, such as cyberstalking and cyberbullying. Consequences can be severe and risky since such lousy information is often engaged in some form of social cruelty, leading to rumors or even mental damage. Research shows that there is a link between cyberbullying and suicide [1]. Victims overexposed to too many negative messages or events may become depressed and desperate; even worse, some may commit suicide.
焦虑和抑郁等心理健康问题在现代社会越来越受到关注，在发达国家和新兴市场，焦虑和抑郁尤为严重。严重的精神障碍如果得不到有效的治疗，就会变成自杀意念，甚至是自杀未遂。一些网络帖子含有大量负面信息，并产生一些问题现象，如网络跟踪和网络欺凌。后果可能是严重的和危险的，因为这些糟糕的信息往往参与某种形式的社会残酷，导致谣言甚至精神损害。研究表明，网络欺凌和自杀之间有联系[1]。过度暴露在负面信息或事件中的受害者可能会变得沮丧和绝望；更糟糕的是，有些人可能会自杀。

The reasons that people commit suicide are complicated. People with depression are highly likely to commit suicide, but many without depression can also have suicidal thoughts [2]. According to the American Foundation for Suicide Prevention (AFSP), suicide factors fall under three categories: health factors, environmental factors, and historical factors [3]. Ferrari et al. [4] found that mental health issues and substance use disorders are attributed to the factors of suicide. O’Connor and Nock [5] conducted a thorough review of the psychology of suicide and summarized psychological risks as personality and individual differences, cognitive factors, social factors, and negative life events.
人们自杀的原因很复杂。抑郁症患者极有可能自杀，但许多没有抑郁症的人也可能有自杀念头[2]。根据美国自杀预防基金会（AFSP），自杀因素分为三类：健康因素、环境因素和历史因素[3]。Ferrari等人[4]发现精神健康问题和物质使用障碍是导致自杀的因素。O’Connor和Nock [5] 对自杀心理进行了全面的回顾，并将心理风险归纳为个性和个体差异、认知因素、社会因素和消极生活事件。

Suicidal ideation detection (SID) determines whether the person has suicidal ideation or thoughts by given tabular data of a person or textual content written by a person. Due to the advances in social media and online anonymity, an increasing number of individuals turn to interact with others on the Internet. Online communication channels are becoming a new way for people to express their feelings, suffering, and suicidal tendencies. Hence, online channels have naturally started to act as a surveillance tool for suicidal ideation, and mining social content can improve suicide prevention [6]. Strange social phenomena are emerging, e.g., online communities reaching an agreement on self-mutilation and copycat suicide. For example, a social network phenomenon called the “Blue Whale Game” in 2016 uses many tasks (such as self-harming) and leads game members to commit suicide in the end. Suicide is a critical social issue and takes thousands of lives every year. Thus, it is necessary to detect suicidality and prevent suicide before victims end their life. Early detection and treatment are regarded as the most effective ways to prevent potential suicide attempts.
自杀意念检测（SID）是通过给定的个人表格数据或个人书写的文本内容来判断一个人是否有自杀意念。由于社交媒体和在线匿名技术的进步，越来越多的人开始在互联网上与他人互动。在线交流渠道正在成为人们表达情感、痛苦和自杀倾向的新方式。因此，网络频道自然开始充当自杀意念的监控工具，挖掘社交内容可以提高自杀预防[6]。奇怪的社会现象正在出现，例如，网上社区就自残和模仿自杀达成协议。例如，2016年一个名为“蓝鲸游戏”的社交网络现象使用了许多任务（如自残），并导致游戏成员最终自杀。自杀是一个重要的社会问题，每年夺去成千上万人的生命。因此，有必要在受害者结束生命之前发现自杀行为并防止自杀。早期发现和治疗被认为是防止潜在自杀企图的最有效方法。

Potential victims with suicidal ideation may express their thoughts of committing suicide in ﬂeeting thoughts, suicide plans, and role-playing. SID is to ﬁnd out these risks of ntentions or behaviors before tragedy strikes. A meta-analysis conducted by McHugh et al. [7] shown statistical limitations of ideation as a screening tool but also pointed out that people’s expression of suicidal ideation represents their psychological distress. Effective detection of early signals of suicidal ideation can identify people with suicidal thoughts and open a communication portal to let social workers mitigate their mental issues. The reasons for suicide are complicated and attributed to a complex interaction of manyfactors [5], [8]. To detect suicidal ideation, many researchers conducted psychological and clinical studies [9] and classiﬁed responses of questionnaires [10]. Based on their social media data, artiﬁcial intelligence (AI) and machine learning techniques can predict people’s likelihood of suicide [11], which can better understand people’s intentions and pave the way for early intervention. Detection on social content focuses on feature engineering [12], [13], sentiment analysis [14], [15], and deep learning [16]–[18]. Those methods generally require heuristics to select features or design artiﬁcial neural network (ANN) architectures for learning rich representation. The research trend focuses on selecting more useful features from people’s health records and developing neural architectures to understand the language with suicidal ideation better.
有自杀意念的潜在受害者可能会在自杀想法、自杀计划和角色扮演中表达他们的自杀想法。SID是在悲剧发生之前找出这些意图或行为的风险。McHugh等人进行的荟萃分析[7]显示了意念作为一种筛选工具的统计局限性，同时也指出人们对自杀意念的表达代表了他们的心理困扰。对自杀意念的早期信号进行有效检测，可以识别出有自杀想法的人，并打开一个沟通门户，让社会工作者缓解他们的心理问题。自杀的原因是复杂的，并归因于许多复杂因素的相互作用[5], [8]。为了检测自杀意念，许多研究人员进行了临床研究 [9] 和问卷调查分类[10]。基于他们的社交媒体数据，人工智能（AI）和机器学习技术可以预测人们自杀的可能性[11]，这可以更好地了解人们的意图，为早期干预铺平道路。对社交内容的检测集中在特征工程[12], [13]、情感分析[14], [15]和深度学习[16]–[18]。这些方法通常需要启发式来选择特征或设计人工神经网络（ANN）结构来学习丰富的表示。研究趋势集中在从人们的健康记录中选择更有用的特征，开发神经结构，以便更好地理解带有自杀意念的语言。

Mobile technologies have been studied and applied to suicide prevention, for example, the mobile suicide intervention application iBobbly [19] developed by the Black Dog 2 Institute. Many other suicide prevention tools integrated with social networking services have also been developed, including Samaritans Radar 3 and Woebot 4. The former was a Twitter plugin that was later discontinued because of privacy issues. For monitoring alarming posts, the latter is a Facebook chatbot based on cognitive behavioral therapy and natural language processing (NLP) techniques for relieving people’s depression
and anxiety.
移动技术已经被研究并应用于自杀预防，例如黑狗研究所2开发的移动自杀干预应用iBobbly [19]。还开发了许多其他与社交网络服务相结合的自杀预防工具，包括Samaritans Radar 3和Woebot 4。前者是一个Twitter插件，后来因为隐私问题而停止使用。后者是一个基于认知行为疗法和自然语言处理（NLP）技术的Facebook聊天机器人，用于监控人们的抑郁和焦虑。
2https://blackdoginstitute.org.au/research/digital-dog/programs/ibobbly-app
3https://samaritans.org/about-samaritans/research-policy/internet-suicide/samaritans-radar
4https://woebot.io

Applying cutting-edge AI technologies for SID inevitably comes with privacy issues [20] and ethical concerns [21]. Linthicum et al. [22] put forward three ethical issues, including the inﬂuence of bias on machine learning algorithms, the prediction on time of suicide act, and ethical and legal questions raised by false positive and false negative prediction. It is not easy to answer ethical questions for AI as these require algorithms to reach a balance between competing values,issues, and interests [20].
将尖端人工智能技术应用于SID不可避免地会带来隐私问题[20]和伦理问题[21]。Linthicum等人[22] 提出了三个伦理问题，包括偏差对机器学习算法的影响、对自杀行为时间的预测以及假阳性和假阴性预测所引发的伦理和法律问题。要回答人工智能的伦理问题并不容易，因为这些问题需要算法在相互竞争的价值观、问题和利益之间达到平衡[20]。

AI has been applied to solve many challenging social problems. Detection of suicidal ideation with AI techniquesis one of the potential applications for social good and should be addressed to improve people’s wellbeing meaningfully. The research problems include feature selection on tabular and text data and representation learning on natural language. Many AI-based methods have been applied to classify suicide risks. However, there remain some challenges. There are a limited number of benchmarks for training and evaluating SID. AI-powered models, sometimes, learn statistical clues but fail to understand people’s intentions. Moreover, many neural models are lack of interpretability. This survey reviews SID methods from the perspective of AI and machine learning and speciﬁc domain applications with social impact. The categorization from these two perspectives is shown in Fig. 1. This article provides a comprehensive review of the increasingly important ﬁeld of SID with machine learning methods. It proposes a summary of current research progress and an outlook of future work. The contributions of our survey are summarized as follows.
人工智能已被应用于解决许多具有挑战性的社会问题。用人工智能技术检测自杀意念是切实改善人民生活的潜在应用之一。研究问题包括表格和文本数据的特征选择和自然语言的表示学习。许多基于人工智能的方法被应用于自杀风险的分类。然而，仍然存在一些挑战。训练和评估SID的基准数量有限。人工智能驱动的模型，有时会学习统计线索，但无法理解人们的意图。此外，许多神经网络模型缺乏可解释性。这项调查从人工智能和机器学习以及具有社会影响的特定领域应用的角度回顾了SID方法。图1显示了从这两个角度进行的分类。本文全面回顾了机器学习方法在SID中日益重要的领域。对目前的研究进展进行了总结，并对今后的工作进行了展望。我们调查的贡献总结如下。

1)To the best of our knowledge, this is the ﬁrst survey that conducts a comprehensive review of SID, its methods, and its applications from a machine learning perspective.
2)We introduce and discuss the classical content analysis and modern machine learning techniques, plus their application to questionnaires, EHR data, suicide notes, and online social content.
3)We enumerate existing and less explored tasks and discuss their limitations. We also summarize existing data sets and provide an outlook of future research directions in this ﬁeld.
1）据我们所知，这是第一次从机器学习的角度对SID、其方法和应用进行全面回顾的调查。
2）我们介绍和讨论了经典内容分析和现代机器学习技术，以及它们在问卷调查、EHR数据、自杀笔记和在线社交内容中的应用。
3）我们列举现有的和较少探索的任务，并讨论它们的局限性。我们还总结了现有的数据集，并对该领域未来的研究方向进行了展望。

The remainder of this article is organized as follows. Methods and applications are introduced and summarized in Sections II and III, respectively. Section IV enumerates speciﬁc tasks and some data sets. Finally, we have a discussion and propose some future directions in Section V.
本文的其余部分组织如下。方法和应用分别在第二节和第三节中介绍和总结。第四节列举了具体任务和一些数据集。最后，我们在第五部分进行了讨论，并提出了未来的发展方向。

图1。自杀意念检测的分类：方法和领域。左侧部分表示方法分类，而右侧部分显示域的类别。箭头和实心点表示子类别。

2.METHODS AND CATEGORIZATION
Suicide detection has drawn the attention of many researchers due to an increasing suicide rate in recent years and has been studied extensively from many perspectives.The research techniques used to examine suicide also span many ﬁelds and methods, for example, clinical methods with patient–clinic interaction [9] and automatic detection from user-generated content (mainly text) [12], [17]. Machine learning techniques are widely applied for automatic detection.
近年来，随着自杀率的不断上升，自杀检测引起了许多研究者的关注，并从多个角度进行了广泛的研究。用于检测自杀的研究技术也跨越了许多领域和方法，例如，具有患者-诊所交互的临床方法 [9] 和从用户生成的内容（主要是文本）的自动检测方法 [12], [17]。机器学习技术广泛应用于自动检测。

Traditional suicide detection relies on clinical methods, including self-reports and face-to-face interviews. Venek et al. [9] designed a ﬁve-item ubiquitous questionnaire for the assessment of suicidal risks and applied a hierarchical classiﬁer on the patients’ response to determine their suicidal intentions. Through face-to-face interaction, verbal and acoustic information can be utilized. Scherer [23] investigated the prosodic speech characteristics and voice quality in a dyadic interview to identify suicidal and nonsuicidal juveniles.
Other clinical methods examine the resting state heart rate from converted sensing signals [24] and classify the functional magnetic resonance imaging-based neural representations of death-and life-related words [25] and event-related instigators converted from EEG signals [26]. Another aspect of clinical treatment is the understanding of the psychology behind suicidal behavior [5], which, however, relies heavily on the clinician’s knowledge and face-to-face interaction. Suicide risk assessment scales with clinical interview can reveal informative cues for predicting suicide [27]. Tan et al. [28] conducted an interview and survey study in Weibo, a Twitter-like service in China, to explore the engagement of suicide attempters with intervention by direct messages.
传统的自杀检测依赖于临床方法，包括自我报告和面对面访谈。Venek等人[9]设计了一个五项普遍存在的问卷，用于评估自杀风险，并对患者的反应应用分层分类器来确定他们的自杀意图。通过面对面交流，可以利用言语和听觉信息。在一次二元访谈（？？）中，Scherer [23]研究了韵律语音特征和语音质量，以识别自杀和非自杀青少年。其他临床方法通过转换的感应信号[24]检查静息状态心率，并对基于功能磁共振成像的死亡和生命相关词汇[25]的神经表征以及从脑电图信号 [26] 转换的事件相关刺激因素进行分类。临床治疗的另一个方面是对自杀行为背后心理的理解，然而，自杀行为严重依赖于临床医生的知识和面对面的互动。结合临床访谈的自杀风险评估量表可以揭示预测自杀的信息线索[27]。Tan等人[28]在中国类似Twitter的微博上进行了一项采访和调查研究，以探讨自杀未遂者与直接信息干预的关系。

A. Content Analysis内容分析
Users’ post on social websites reveals rich information and their language preferences. Through exploratory data analysis on the user-generated content, we can have an insight into language usage and linguistic clues of suicide attempters. The detailed analysis includes lexicon-based ﬁltering, statistical linguistic features, and topic modeling within suicide-related posts.
用户在社交网站上发布的帖子揭示了丰富的信息和他们的语言偏好。通过对用户生成内容的探索性数据分析，我们可以深入了解自杀未遂者的语言使用和语言线索。详细分析包括基于词汇的过滤、统计语言特征和自杀相关帖子中的主题建模。

Suicide-related keyword dictionary and lexicon are manually built to enable keyword ﬁltering [29], [30] and phrases ﬁltering [31]. Suicide-related keywords and phrases include “kill,” “suicide,” “feel alone,” “depressed,” and “cutting myself.” Vioulès et al. [3] built a pointwise mutual information symptom lexicon using an annotated Twitter data set. Gunn and Lester [32] analyzed posts from Twitter in the 24 h before the death of a suicide attempter. Coppersmith et al. [33] analyzed the language usage of data from the same platform. Suicidal thoughts may involve strong negative feelings, anxiety, hopelessness, or other social factors, such as family and friends. Ji et al. [17] performed word cloud visualization and topics modeling over suicide-related content and found that suicide-related discussion covers personal and social issues. Colombo et al. [34] analyzed the graphical characteristics of connectivity and communication in the Twitter social network. Coppersmith et al. [35] provided an exploratory analysis of language patterns and emotions on Twitter. Other methods and techniques include Google Trends analysis for suicide risk monitoring [36], the reply bias assessment through linguistic clues [37], human–machine hybrid method for analysis of the language effect of social support on suicidal ideation risk [38], social media content detection, and speech patterns analysis [39].
与自杀相关的关键词词典和词典是手动构建的，以启用关键字过滤[29], [30]和短语过滤 [31]。与自杀相关的关键词和短语包括“杀死”、“自杀”、“感觉孤独”、“抑郁”和“割伤自己”。Vioulès等人[3]使用带注释的Twitter数据集建立了一个点式互信息症状词典。Gunn和Lester [32]分析了一名自杀未遂者死亡前24小时来自Twitter的帖子。Coppersmith等人[33]分析了来自同一平台的数据的语言使用情况。自杀念头可能包括强烈的消极情绪、焦虑、绝望或其他社会因素，如家庭和朋友。Ji等人[17]对自杀相关内容进行了词云可视化和主题建模，发现自杀相关讨论涵盖个人和社会问题。 Colombo等人[34]分析了Twitter社交网络中连接和通信的图形特征。Coppersmith等人[35对Twitter上的语言模式和情绪进行了探索性分析。其他方法和技术包括用于自杀风险监测的谷歌趋势分析 [36]、通过语言线索 [37]进行回答偏差评估、用于分析社会支持对自杀意念风险的语言影响的人机混合方法[38]、社交媒体内容检测和语音模式分析 [39]。

B. Feature Engineering特征工程
The goal of text-based suicide classiﬁcation is to determine whether candidates, through their posts, have suicidal ideations. Machine learning methods and NLP have also been applied in this ﬁeld.
以文本为基础的自杀分类的目的是通过候选人发布的帖子确认是否有自杀的想法。机器学习方法和NLP方法也被应用于这一领域。

Tabular Features: Tabular data for SID consist of questionnaire responses and structured statistical information extracted from websites. Such structured data can be directly used as features for classiﬁcation or regression. Masuda et al. [40] applied logistic regression to classify suicide and control groups based on users’ characteristics and social behavior variables. The authors found variables such as community number, local clustering coefﬁcient, and homophily have a more substantial inﬂuence on suicidal ideation in an SNS of Japan. Chattopadhyay [41] applied pierce suicidal intent scale (PSIS) to assess suicide factors and conducted regression analysis. Questionnaires act as a good source of tabular features. Delgado-Gomez et al. [42] used the international personality disorder examination screening questionnaire and the Holmes–Rahe social readjustment rating scale (SRRS). Chattopadhyay [43] proposed to apply a multilayer feedforward neural network, as shown in Fig. 2(a), to classify suicidal intention indicators according to Beck’s suicide intent scale.
1）表格特征：SID的表格数据包括问卷调查结果和从网站中提取的结构化统计信息。这种结构化数据可以直接用作分类或回归的特征。Masuda等人[40] 应用逻辑回归分析，根据用户特征和社会行为变量对自杀组和对照组进行分类。作者发现，在日本的一个社交网站中（Social Networking Services），群体数量、局部聚类系数和同质性等变量对自杀意念有更大的影响。Chattopadhayy [41] 应用皮尔斯自杀意向量表（PSIS）评估自杀因素并进行回归分析。问卷可以作为表格特征的良好来源。Delgado-Gomez等人[42]采用国际人格障碍筛查问卷和霍姆斯-克拉赫社会适应量表（SRRS）。Chattopadhyay [43]建议应用多层前馈神经网络，如图2（a）所示，根据Beck的自杀意图量表对自杀意向指标进行分类。
General Text Features: Another direction of feature engineering is to extract features from unstructured text. The main features consist of N-gram features, knowledge-based features, syntactic features, context features, and class-speciﬁc features [44]. Abboute et al. [45] built a set of keywords for vocabulary feature extraction within nine suicidal topics. Okhapkina et al. [46] built a dictionary of terms about suicidal content. They introduced term frequency-inverse document frequency (TF-IDF) matrices for messages and a singular value decomposition (SVD) for matrices. Mulholland and Quinn [47] extracted vocabulary and syntactic features to build a classiﬁer to predict the likelihood of a lyricist’s suicide. Huang et al. [48] built a psychological lexicon dictionary by extending HowNet (a commonsense word collection) and used a support vector machine (SVM) to detect cybersuicide in Chinese microblogs. The topic model [49] is incorporated with other machine learning techniques for identifying suicide in Sina Weibo. Ji et al. [17] extracted several informative sets of features, including statistical, syntactic, linguistic inquiry and word count (LIWC), word embedding, and topic features, and then put the extracted features into classiﬁers, as shown in Fig. 2(b), where four traditional supervised classiﬁers are compared. Shing et al. [13] extracted several features as a bag of words (BoWs), empath, readability, syntactic features, topic model posteriors, word embeddings, LIWC, emotion features, and mental disease lexicon.
2）一般文本特征：特征工程的另一个方向是从非结构化文本中提取特征。主要特征包括N-gram特征、基于知识的特征、句法特征、上下文特征和类特定特征[44]。Aboute等人[45]在9个自杀主题中构建了一组词汇特征提取关键词。Okhapkina等人[46]建立了一本关于自杀内容的词典。他们为消息引入了术语频率逆文档频率（TF-IDF）矩阵和矩阵的奇异值分解（SVD）。Mulholland和Quinn [47] 提取了词汇量和句法特征来构建分类器来预测抒情诗人自杀的可能性。Huang等人[48]通过扩展知网（常识词汇集）建立了心理词典，并使用支持向量机（SVM）对中文微博中的网络自杀进行了检测。主题模型[49]与其他机器学习技术相结合，用于识别新浪微博中的自杀行为。Ji等人[17]提取多个信息特征集，包括统计特征、句法特征、语言查询和词数特征（LIWC）、单词嵌入和主题特征，然后将提取的特征放入分类器中，如图2（b）所示，其中比较了四个传统的监督分类器。Shing等人[13]提取了若干特征作为袋词（BoWs）、移情、可读性、句法特征、主题模式后验、词嵌入、LIWC、情感特征和精神疾病词汇。

Models for SID with feature engineering include SVM [44], ANNs [50], and conditional random ﬁeld (CRF) [51]. Tai and Chiu [50] selected several features, including the history of suicide ideation and self-harm behavior, religious belief, family status, mental disorder history of candidates, and their family. Pestian et al. [52] compared the performance of different multivariate techniques with features of word counts, POS, concepts, and readability scores. Similarly, Ji et al. [17] compared four classiﬁcation methods of logistic regression, random forest, gradient boosting decision tree, and XGBoost. Braithwaite et al. [53] validated that machine learning algorithms can effectively identify high suicidal risk.
具有特征工程的SID模型包括SVM[44]、ANNs[50]和条件随机场 (CRF) [51]。Tai和Chiu[50] 选择了几个特征，包括自杀意念和自残行为史、宗教信仰、家庭状况、精神障碍史，和他们的家人。Pestian等人 [52] 比较了不同多元技术的表现与字数、词性、概念和可读性得分的特征。同样，Ji等人[17]比较了逻辑回归、随机森林、梯度提升决策树和XGBoost四种分类方法。Braithwaite等人[53]验证了机器学习算法能够有效识别高自杀风险。
Affective Characteristics: Affective characteristics are among the most distinct differences between those who attempt suicide and healthy individuals, which has drawn considerable attention from both computer scientists and mental health researchers. To detect the emotions in suicide notes, Liakata et al. [51] used manual emotion categories, including anger, sorrow, hopefulness, happiness/ peacefulness, fear, pride, abuse, and forgiveness. Wang et al. [44] employed combined characteristics of both factual (two categories) and sentimental aspects (13 categories) to discover ﬁne-grained sentiment analysis. Similarly, Pestian et al. [52] identiﬁed emotions of abuse, anger, blame, fear, guilt, hopelessness, sorrow, forgiveness, happiness, peacefulness, hopefulness, love, pride, thankfulness, instructions, and information. Ren et al. [14] proposed a complex emotion topic model and applied it to analyze accumulated emotional traits in suicide blogs and to detect suicidal intentions from a blog stream. Especially, the authors studied accumulate emotional traits, including emotion accumulation, emotion covariance, and emotion transition among eight basic emotions of joy, love, expectation, surprise, anxiety, sorrow, anger, and hate with a ﬁve-level intensity.
3）情感特征：情感特征是自杀未遂者与健康个体之间最显著的差异之一，这引起了计算机科学家和心理健康研究人员的极大关注。为了检测自杀笔记中的情绪，Liakata等人[51] 使用手动情绪类别，包括愤怒、悲伤、希望、快乐/和平、恐惧、骄傲、虐待和宽恕。Wang等人[44]采用事实（两个范畴）和情感方面（13个范畴）的组合特征来发现细粒度的情感分析。同样，Pestian等人[52]识别出虐待、愤怒、责备、恐惧、内疚、绝望、悲伤、宽恕、幸福、和平、希望、爱、骄傲、感恩、指示和信息。Ren等人[14]提出了一个复杂情绪话题模型，并将其应用于分析自杀博客中累积的情绪特征，并从博客流中检测自杀意图。特别地，学者们研究了基于快乐、爱、期待、惊喜、焦虑、悲伤、愤怒、憎恨等八种基本情绪的情绪积累、情绪协方差和情绪转换等情绪积累特征。

图2。特征工程方法说明。（a）神经网络与特征工程。（b）特征工程分类器。

C. Deep Learning深度学习
Deep learning has been a great success in many applications, including computer vision, NLP, and medical diagnosis. In the ﬁeld of suicide research, it is also an important method for automatic SID and suicide prevention. It can effectively learn text features automatically without sophisticated feature engineering techniques. At the same time, some also take extracted features into deep neural networks (DNNs); for example, Nobles et al. [54] fed psycholinguistic features and word occurrence into the multilayer perceptron (MLP). Popular DNNs include convolutional neural networks (CNNs), recurrent neural networks (RNNs), and bidirectional encoder representations from transformers (BERT), as shown in Fig. 3(a)–©. Natural language text is usually embedded into distributed vector space with popular word embedding techniques, such as word2vec [55] and GloVe [56]. Shing et al. [13] applied user-level CNN with the ﬁlter size of 3, 4, and 5 to encode users’ posts. The long short-term memory (LSTM) network, a popular variant of RNN, is applied to encode textual sequences and then process for classiﬁcation with fully connected layers [17].
深度学习在计算机视觉、自然语言处理和医学诊断等许多领域都取得了巨大的成功。在自杀研究领域，它也是自动SID和自杀预防的重要方法。它可以有效地自动学习文本特征且没有复杂的特征工程技术。同时，有的还将提取的特征提取到深度神经网络（DNNs）中；例如，Nobles等人[54]将心理语言学特征和单词出现情况输入多层感知器（MLP）。常用的DNN包括卷积神经网络（CNNs）、递归神经网络（RNNs）和来自transformers的双向编码器表示（BERT），如图3（a）-（C）所示。自然语言文本通常是通过流行的词嵌入技术嵌入到分布式向量空间中，如word2vec [55]和GloVe[56]。Shing等人[13]应用用户级CNN，过滤器大小为3、4和5，对用户帖子进行编码。长短期记忆（LSTM）网络是RNN的一个流行变体，它被用来对文本序列进行编码，通过全连接层进行分类[17]。

Recent methods introduce other advanced learning paradigms to integrate with DNNs for SID. Ji et al. [57] proposed model aggregation methods for updating neural networks, i.e., CNNs and LSTMs, targeting to detect suicidal ideation in private chatting rooms. However, decentralized training relies on coordinators in chatting rooms to label user posts for supervised training, which can only be applied to minimal scenarios. One possible better way is to use unsupervised or semisupervised learning methods. Benton et al. [16] predicted suicide attempt and mental health with neural models under the framework of multitask learning by predicting the gender of users as an auxiliary task. Gaur et al. [58] incorporated external knowledge bases and suicide-related ontology into a text representation and gained an improved performance with a CNN model. Coppersmith et al. [59] developed a deep learning model with GloVe for word embedding, bidirectional LSTM for sequence encoding, and self-attention mechanism for capturing the most informative subsequence. Sawhney et al. [60] used LSTM, CNN, and RNN for SID. Similarly, Tadesse et al. [61] employed the LSTM-CNN model. Ji et al. [62] proposed an attentive relation network with LSTM and topic modeling for encoding text and risk indicators.
最近的方法引入了其他先进的学习范式来与用于SID的DNNs集成。Ji等人[57]提出了用于更新神经网络的模型聚合方法，即CNNs和LSTMs，旨在检测私人聊天室中的自杀意念。然而，分散式训练依赖于聊天室中的协调员为监督训练的用户帖子添加标签，这只能应用于最小的场景。一个可能更好的方法是使用无监督或半监督的学习方法。Benton等人[16]在多任务学习框架下通过预测用户性别作为辅助任务来预测自杀未遂与心理健康的神经网络模型。Gaur等人 [58]将外部知识库和自杀相关本体整合到文本表示中，并使用CNN模型获得了更好的性能。Coppersmith等人 [59] 开发了一个深度学习模型，其中包括用于单词嵌入的GloVe、用于序列编码的双向LSTM以及用于捕获信息量最大的子序列的自我注意机制。Sawhney等人 [60] 使用LSTM、CNN和RNN进行SID。同样，Tadesse等人[61]人采用LSTM-CNN模式。Ji等人[62]提出了一个与LSTM的密切关系网络和用于编码文本和风险指标的主题模型。

In the 2019 CLPsych Shared Task [63], many popular DNN architectures were applied. Hevia et al. [64] evaluated the effect of pretraining using different models, including GRU-based RNN. Morales et al. [65] studied several popular deep learning models, such as CNN, LSTM, and Neural Network Synthesis (NeuNetS). Matero et al. [66] proposed a dual-context model using hierarchically attentive RNN and BERT.
在2019年CLPsych共享任务中[63]，应用了许多流行的DNN架构。Hevia等人[64] 使用不同的模型（包括基于GRU的RNN）评估预训练的效果。Morales等人 [65] 研究了几种流行的深度学习模型，如CNN、LSTM和神经网络综合技术（NeuNetS）。Matero等人[66]提出了一种基于分层注意RNN和BERT的双重语境模型。

Another subdirection is the so-called hybrid method that cooperates minor feature engineering with representation learning techniques. Chen et al. [67] proposed a hybrid classiﬁcation model of the behavioral model and the suicide language model. Zhao et al. [68] proposed the D-CNN model taking word embedding and external tabular features as inputs for classifying suicide attempters with depression.
另一个分支是所谓的混合方法，它将次要特征工程与表示学习技术相结合。Chen等人[67]提出了行为模型和自杀语言模型的混合分类模型。Zhao等 [68]提出了以词嵌入和外部表格特征为输入的D-CNN模型来对自杀未遂者进行抑郁分类。

D. Summary总结
The popularization of machine learning has facilitated research on SID from multimodal data and provided a promising way for effective early warning. Current research focuses on text-based methods by extracting features and deep learning for automatic feature learning. Researchers widely use many canonical NLP features, such as TF-IDF, topics, syntactic, affective characteristics, readability, and deep learning models, such as CNN and LSTM. Those methods, especially DNNs with automatic feature learning, boosted predictive performance and preliminary success on suicidal intention understanding. However, some methods may only learn statistical cues and lack of commonsense. The recent work [58] incorporated external knowledge using knowledge bases and suicide ontology for knowledge-aware suicide risk assessment. It took a remarkable step toward knowledge-aware detection.
机器学习的普及促进了多模态数据SID的研究，为有效的早期预警提供了一条有希望的途径。目前的研究主要集中在基于文本的特征提取和自动特征学习的深度学习方法上。研究者们广泛地使用了许多典型的NLP特征，如TF-IDF、主题、句法、情感特征、可读性和深度学习模型，如CNN和LSTM。这些方法，特别是具有自动特征学习的DNNs，提高了自杀意图的预测能力和在自杀意图理解上取得初步成功。然而，有些方法可能只学习统计线索，缺乏常识。最近的工作 [58]结合了外部知识，使用知识库和自杀本体来进行有知识意识的自杀风险评估。它朝着知识感知检测迈出了显著的一步。

3. APPLICATIONS ON DOMAINS
Many machine learning techniques have been introduced for SID. The relevant extant research can also be viewed according to the data source. Speciﬁc applications cover a wide range of domains, including questionnaires, electronic health records (EHRs), suicide notes, and online user content. Fig. 4 shows some examples of data source for SID, where Fig. 4(a) lists selected questions of the “International Personal Disorder Examination Screening Questionnaire” (IPDE-SQ) adapted from [42], Fig. 4(b) shows selected patient’s records from [69], Fig. 4© shows a suicide note from a website, 5 and Fig. 4(d) shows a tweet and its corresponding comments from Twitter.com. Nobles et al. [54] identiﬁed suicide risk using text messages. Some researchers also developed softwares for suicide prevention. Berrouiguet et al. [70] developed a mobile application for health status self-report. Meyer et al. [71] developed an e-PASS Suicidal Ideation Detector (eSID) tool for medical practitioners. Shah et al. [72] utilized social media videos and studied multimodal behavioral markers.
许多机器学习技术已经被引入到SID中。现有的相关研究也可以根据数据来源进行查看。具体的应用程序涵盖了广泛的领域，包括问卷调查、电子健康记录（EHRs）、自杀笔记和在线用户内容。图4显示了SID的一些数据源示例，其中图4（a）列出了国际个人障碍检查筛查问卷（IPDE-SQ）的部分问题，该问卷来自文献[42]，图4（b）选自文献[69]中患者的记录，图4（c）显示了一个网站的自杀笔记，图4（d）显示了一条来自推特的帖子和相应的评论。Nobles等人[54]通过短信识别自杀风险。一些研究人员还开发了自杀预防软件。Berrouiguet等人[70] 开发了一个用于健康状况自我报告的移动应用程序。Meyer等人。71为执业医师开发了一种e-PASS自杀意念检测仪（eSID）工具。Shah等人[72]利用社交媒体视频并研究多模态行为标记。

图4. SID的内容示例（a）问卷调查（b）电子健康记录（c）自杀笔记（d）推特

A. Questionnaires
Mental disorder scale criteria, such as DSM-IV, 6 ICD-10,7 and the IPDE-SQ, provides good tool for evaluating an individual’s mental status and their potential for suicide. Those criteria and examination metrics can be used to design questionnaires for self-measurement or face-to-face clinician-patient interview.
精神障碍量表标准，如DSM-IV、ICD-10、IPDE-SQ，为评估个人的心理状态和自杀可能性提供了很好的工具。这些标准和检查指标可用于设计自测问卷或临床诊断。

To study the assessment of suicidal behavior, Delgado-Gomez et al. [10] applied and compared the IPDE-SQ and the “Barrat’s Impulsiveness Scale” (version 11, BIS-11) to identify people likely to attempt suicide. The authors also conducted a study on individual items from those two scales. The BIS-11 scale has 30 items with four-point ratings, while the IPDE-SQ in DSM-IV has 77 true-false screening questions. Furthermore, Delgado-Gomez et al. [42] introduced the “Holmes–Rahe SRRS” and the IPDE-SQ as well to two comparison groups of suicide attempters and nonsuicide attempters. The SRRS consists of 43 ranked life events of different levels of severity. Harris et al. [73] surveyed understanding suicidal individuals’ online behaviors to assist suicide prevention. Sueki [74] conducted an online panel survey among Internet users to study the association between suicide-related Twitter use and suicidal behavior. Based on the questionnaire results, they applied several supervised learning methods, including linear regression, stepwise linear regression, decision trees, Lars-en, and SVMs, to classify suicidal behaviors.
为了研究自杀行为的评估，Delgado Gomez等人 [10] 应用并比较了IPDE-SQ和Barrat的冲动性量表（第11版，BIS-11）来识别可能企图自杀的人。作者还对这两个量表中的个别项目进行了研究。BIS-11量表有30个项目为4分评级，而DSM-IV中的IPDE-SQ有77个对错筛选问题。此外，Delgado Gomez等人 [42] 引入了福尔摩斯-克拉赫SRRS和IPDE-SQ，以及自杀未遂者和非自杀未遂者的两个比较组。SRRS由43个不同严重程度的生活事件组成。Harris等人 [73] 全面了解了自杀个人的在线行为来帮助预防自杀。Sueki [74] 在互联网用户中进行了一项在线小组调查，研究自杀相关Twitter的使用与自杀行为之间的关系。基于问卷调查结果，他们运用了几种监督学习方法，包括线性回归、逐步线性回归、决策树、Lars-en和支持向量机对自杀行为进行分类。

B. Electronic Health Records
The increasing volume of EHRs has paved the way for machine learning techniques for suicide attempter prediction. Patient records include demographical information and diagnosis-related history, such as admissions and emergency visits. However, due to the data characteristics, such as sparsity, variable length of clinical series, and heterogeneity of patient records, many challenges remain in modeling medical data for suicide attempt prediction. Besides, the recording procedures may change because of the change of healthcare policies and the update of diagnosis codes.
EHRs的不断增加为机器学习技术预测自杀未遂者铺平了道路。患者记录包括人口统计信息和与诊断相关的历史记录，如入院和急诊。然而，由于数据的稀疏性、临床序列长度的不确定性以及患者记录的异质性等特点，对自杀未遂预测的医学数据建模仍然存在许多挑战。此外，由于医疗政策的变化和诊断代码的更新，记录程序可能会发生变化。

There are several works of predicting suicide risk based on EHRs [75], [76]. Tran et al. [69] proposed an integrated suicide risk prediction framework with a feature extraction scheme, risk classiﬁers, and risk calibration procedure. Explicitly, each patient’s clinical history is represented as a temporal image. Iliou et al. [77] proposed a data preprocessing method to boost machine learning techniques for suicide tendency prediction of patients suffering from mental disorders. Nguyen et al. [78] explored real-world administrative data of mental health patients from the hospital for short- and medium-term suicide risk assessments. By introducing random forests, gradient boosting machines, and DNNs, the authors managed to deal with high dimensionality and redundancy issues of data. Although the previous method gained preliminary success, Iliou et al. [77] and Nguyen et al. [78] have a limitation on the source of data, which focuses on patients with mental disorders in their historical records. Bhat and Goldman-Mellor [79] used an anonymized general EHR data set to relax the restriction on patient’s diagnosis-related history and applied neural networks as a classiﬁcation model to predict suicide attempters.
有几项工作是基于EHRs来预测自杀风险的[75], [76]。Tran等人[69] 提出了一个结合特征提取主题、风险分类和风险校准程序的自杀风险预测框架。明确地说，每个病人的临床病史都被表示为一个时间图像。Iliou等人[77] 提出了一种数据预处理方法，以促进机器学习技术对心理障碍患者自杀倾向的预测。Nguyen等人[78] 对医院的心理健康患者的实际管理数据进行了短期和中期自杀风险评估。通过引入随机森林、梯度提升机和DNNs，作者成功地处理了数据的高维和冗余问题。虽然前面的方法取得了初步成功，但Iliou等人[77]和Nguyen等人[78]在数据来源上存在局限性，这些数据集中在历史记录中有心理障碍的患者。Bhat 和 Goldman-Mellor[79] 使用匿名的一般EHR数据集来放宽对患者诊断相关历史的限制，并应用神经网络作为分类模型来预测自杀未遂者。

C. Suicide Notes
Suicide notes are the written notes left by people before committing suicide. They are usually written on letters and online blogs and recorded in audio or video. Suicide notes provide material for NLP research. Previous approaches have examined suicide notes using content analysis [52], sentiment analysis [44], [80], and emotion detection [51]. Pestian et al. [52] used transcribed suicide notes with two groups of completers and elicitors from people who have a personality disorder or potential morbid thoughts. White and Mazlack [81] analyzed word frequencies in suicide notes using a fuzzy cognitive map to discern causality. Liakata et al. [51] employed machine learning classiﬁers to 600 suicide messages with varied length, different readability quality, and multiclass annotations.
自杀笔记是人们在自杀前留下的书面笔记。它们通常写在信件和在线博客上，并以音频或视频形式记录下来。自杀笔记为NLP研究提供了素材。以前的方法使用内容分析 [52]、情绪分析 [44], [80] 和情绪检测 [51] 来研究自杀笔记。Pestian等人 [52] 人使用了两组完成者和激发者的转录自杀笔记，这些人来自有人格障碍或潜在病态想法的人。White和Mazlack[81]使用模糊认知图分析自杀笔记中的词频，以辨别因果关系。Liakata等人[51]人使用机器学习分类器对600条不同长度、不同可读性质量和多类注释的自杀信息进行分类。

Emotion in text provides sentimental cues of suicidal ideation understanding. Desmet and Hoste [82] conducted a ﬁne-grained emotion detection on suicide notes of 2011 i2b2 task. Wicentowski and Sydes [83] used an ensemble of maximum entropy classiﬁcation. Wang et al. [44] and Kovaˇcevi´ c et al. [84] proposed hybrid machine learning and rule-based method for the i2b2 sentiment classiﬁcation task in suicide notes.
文本中的情感提供了理解自杀意念的情感线索。Desmet和Hoste [82]对2011年i2b2任务的自杀笔记进行了细粒度的情绪检测。Wicentowski和Sydes [83] 使用了最大熵分类的集合。Wang等人[44]和Kovaˇcevi´ c等人[84] 提出了自杀笔记i2b2情感分类任务的混合机器学习和基于规则的方法。

In the age of cyberspace, more suicide notes are now written in the form of web blogs and can be identiﬁed as carrying the potential risk of suicide. Huang et al. [29] monitored online blogs from MySpace.com to identify at-risk bloggers. Schoene and Dethlefs [85] extracted linguistic and sentiment features to identity genuine suicide notes and comparison corpus.
在网络时代，越来越多的自杀笔记是以网络博客的形式写出来的，可以被认定有潜在的自杀风险。Huang等人[29]监控在线博客MySpace.com网站中有风险的博主。Schoene和Dethlefs [85] 提取了语言和情感特征来识别真实的自杀笔记并比较了语料库。

D. Online User Content
The widespread use of mobile Internet and social networking services facilitates people’s expressing their life events and feelings freely. As social websites provide an anonymous space for online discussion, an increasing number of people suffering from mental disorders turn to seek for help. There is a concerning tendency that potential suicide victims post their suicidal thoughts on social websites, such as Facebook, Twitter, Reddit, and MySpace. Social media platforms are becoming a promising tunnel for monitoring suicidal thoughts and preventing suicide attempts [86]. Massive user-generated data provide a good source to study online users’ language patterns. Using data mining techniques on social networks and applying machine learning techniques provide an avenue to understand the intent within online posts, provide early warnings, and even relieve a person’s suicidal intentions.
移动互联网和社交网络服务的广泛应用，使人们能够自由地表达自己的生活事件和情感。由于社交网站提供了一个匿名的在线讨论空间，越来越多的精神障碍患者求助于寻求帮助。有一种令人关注的倾向是，潜在的自杀者会在社交网站上发布他们的自杀想法，比如Facebook、Twitter、Reddit和MySpace。社交媒体平台成为监测自杀想法和防止自杀未遂的一条有希望的途径[86]。大量的用户生成的数据为研究在线用户的语言模式提供了一个很好的来源。在社交网络上使用数据挖掘技术和应用机器学习技术提供了一种途径，可以了解在线帖子中的潜在意图，提供早期预警，甚至可以缓解一个人的自杀意念。

Twitter provides a good source for research on suicidality. O’Dea et al. [12] collected tweets using the public API and developed automatic suicide detection by applying logistic regression and SVM on TF-IDF features. Wang et al. [87] further improved the performance with effective feature engineering. Shepherd et al. [88] conducted psychology-based data analysis for contents that suggests suicidal tendencies in Twitter social networks. The authors used the data from an online conversation called #dearmentalhealthprofessionals.
Twitter为研究自杀行为提供了一个很好的来源。O’Dea等人[12]使用公共API收集推文，并在TF-IDF特征上应用逻辑回归和支持向量机开发了自动自杀检测。Wang等人[87]通过有效的特征工程进一步提高了性能。Shepherd等人[88]对Twitter社交网络中暗示自杀倾向的内容进行了基于心理学的数据分析。作者使用了一个名为dearmentalhealthprofessionals的在线对话中的数据。

Another famous platform Reddit is an online forum with topic-speciﬁc discussions has also attracted much research interest for studying mental health issues [89] and suicidal ideation [37]. A community on Reddit called SuicideWatch is intensively used for studying suicidal intention [17], [90]. De Choudhury et al. [90] applied a statistical methodology to discover the transition from mental health issues to suicidality. Kumar et al. [91] examined the posting activity following the celebrity suicides, studied the effect of celebrity suicides on suicide-related contents, and proposed a method to prevent the high-proﬁle suicides.
另一个著名的平台Reddit是一个具有主题特定讨论的在线论坛，也吸引了许多研究兴心理健康问题 [89] 和自杀意念[37]的学者。Reddit上的一个名为“自杀观察”的社区被集中用于研究自杀意图 [17], [90]。De Choudhury等人[90]应用一种统计方法来发现从心理健康问题到自杀的转变。Kumar等人[91]调查了名人自杀后的发帖活动，研究了名人自杀对自杀相关内容的影响，并提出了防止高调自杀的方法。

Many pieces of research [48], [49] work on detecting suicidal ideation in Chinese microblogs. Guan et al. [92] studied user proﬁle and linguistic features for estimating suicide probability in Chinese microblogs. There also remains some work using other platforms for SID. For example, Cash et al. [93] conducted a study on adolescents’ comments and content analysis on MySpace. Steaming data provides a good source for user pattern analysis. Vioulès et al. [3] conducted user- and post-centric behavior analysis and applied a martingale framework to detect sudden emotional changes in the Twitter data stream for monitoring suicide warning signs. Ren et al. [14] used the blog stream collected from public blog articles written by suicide victims to study the accumulated emotional information.
许多研究[48], [49] 都致力于检测中国微博中的自杀意念。Guan等人[92] 研究了中国微博中用于估计自杀概率的用户特征和语言特征。还有一些工作需要使用其他平台来实现SID。例如，Cash等人 [93]在MySpace上进行了一项青少年评论和内容分析的研究。流式数据为用户模式分析提供了一个很好的来源。Vioulès等人 [3] 进行了用户和后中心行为分析，并应用鞅框架检测Twitter数据流中的突然情绪变化，以监控自杀警告信号。Ren等人[14] 利用从自杀者撰写的公开博客文章中收集的博客流来研究积累的情感信息。

E. Summary
Applications of SID mainly consist of four domains: questionnaires, EHRs, suicide notes, and online user content. Table II gives a summary of categories, data sources, and methods. Among these four main domains, questionnaires and EHRs require self-report measurement or patient–clinician interactions and rely highly on social workers or mental health professions. Suicide notes have a limitation on immediate prevention, as many suicide attempters commit suicide in a short time after they write suicide notes. However, they provide a good source for content analysis and the study of suicide factors. The last online user content domain is one of the most promising ways of early warning and suicide prevention when empowered with machine learning techniques. With the rapid development of digital technology, user-generated content will play a more important role in SID. Other forms of data, such as health data generated by wearable devices, can be very likely to help with suicide risk monitoring in the future.
SID的应用主要包括四个领域：调查问卷、电子健康记录、自杀笔记和在线用户内容。表二总结了分类、数据来源和方法。在这四个主要领域中，问卷调查和电子健康记录需要自我报告测量或患者-临床互动，并且高度依赖社会工作者或心理健康专业人员。自杀笔记对即时预防有局限性，因为许多自杀者在写自杀笔记后很短时间内自杀。然而，它们为内容分析和自杀因素研究提供了一个很好的来源。最后一个用机器学习研究在线用户内容是一个最有希望的早期预警和自杀预防方法。随着数字技术的飞速发展，用户生成内容将在SID中扮演越来越重要的角色。其他形式的数据，例如由可穿戴设备生成的健康数据，很有可能在未来帮助监测自杀风险。

IV. TASKS AND DATA SETS
In this section, we summarize speciﬁc tasks in SID and other suicide-related tasks about mental disorders. Some tasks, such as reasoning suicidal messages, generating a response, and suicide attempters’ detection on a social graph, may lack benchmarks for evaluation. However, they are critical for effective detection. We propose these tasks together with the current research direction and call for contribution to these tasks from the research community. Meanwhile, an elaborate list of data sets for currently available tasks is provided, and some potential data sources are also described to promote the research efforts.
在这一节中，我们将总结SID中的具体任务和其他与自杀有关的心理障碍任务。有些任务，如推理自杀信息、生成响应、在社交图上检测企图自杀者，可能缺乏评估基准。然而，它们对有效检测至关重要。我们提出这些任务和当前的研究方向，并呼吁研究界为这些任务作出贡献。同时，为当前可用任务提供了一份精心制作的数据集列表，并对一些潜在的数据源进行了描述，以促进研究工作的开展。

A. Tasks

Suicide Text Classiﬁcation: The ﬁrst task—suicide text classiﬁcation—can be viewed as a domain-speciﬁc application of general text classiﬁcation, which includes binary and multiclass classiﬁcation. Binary suicidality classiﬁcation simply determines text with suicidal ideation or not, while multiclass suicidality classiﬁcation conducts ﬁne-grained suicide risk assessment. For example, some studies divide suicide risk into four levels: no, low, moderate, and severe. Alternatively, it can also consider four types of class labels according to mental and behavioral procedures, i.e., nonsuicidal, suicidal thoughts/wishes, suicidal intentions, and suicidal act/plan.
1）自杀文本分类：自杀文本分类的第一个任务可以看作是通用文本分类的一个特定领域的应用，包括二元分类和多元分类。二元自杀分类只是简单地确定是否有自杀意念的文本，而多元自杀分类则进行细粒度的自杀风险评估。例如，一些研究将自杀风险分为四个等级：没有、低、中、重度。或者，它也可以根据心理和行为程序考虑四种类型的分类标签，即非自杀性的、自杀性的想法/愿望、自杀意图和自杀行为/计划。

Another subtask is risk assessment by learning from multiaspect suicidal posts. Adopting the deﬁnition of characteristics of suicidal messages, Gilat et al. [94] manually tagged suicidal posts with multiaspect labels, including mental pain, cognitive attribution, and level of suicidal risk. Mental pain includes the loss of control, acute loneliness, emptiness, narcissistic wounds, irreversibility loss of energy, and emotional ﬂooding, scaled into [0, 7]. Cognitive attribution is the frustration of needs associated with interpersonal relationships, or there is no indication of attribution.
另一个子任务是通过学习多方面自杀帖子进行风险评估。采用自杀信息特征的定义，Gilat等人[94]给多方面的自杀帖子人工贴标签，包括精神痛苦、认知归因和自杀风险水平。心理痛苦包括失控、极度孤独、空虚、自恋的创伤、不可逆转的能量损失和情绪激动，按比例分为0至7分。认知归因是指与人际关系相关的需求的挫折感，或没有归因的迹象。

Reasoning Suicidal Messages: Massive data mining and machine learning algorithms have achieved remarkable outcomes by using DNNs. However, simple feature sets and classiﬁcation models are not predictive enough to detect complicated suicidal intentions. Machine learning techniques require reasoning suicidal messages to have a more in-depth insight into suicidal factors and the innermost being from textual posts. This task aims to employ interpretable methods to investigate suicidal factors and incorporate them with commonsense reasoning, which may improve the prediction of suicidal factors. Speciﬁc tasks include automatic summarization of suicide factor, ﬁnding an explanation of suicidal risk in mental pain, and cognitive attribution aspects associated with suicide.
2）推理自杀信息：大量数据挖掘和机器学习算法使用DNNs取得了显著的成果。然而，简单的特征集和分类模型不足以预测复杂的自杀意图。机器学习技术需要推理自杀信息，以便从文本帖子中更深入地洞察自杀因素和内心深处。本研究旨在运用可解释的方法来研究自杀因素，并将其与常识推理相结合，以提高自杀因素的预测能力。具体任务包括自动总结自杀因素，寻找心理疼痛中自杀风险的解释，以及与自杀相关的认知归因方面。
Suicide Attempter Detection: The two tasks mentioned earlier focus on a single text itself. However, the primary purpose of SID is the identify suicide attempters. Thus, it is vital to achieving user-level detection, which consists of two folds, i.e., user-level multiinstance suicidality detection and suicide attempt detection on a graph. The former takes a bag of posts from individuals as input and conducts multiinstance learning over a bag of messages. The later identiﬁes suicide attempters in a speciﬁc social graph built by the interaction between users in social networks. It considers the relationship between social users and can be regarded as a node classiﬁcation problem in a graph.
3）自杀企图者检测：前面提到的两个任务集中在单个文本本身。然而，SID的主要目的是识别自杀企图者。因此，实现用户级检测至关重要，它包括两个方面，图上的用户级多实例自杀行为检测和自杀企图检测。前者以个人的一袋帖子作为输入，通过一袋消息进行多实例学习。后者识别自杀企图者在一个特定的社会关系图中由用户在社交网络中的互动建立。它考虑了社会用户之间的关系，可以看作是一个图中的节点分类问题。
Generating Response: The ultimate goal of SID is intervention and suicide prevention. Many people with suicidal intentions tend to post their suffering at midnight. Another task is generating a thoughtful response for counseling potential suicidal victims to enable immediate social care and relieve their suicidal intention. Gilat et al. [94] introduced eight types of response strategies; they are emotional support, offering group support, empowerment, interpretation, cognitive change inducement, persuasion, advising, and referring. This task requires machine learning techniques, especially sequence-to-sequence learning, to have the ability to adopt effective response strategies to generate better response and eliminate people’s suicidality. When social workers or volunteers go back online, this response generation technique can also generate hints for them to compose a thoughtful response.
4）生成响应：SID的最终目标是干预和自杀预防。许多有自杀意图的人往往会在午夜发帖。另一个任务是劝告潜在的自杀者，并产生一个深思熟虑的回应，使他们能够立即得到社会照顾，并减轻他们的自杀意图。Gilat等人 [94] 提出了八种应对策略：情感支持、提供群体支持、授权、解释、认知变化诱导、说服、建议和咨询。这项任务需要机器学习技术，特别是序列到序列的学习，能够采取有效的反应策略来产生更好的反应，消除人们的自杀行为。当社会工作者或志愿者回到网上时，这种生成响应技术也能产生提示，让他们做出深思熟虑的回应。
Mental Disorders and Self-Harm Risk: Suicidal ideation has a strong relationship with a mental health issue and self-harm risks. Thus, detecting severe mental disorders or self-harm risks is also an important task. Such works include depression detection [95], self-harm detection [96], stressful periods and stressor events detection [97], building knowledge graph for depression [98], and correlation analysis on depression and anxiety [99]. Corresponding subtasks in this ﬁeld are similar to suicide text classiﬁcation in Section IV-A1.
5）精神障碍和自残风险：自杀意念与心理健康问题和自残风险有密切关系。因此，发现严重精神障碍或自残风险也是一项重要任务。这些工作包括抑郁检测[95]、自残检测[96]、应激期和应激事件检测 [97]、构建抑郁知识图[98]、抑郁与焦虑相关分析[99]。该领域的相应子任务类似于第IV-A1节中的自杀文本分类。

B. Data Sets

Text Data:
a) Reddit: Reddit is a registered online community that aggregates social news and online discussions. It consists of many topic categories, and each area of interest within a topic is called a subreddit. A subreddit called “Suicide Watch” (SW) 8 is intensively used for further annotation as positive samples. Posts without suicidal content are sourced from other popular subreddits. Ji et al. [17] released a data set with 3549 posts with suicidal ideation. Shing et al. [13] published their UMD Reddit Suicidality data set with 11129 users and total 1 556 194 posts and sampled 934 users for further annotation. Alada˘ g et al. [100] collected 508 398 posts using Google Cloud BigQuery and manually annotated 785 posts.
a） Reddit:Reddit是一个注册的在线社区，收集社会新闻和在线讨论。由许多主题类别组成，主题中的每个感兴趣的区域都称为subreddit。一个名为自杀观察（SW）的subreddit被密集地用作阳性样本的进一步注释。没有自杀内容的帖子来源于其他热门网站。Ji等人[17]公布了一组有3549个有自杀意念的帖子。Shing等人[13]发布了他们的UMD Reddit自杀率数据集，有11129个用户和总计1556194个帖子，并抽样934个用户进行进一步注释。Alada˘ g等人[100]使用google cloud bigquery收集了508398个帖子，并手动标注了785个帖子。

b) Twitter: Twitter is a popular social networking service, where many users also talk about their suicidal ideation. Twitter is quite different from Reddit in post length, anonymity,and the way communication and interaction. Twitter user data with suicidal ideation and depression are collected by Coppersmith et al. [33]. Ji et al. [17] collected an imbalanced data set of 594 tweets with suicidal ideation out of a total of 10 288 tweets. Vioulès et al. collected 5446 tweets using Twitter streaming API [3], of which 2381 and 3065 tweets are from the distressed users and normal users, respectively. However, most Twitter-based data sets are no longer available as per the policy of Twitter.
b） Twitter: Twitter是一个很受欢迎的社交网络服务，很多用户也会谈论他们的自杀意图。Twitter与Reddit在帖子长度、匿名性、以及沟通和互动的方式有很大不同。Coppersmith等人 [33]收集了Twitter用户自杀和抑郁的数据。Ji等人[17]收集了一组不平衡的数据集，在总共10288条推文中，有自杀念头的推文占594条。Vioulès等人[3]使用TwitterAPI收集了5446条推文，其中2381条和3065条tweet分别来自苦恼用户和正常用户。然而，根据Twitter的政策，大多数基于Twitter的数据集不再可用。

c) ReachOut: ReachOut Forum9 is a peer support platform provided by an Australian mental health care organization. The ReachOut data set [101] was ﬁrst released in the CLPsych17 shared task. Participants were initially given a training data set of 65 756 forum posts, of which 1188 were annotated manually with the expected category, and a test set of 92 207 forum posts, of which 400 were identiﬁed as requiring annotation. The speciﬁc four categories are described as follows.
c） ReachOut: ReachOut论坛是一个由澳大利亚心理卫生保健组织提供的同侪支持平台。ReachOut数据集 [101]首次在CLPsych17共享任务中发布。参与者最初获得的训练数据集包括65756个论坛帖子，其中1188个帖子被手动标注为预期类别，另一个测试集为92207个论坛帖子，其中400个被确定需要注释。具体的四个类别如下。

Crisis: The author or someone else is at risk of harm.
Red: The post should be responded to as soon as possible.
Amber: The post should be responded to at some point if the community does not rally strongly around it.
Green: The post can be safely ignored or left for the community to address.
1）危机：作者或其他人有受到伤害的危险。
2）红色：帖子应该尽快回复。
3）琥珀色：如果社区没有强烈的关注它，这个帖子应该在某个时候得到回应。
4）绿色：这个帖子可以被安全地忽略或者留给社区来解决。
EHR: EHR data contain demographical information, admissions, diagnostic reports, and physician notes. A collection of EHRs is from the California emergency department encounter and hospital admission. It contains 522 056 anonymous EHR records from California-resident adolescents. However, it is not public for access. Bhat and Goldman-Mellor [79] ﬁrst used these records from 2006 to 2009 to predict the suicide attempt in 2010. Haerian et al. [102] selected 280 cases for evaluation from the clinical data warehouse (CDW) and WebCIS database at the NewYork Presbyterian Hospital/Columbia University Medical Center. Tran et al. [69] studied emergency attendances with a least one risk assessment from the Barwon Health data warehouse. The selected data set contains 7746 patients and 17771 assessments.
2） EHR：EHR数据包含人口统计信息、入院、诊断报告和医生注释。一组EHR是从加州急诊科和住院部收集的。它包含522056份来自加利福尼亚州居住青少年的匿名EHR记录。但是，它不是公开的。Bhat和Goldman-Mellor [79] 首次利用2006年至2009年的这些记录预测2010年的自杀企图。Haerian等人[102]从纽约长老会医院/哥伦比亚大学医学中心的临床数据仓库（CDW）和WebCIS数据库中选择280例进行评估。Tran等人 [69] 研究了Barwon健康数据仓库紧急状态下至少一个风险评估。所选数据集包含7746名患者和17771个评估。
Mental Disorders: Mental health issues, such as depression without effective treatment, can turn into suicidal ideation. For the convenience of research on mental disorders, we also list several resources for monitoring mental disorders. The eRisk data set of early detection of signs of depression [103] is released by the ﬁrst task of the 2018 workshop at the Conference and Labs of the Evaluation Forum (CLEF), which focuses on early risk prediction on the Internet. 10 This data set contains sequential text from social media. Another data set is the Reddit Self-reported Depression Diagnosis (RSDD) data set [95], which contains 9000 diagnosed users with depression
and approximately 107000 matched control users.
3）精神障碍：精神健康问题，如抑郁症，如果没有有效的治疗，会演变成自杀意念。为了方便精神障碍研究，我们还列出了几种精神障碍监测来源。早期发现抑郁症征兆的eRisk数据集 [103] 是由评估论坛会议和实验室（CLEF）2018年研讨会的第一项任务发布的，该论坛专注于互联网上的早期风险预测。此数据集包含来自社交媒体的连续文本。另一个数据集是Reddit自报告抑郁症诊断（RSDD）数据集 [95]，其中包含9000名抑郁症诊断用户和大约107000名匹配的对照用户。

V. DISCUSSION AND FUTURE WORK
Many preliminary works have been conducted for SID, especially boosted by manual feature engineering and DNN-based representation learning techniques. However, current research has several limitations, and there are still great challenges for future work.
SID已经做了很多前期工作，特别是人工特征工程和基于DNN的表示学习技术。然而，目前的研究存在一些局限性，对今后的工作仍有很大的挑战。

A. Limitations

Data Deﬁciency: The most critical issue of current research is data deﬁciency. Current methods mainly apply supervised learning techniques that require manual annotation. However, there are not enough annotated data to support further research. For example, labeled data with ﬁne-grained suicide risk only have limited instances, and there are no multiaspect data and data with social relationships.
1）数据不足：当前研究中最关键的问题是数据不足。目前的方法主要采用有监督的学习技术，需要人工标注。然而，没有足够的注释数据来支持进一步的研究。例如，带有细粒度自杀风险的标签数据只有有限的实例，没有多方面的数据和具有社会关系的数据。
Annotation Bias: There is little evidence to conﬁrm the suicide action to obtain ground truth. Thus, current data are obtained by manual labeling with some predeﬁned annotation rules. The crowdsourcing-based annotation may lead to bias of labels. Shing et al. [13] asked experts for labeling but only obtained a limited number of labeled instances. As for the demographical data, the quality of suicide data is concerning, and mortality estimation is general death but not suicide. 11 Some cases are misclassiﬁed as accidents or death of undetermined intent.
2）注释偏差：几乎没有证据证实自杀行为以获得基本事实。因此，当前数据是通过使用一些预定义的注释规则手动标记来获得的。基于注释的众包可能导致标签的偏差。Shing等人 [13] 请求专家贴标签，但得到的标签数量有限。至于人口数据，自杀数据的质量是相关的，死亡率估计是一般死亡，而不是自杀。有些案例被错误地归类为意外事故或不明原因死亡。
Data Imbalance: Posts with suicidal intention account for a tiny proportion of massive social posts. However, most works built data sets in an approximately even manner to collect relatively balanced positive and negative samples rather than treating it as an ill-balanced data distributed.
3）数据不平衡：有自杀意图的帖子在大量社交帖子中所占比例很小。然而，大多数工作是以近似均匀的方式建立数据集，以收集相对平衡的正负样本，而不是将其视为分布不平衡的数据。

) Lack of Intention Understanding: The current statistical learning method failed to have a good understanding of suicidal intention. The psychology behind suicidal attempts is complex. However, mainstream methods focus on selecting features or using complex neural architectures to boost the predictive performance. From the phenomenology of suicidal posts in social content, machine learning methods learned statistical clues. However, they failed to reason over the risk factors by incorporating the psychology of suicide.
4）意图理解不足：目前的统计学习方法未能很好地理解自杀意图。自杀企图背后的心理是复杂的。然而，主流的方法主要集中在选择特征或使用复杂的神经网络结构来提高预测性能。从自杀帖子的现象学社会内容中，机器学习方法学习统计线索。然而，他们没有合并心理学来考虑自杀的危险因素。

B. Future Work

Emerging Learning Techniques: The advances of deep learning techniques have boosted research on SID. More emerging learning techniques, such as attention mechanism and graph neural networks, can be introduced for suicide text representation learning. Other learning paradigms, such as transfer learning, adversarial training, and reinforcement learning, can also be utilized. For example, knowledge of the mental health detection domain can be transferred for SID, and generative adversarial networks can be used to generate adversarial samples for data augmentation.
1）新兴学习技术：深度学习技术的进步推动了对SID的研究。更多新兴的学习技术，如注意机制和图神经网络，可以引入自杀文本表征学习。其他的学习模式，如迁移学习，对抗训练，强化学习，也可以利用。例如，心理健康检测领域的知识可用于SID，生成性对抗网络可用于生成对抗性样本以进行数据扩充。

In social networking services, posts with suicidal ideation are in the long tail of the distribution of different post categories. To achieve effective detection in the ill-balanced distribution of real-world scenarios, few-shot learning can be utilized to train on a few labeled posts with suicidal ideation among the large social corpus.
在社交网络服务中，有自杀意念的帖子在不同帖子类别的分布中处于长尾。为了在真实场景的不平衡分布中实现有效的检测，可以利用少样本学习来训练大量社会群体中带有自杀意念的标签帖子。

Suicidal Intention Understanding and Interpretability: Many factors are correlated with suicide, such as mental health, economic recessions, gun prevalence, daylight patterns, divorce laws, media coverage of suicide, and alcohol use. A better understanding of suicidal intention can provide a guideline for effective detection and intervention. A new research direction is to equip deep learning models with commonsense reasoning, for example, by incorporating external suicide-related knowledge bases.
2）自杀意图的理解和解释：许多因素与自杀有关，如心理健康、经济衰退、枪支泛滥、日光模式、离婚法律、媒体对自杀的报道以及饮酒。**更好地了解自杀意图可以为有效的发现和干预提供指导。**一个新的研究方向是用常识推理来装备深度学习模型，例如通过整合外部自杀相关知识库。

Deep learning techniques can learn an accurate prediction model. However, this would be a black-box model. In order to better understand people’s suicidal intentions and have a reliable prediction, new interpretable models should be developed.
深度学习技术可以学习一个精确的预测模型。然而，这将是一个黑盒模型。为了更好地了解人们的自杀意图并做出可靠的预测，需要开发新的可解释模型。

Temporal Suicidal Ideation Detection: Another direction is to detect suicidal ideation over the data stream and consider the temporal information. There exist several stages of suicide attempts, including stress, depression, suicidal thoughts, and suicidal plan. Modeling people’s posts’ temporal trajectory can effectively monitor the change of mental status and is essential for detecting early signs of suicidal ideation.
3）暂时性自杀意念侦测：另一个方向是透过数据流侦测自杀意念，并考虑时间资讯。自杀企图有几个阶段，包括压力、抑郁、自杀念头和自杀计划。对人的后颞轨迹进行建模，可以有效地监测心理状态的变化，对早期发现自杀意念的迹象至关重要。
Proactive Conversational Intervention: The ultimate aim of SID is intervention and prevention. Very little work is undertaken to enable proactive intervention. Proactive suicide prevention online (PSPO) [105] provides a new perspective with the combination of suicidal identiﬁcation and crisis management. An effective way is through conversations. Automatic response generation becomes a promising technical solution to enable timely intervention for suicidal thoughts. Natural language generation techniques can be utilized to generate counseling responses to comfort people’s depression or suicidal ideation. Reinforcement learning can also be applied for conversational suicide intervention. After suicide attempters post suicide messages (as the initial state), online volunteers and lay individuals will take action to comment on the original posts and persuade attempters to give up their suicidality. The attempter may do nothing, reply to the comments, or get their suicidality relieved. A score will be deﬁned by observing the reaction from a suicide attempter as a reward. The conversational suicide intervention uses a policy gradient for agents to generated responses with maximum rewards to best relieve people’s suicidal thoughts.
4）主动会话干预：SID的最终目的是干预和预防。几乎没有采取任何措施来进行主动干预。主动预防自杀在线（PSPO） [105]结合自杀识别和危机管理提供了一个新的视角。有效的方法是通过交谈。自动反应生成是一种很有前途的技术解决方案，可以及时干预自杀念头。自然语言生成技术可以用来产生心理咨询反应来安慰人们的抑郁或自杀意念。强化学习也可以应用于会话自杀干预。当自杀企图者发布自杀信息（初始状态）后，在线志愿者和非专业人员将采取行动对原始帖子进行评论，并劝说企图者放弃自杀行为。尝试者可能什么也不做，回复评论，或者让他们的自杀行为得到缓解。分数将通过观察自杀企图者的反应作为奖励来确定。对话式自杀干预使用政策梯度，让代理人产生最大回报的反应，以最好地缓解人们的自杀念头。

VI. CONCLUSION
Suicide prevention remains an essential task in our modern society. Early detection of suicidal ideation is an important and effective way to prevent suicide. This survey investigates existing methods for SID from a broad perspective that covers clinical methods, such as patient–clinician interaction and medical signal sensing; textual content analysis, such as lexicon-based ﬁltering and word cloud visualization; feature engineering, including tabular, textual, and affective features; and deep learning-based representation learning, such as CNN-and LSTM-based text encoders. Four main domain-speciﬁc applications on questionnaires, EHRs, suicide notes, and online user content are introduced.
自杀预防仍然是现代社会的一项重要任务。早期发现自杀意念是预防自杀的重要而有效的方法。这项调查从一个广泛的角度研究了SID的现有方法，包括临床方法，如患者临床交互和医学信号感知；文本内容分析，如基于词典的过滤和词云可视化；特征工程，包括表格、文本和情感特征；以及基于深度学习的表示学习，如基于CNN和LSTM的文本编码器。介绍了问卷调查、EHR、自杀笔记和在线用户内容的四个主要领域特定应用。

Psychological experts have conducted most work in this ﬁeld with statistical analysis and computer scientists with feature engineering-based machine learning and deep learning-based representation learning. Based on current research, we summarized existing tasks and further proposed new possible tasks. Last but not least, we discuss some limitations of current research and propose a series of future directions, including utilizing emerging learning techniques, interpretable intention understanding, temporal detection, and proactive conversational intervention.
在这一领域，心理学专家与统计分析和计算机科学家进行了大量的工作，基于特征工程的机器学习和基于深度学习的表征学习。在现有研究的基础上，我们总结了现有的任务，并进一步提出了新的可能任务。最后，我们讨论了当前研究的一些局限性，并提出了一系列未来的研究方向，包括利用新兴的学习技巧、可解释的意图理解、时间检测和主动会话干预。

Online social content is very likely to be the main channel for SID in the future. Therefore, it is essential to develop new methods, which can heal the schism between clinical mental health detection and automatic machine detection, to detect online texts containing suicidal ideation in the hope that suicide can be prevented.
在线社交内容很可能成为SID未来的主渠道。因此，有必要开发新的方法来检测含有自杀意念的在线文本，以期能够预防自杀，从而弥合临床心理健康检测与自动机器检测之间的鸿沟。

你可能感兴趣的:(Day 5. Suicidal Ideation Detection: A Review of Machine Learning Methods and Applications综述)

机器学习与深度学习间关系与区别 ℒℴѵℯ心·动ꦿ໊ོ꫞ 人工智能学习深度学习 python
一、机器学习概述定义机器学习（MachineLearning,ML）是一种通过数据驱动的方法，利用统计学和计算算法来训练模型，使计算机能够从数据中学习并自动进行预测或决策。机器学习通过分析大量数据样本，识别其中的模式和规律，从而对新的数据进行判断。其核心在于通过训练过程，让模型不断优化和提升其预测准确性。主要类型1.监督学习（SupervisedLearning）监督学习是指在训练数据集中包含输入
30天风格练习-DAY2 黄希夷
Day2（重义）在一个周日/一周的最后一天，我来到位于市中心/市区繁华地带的一家购物中心/商场，中心内人很多/熙熙攘攘。我注意到/看见一个独行/孤身一人的年轻女孩/，留着一头引人注目/长过腰际的头发，上身穿一件暗红色/比正红色更深的衣服/穿在身体上的东西。走下扶梯的时候，她摔倒了/跌向地面，在她正要站起来/让身体离开地面的时候，过长/超过一般人长度的头发被支撑身体/躯干的手掌压/按在下面，她赶紧用
第四天旅游线路预览——从换乘中心到喀纳斯湖陟彼高冈yu 基于Google earth studio 的旅游规划和预览旅游
第四天：从贾登峪到喀纳斯风景区入口，晚上住宿贾登峪；换乘中心有4路车，喀纳斯①号车，去喀纳斯湖，路程时长约5分钟；将上面的的行程安排进行动态展示，具体步骤见”Googleearthstudio进行动态轨迹显示制作过程“、“Googleearthstudio入门教程”和“Googleearthstudio进阶教程“相关内容，得到行程如下所示：Day4-2-480p
基于社交网络算法优化的二维最大熵图像分割智能算法研学社（Jack旭）智能优化算法应用图像分割算法 php 开发语言
智能优化算法应用：基于社交网络优化的二维最大熵图像阈值分割-附代码文章目录智能优化算法应用：基于社交网络优化的二维最大熵图像阈值分割-附代码1.前言2.二维最大熵阈值分割原理3.基于社交网络优化的多阈值分割4.算法结果：5.参考文献：6.Matlab代码摘要：本文介绍基于最大熵的图像分割，并且应用社交网络算法进行阈值寻优。1.前言阅读此文章前，请阅读《图像分割：直方图区域划分及信息统计介绍》htt
Day17笔记-高阶函数 ~在杰难逃~ Python 笔记 python 开发语言 pycharm 数据分析
高阶函数【重点掌握】函数的本质：函数是一个变量，函数名是一个变量名，一个函数可以作为另一个函数的参数或返回值使用如果A函数作为B函数的参数，B函数调用完成之后，会得到一个结果，则B函数被称为高阶函数常用的高阶函数：map(),reduce(),filter(),sorted()1.map()map(func,iterable)，返回值是一个iterator【容器，迭代器】func:函数iterab
Day1笔记-Python简介&标识符和关键字&输入输出 ~在杰难逃~ Python python 开发语言大数据数据分析数据挖掘
大家好，从今天开始呢，杰哥开展一个新的专栏，当然，数据分析部分也会不定时更新的，这个新的专栏主要是讲解一些Python的基础语法和知识，帮助0基础的小伙伴入门和学习Python，感兴趣的小伙伴可以开始认真学习啦！一、Python简介【了解】1.计算机工作原理编程语言就是用来定义计算机程序的形式语言。我们通过编程语言来编写程序代码，再通过语言处理程序执行向计算机发送指令，让计算机完成对应的工作，编程
自我意识徐立华
----读帕克.帕尔默《教学勇气》（P18----19）5.铸造我们的学科帕克.帕尔默说学科知识对我们的自身认同和外部世界有启发意义。学科会铸造我们。“在我们与学科的命题概念和学科的生活框架相遇之前，自我意识知识处于潜伏状态，通过回想学科是怎样唤醒自我意识，我们就可以找回教学心灵。”《教学勇气》（P18）我们的自我意识像冰山表面下无限延伸的冰层，常常处于潜伏状态。但是在我们对所教授的学科进行深入思
简单了解 JVM 记得开心一点啊 jvm
目录♫什么是JVM♫JVM的运行流程♫JVM运行时数据区♪虚拟机栈♪本地方法栈♪堆♪程序计数器♪方法区/元数据区♫类加载的过程♫双亲委派模型♫垃圾回收机制♫什么是JVMJVM是JavaVirtualMachine的简称，意为Java虚拟机。虚拟机是指通过软件模拟的具有完整硬件功能的、运行在一个完全隔离的环境中的完整计算机系统（如：JVM、VMwave、VirtualBox）。JVM和其他两个虚拟机
JVM、JRE和 JDK：理解Java开发的三大核心组件 Y雨何时停T Java java
Java是一门跨平台的编程语言，它的成功离不开背后强大的运行环境与开发工具的支持。在Java的生态中，JVM（Java虚拟机）、JRE（Java运行时环境）和JDK（Java开发工具包）是三个至关重要的核心组件。本文将探讨JVM、JDK和JRE的区别，帮助你更好地理解Java的运行机制。1.JVM：Java虚拟机（JavaVirtualMachine）什么是JVM？JVM，即Java虚拟机，是Ja
【韩玲】领读小组2月21日打卡文集合 9ce517ee104c
【输出者】健芳【打卡素材】对财富说是Day50【作者】［澳］奥南朵【标题】让努力看得见【字数】7931建立新信念做事情失败的原因都由我们自己无意识的旧有的信念去掌控着。故步自封，没让自己去更新迭代自己的信念。建立新的信念，相信自己的财富会越来越多。2改掉坏习惯以前的懒床、刷手机、煲剧、这些都是封锁自己思想的坏习惯，以为这样就可以让自己过得充实。其实真的不是，而是带给自己一种伤害，阻碍自己努力上进的
【六项精进】20180930 Kinnfoo
一、学习与实践1.付出不亚于任何人的努力2.要谦虚，不要骄傲3.要每天反省4.活着，就要感谢5.积善行，思利他6.不要有感性的烦恼二、今日分享今天是9月的最后一个工作日，每个支行都在拼命地冲刺业绩，刚好今天同桌休假了，我就替他审核客户。一个上午就进件了6个客户，审核通过5个。这5个审核通过的客户里，1个因费率没谈拢而放弃，1个因车上发现GPS而被拒单，最终确认可放款的只有3个客户。感叹支行同事的不
可以赚钱的app，你们都在用哪些？配音新手圈
1.七猫免费小说2.有柿3.番茄小说兼职副业推荐公众号，配音新手圈，声优配音圈，新配音兼职圈，配音就业圈，鼎音副业，有声新手圈，每天更新各种远程工作与在线兼职，职位包括：写手、程序开发、剪辑、设计、翻译、配音、无门槛、插画、翻译、等等。。。每日更新兼职。4.速读免费小说5.得间免费小说6.快手7.快手极速8.抖音火山版（可提0.2，可能我懒赚的慢，但真不推荐）9.拼多多10.淘宝11.点淘12.美
张芝华49天共修 - 草稿李娟AINI
祈禱、靜心、源代碼編程、觀想發願四根支柱，運用靈性能量的助力，讓夢想和渴望在最大向度中輕鬆實現。共修群指定书籍:1.能断金刚麦克格西2.新世界：灵性的觉醒埃克哈特·托尔3.爱是一切的答案芭芭拉迪安吉莉思4.完美的爱,不完美的关系约翰•威尔伍德5.爱的业力法则麦克格西6.漫画《金刚经》蔡志忠7.蔡志忠典藏国学漫画系列(套装共6册)作业:全部在共修群里完成，并请保存好自己的作业。l一周三次共修觉察作业
vue3中el-table中点击图片放大时，被表格覆盖叫我小鹏呀 vue.js javascript 前端
问题：vue3中el-table中点击图片放大时，被表格覆盖。解决方法：el-image添加preview-teleported
Python入门之Lesson2:Python基础语法小熊同学哦 Python入门课程 python 开发语言算法数据结构青少年编程
目录前言一.介绍1.变量和数据类型2.常见运算符3.输入输出4.条件语句5.循环结构二.练习三.总结前言欢迎来到《Python入门》系列博客的第二课。在上一课中，我们了解了Python的安装及运行环境的配置。在这一课中，我们将深入学习Python的基础语法，这是编写Python代码的根基。通过本节内容的学习，你将掌握变量、数据类型、运算符、输入输出、条件语句等Python编程的基础知识。一.介绍1
Day777--【这届年轻人很懒，“但在你面前，我可以多受些麻烦。”】卖小妞的饼干
图片发自App中学时期，会因为朋友一句误会，手写三页信件解释来龙去脉，必须每天一起上学放学吃饭上洗手间，永远腻在一起才能彰显关系密切。而现在，相比每段关系都闹得轰轰烈烈，我们更喜欢顺其自然。懒得耗费大量时间去维护一段关系，懒得在与朋友产生矛盾后去辩解，懒得因为朋友做出背叛友谊的行为而愤怒诘问。不再把目光放在朋友的数量上，更在乎每一个还在身边的人是否饱含真心。我们都在期待着更舒服自然没有杂质的关系，
阅读《别说你懂思维导图》21～23章day27 Ling宝尔
合理期待——思维导图的应用效果很多人问我，思维导图真的有用么？我常常回答，如果你觉得是它“没用”，一定是因为你没“用”，有“用”才“有用”。实际上，学习思维导图和学习木工、驾驶等技能型学习一样，都要经历从了解到应用、从应用到受益的过程。在使用前，我们很多人的思维处于“无意识的低效”状态，经过一段时间的学习，虽然掌握了思维导图的基本使用方法，但可能并没有太好的效果，这个阶段可称为“有意识的低效”状态
放松的一天 4da9b7687fa0
20190325总结起床07:20图片发自App睡觉:23:00天气:晴今日任务清单学习·信息·阅读•水滴阅读Day40Alice’sAdventuresinWonderlandChapter6.2图片发自App•BBC跟读训练营Day24图片发自App图片发自App图片发自App•潘多拉口语训练营Day6Wow.Whatabigboy!•文化知识学习今日无•阅读时间地狱健康·饮食·锻炼•饮食目标
2019-04-10 shuaigefeng
姓名：王林锋企业名称：三亚蔚蓝时代实业有限公司组别：420期努力6组【日精进打卡251天】【知~学习、诵读】《六项精进》2遍，累计256遍《大学》2遍，累计220遍【经典分享】1、想过成功、想过失败、也想过放弃。【行~实践】一、修身：（对自己个人）1.拍打腿部两侧50下，舌顶上颚50下。2.坚持诵读、阅读。3.坚持锻炼、按时睡觉起床。4.控制健康饮食，饭后走动30分钟。5.每天反省自己的思想和行为
100天持续行动—Day01 Richard_DL
今天开始站着学习，发现效率大幅提升。把fast.ai的Lesson1的后半部分和Lesson2看完了。由于Keras版本和视频中的不一致，运行notebook时经常出现莫名其妙的错误，导致自己只动手实践了视频中的一小部分内容。为了赶时间，我打算先把与CNN相关的视频过一遍。然后尽快开始做自己的项目。明天继续加油，争取把Lesson3和Lesson4看完。
感恩日记6.10 舞动精灵Yummy
1.感恩我早起如约而至去参加讲师互助活动2.感恩我在书店也能放开自己做呼吸法，3.感恩所有今天来参加EHE讲师们，谢谢他们的如约而至，谢谢4.妈妈准备丰盛的早餐和晚餐，谢谢，谢谢5.感恩钱宝宝让我可以坐公交车，可以买好吃的，谢谢，谢谢6.感恩男神一直的在和陪伴，谢谢，谢谢7.感恩陈彧老师的大爱接纳，谢谢，谢谢8.感恩林子姐姐总是对我那么的宠爱和关心，谢谢，谢谢9.感恩林子姐姐的看见，谢谢，谢谢10
AI论文题目生成器怎么用？9款论文写作网站简单3步搞定小猪包333 写论文人工智能深度学习计算机视觉
在当今信息爆炸的时代，AI写作工具的出现极大地提高了写作效率和质量。本文将详细介绍9款优秀的论文写作网站，并重点推荐千笔-AIPassPaper。一、千笔-AIPassPaper千笔-AIPassPaper是一款功能强大的AI论文生成器，基于最新的自然语言处理技术，能够一键生成高质量的毕业论文、开题报告等文本内容。它不仅提供智能选题、文献推荐和论文润色等功能，还具有较高的用户评价。其文献综述生成功
天上的每颗星星都是殉难者的生命摘到月亮了吗
1.“小孩怎么一见到哥哥就脸红”——《偷偷藏不住》2.“娶她是我高中就认定的事”──《痛仰》3.“他扛下一切风雨先你一步成长然后来宠你他真的好喜欢你”——《偏偏宠爱》4.天上的每颗星星都是殉难者的生命——耳东兔子5.未经允许擅自特别喜欢你不好意思了——《默读》6.“许星纯你猜我在干嘛我在等风等风热吻你”——《等风热吻你》7.“我爱你如鲸向海似鸟投林不可避免退无可退”——陆霆骁《许你万丈光芒好》8.
骗子的基本套路山水本原
骗子的基本特点：1.善于观察，通过各种途径迅速了解到你的爱好、渴望和欲望。2.根据你的爱好，渴望或欲望，给你展示得到满足的一条途径。3.走上这条途径，你必须付出代价。4.付出代价后，你不会获得渴望、爱好或欲望的满足。5.骗子的出发点从来就不是如何满足你的爱好，渴望或欲望，而是为了满足他自己的。5是骗子的本质，4是你得到的结果，3是骗子的目标，2是骗子的方法，1是骗子的基本功。上述特点，大家结合具体
代码随想录Day 41|动态规划之买卖股票问题，leetcode题目121. 买卖股票的最佳时机、122. 买卖股票的最佳时机Ⅱ、123. 买卖股票的最佳时机Ⅲ LluckyYH 动态规划 leetcode 算法数据结构
提示：DDU，供自己复习使用。欢迎大家前来讨论~文章目录买卖股票的最佳时机相关题目题目一：121.买卖股票的最佳时机解题思路：题目二：122.买卖股票的最佳时机II解题思路：题目三：123.买卖股票的最佳时机III解题思路总结买卖股票的最佳时机相关题目题目一：121.买卖股票的最佳时机[[121.买卖股票的最佳时机](https://leetcode.cn/problems/combination
Day_11 ROC_bird.. 算法
面试题16.15.珠玑妙算-力扣（LeetCode）/***Note:Thereturnedarraymustbemalloced,assumecallercallsfree().*///下标和对应位置的值都一样，answer[0]+1,对应位置的值猜对了，但是下标不对，answer[1]+1int*masterMind(char*solution,char*guess,int*returnSiz
Android实现监听事件的方法 Amy木婉清
1.通过内部类实现2.通过匿名内部类实现3.通过事件源所在类实现4.通过外部类实现5.布局文件中onclick属性(针对点击事件)1.通过内部类实现代码:privateButtonmBtnEvent;//oncreate中mBtnEvent.setOnClickListener(newOnClick());//内部类实现监听classOnClickimplementsView.OnClickLis
润忻21天跨年魔力打卡D12早.正月初九《感恩日记》尧安妈咪呀
1.感恩阳光透过窗帘照进我家。2.感恩暖气让我们家如此温暖。3.感恩水让我们可以洗漱。4.感恩老公对我的包容。5.感恩及时进入工作状态。6.感恩孩子们一如既往的可爱。7.感恩孩子们对我的爱。8.感恩一切顺利。8.感恩工作高效。9.感恩钱宝宝。10.感恩一切的发生。
2021-11-26 雅雅_201d
感恩活着的美好幸福喜悦。谢谢谢谢谢谢感恩打卡261.感恩亲爱的自己每天坚定的动态静心，让我充满力量充满奇迹。谢谢谢谢谢谢2.感恩遇见的每一个在走心的人儿，高频的能量在深深的吸引着我引领着我。谢谢谢谢谢谢3.感恩范先生有着良好的生命状态，和高能量的智慧传递，让我愉悦安心。谢谢谢谢谢谢4.感恩宝贝女儿，积极正向充满力量，每一声回应都让我更加的安心更加的有力量。谢谢谢谢谢谢5.感谢亲爱的爸爸妈妈非常喜悦
感恩日记Day 236 E姐小酒窝
图片发自App1.感谢16愿意听我"唠叨"，人与人的信任和托付在此刻弥足珍贵珍贵；2.感谢到家就能吃上美味的中餐，辛苦妞爸；3.感谢妞中午愿意听我叼叼旅行中的事儿；4.感谢星巴克就在家附近，让我一杯回魂；5.感谢美妞总结我和爸爸优点并说两者揉和就很棒了。6.感谢看到妞第一天数学成绩后淡定的自己；将责任归回妞自己并总结行动。7.感谢林姐姐信任，又定变啦减脂套餐。8.感谢梅姐知道我旅行回来后约我吃饭；
Spring中@Value注解，需要注意的地方无量 spring bean @Value xml
Spring 3以后,支持@Value注解的方式获取properties文件中的配置值，简化了读取配置文件的复杂操作 1、在applicationContext.xml文件(或引用文件中)中配置properties文件 <bean id="appProperty" class="org.springframework.beans.fac
mongoDB 分片开窍的石头 mongodb
mongoDB的分片。要mongos查询数据时候先查询configsvr看数据在那台shard上，configsvr上边放的是metar信息，指的是那条数据在那个片上。由此可以看出mongo在做分片的时候咱们至少要有一个configsvr,和两个以上的shard（片）信息。第一步启动两台以上的mongo服务 &nb
OVER(PARTITION BY)函数用法 0624chenhong oracle
这篇写得很好，引自 http://www.cnblogs.com/lanzi/archive/2010/10/26/1861338.html OVER(PARTITION BY)函数用法 2010年10月26日 OVER(PARTITION BY)函数介绍开窗函数 &nb
Android开发中，ADB server didn't ACK 解决方法一炮送你回车库 Android开发
首先通知：凡是安装360、豌豆荚、腾讯管家的全部卸载，然后再尝试。一直没搞明白这个问题咋出现的，但今天看到一个方法，搞定了！原来是豌豆荚占用了 5037 端口导致。参见原文章：一个豌豆荚引发的血案——关于ADB server didn't ACK的问题简单来讲，首先将Windows任务进程中的豌豆荚干掉，如果还是不行，再继续按下列步骤排查。 &nb
canvas中的像素绘制问题换个号韩国红果果 JavaScript canvas
pixl的绘制，1.如果绘制点正处于相邻像素交叉线，绘制x像素的线宽，则从交叉线分别向前向后绘制x/2个像素，如果x/2是整数，则刚好填满x个像素，如果是小数，则先把整数格填满，再去绘制剩下的小数部分，绘制时，是将小数部分的颜色用来除以一个像素的宽度，颜色会变淡。所以要用整数坐标来画的话（即绘制点正处于相邻像素交叉线时），线宽必须是2的整数倍。否则会出现不饱满的像素。 2.如果绘制点为一个像素的
编码乱码问题灵静志远 java jvm jsp 编码
1、JVM中单个字符占用的字节长度跟编码方式有关，而默认编码方式又跟平台是一一对应的或说平台决定了默认字符编码方式；2、对于单个字符：ISO-8859-1单字节编码，GBK双字节编码，UTF-8三字节编码；因此中文平台(中文平台默认字符集编码GBK)下一个中文字符占2个字节，而英文平台(英文平台默认字符集编码Cp1252(类似于ISO-8859-1))。 3、getBytes()、getByte
java 求几个月后的日期 darkranger calendar getinstance
Date plandate = planDate.toDate(); SimpleDateFormat df = new SimpleDateFormat("yyyy-MM-dd"); Calendar cal = Calendar.getInstance(); cal.setTime(plandate); // 取得三个月后时间 cal.add(Calendar.M
数据库设计的三大范式（通俗易懂） aijuans 数据库复习
关系数据库中的关系必须满足一定的要求。满足不同程度要求的为不同范式。数据库的设计范式是数据库设计所需要满足的规范。只有理解数据库的设计范式，才能设计出高效率、优雅的数据库，否则可能会设计出错误的数据库. 目前，主要有六种范式：第一范式、第二范式、第三范式、BC范式、第四范式和第五范式。满足最低要求的叫第一范式，简称1NF。在第一范式基础上进一步满足一些要求的为第二范式，简称2NF。其余依此类推。
想学工作流怎么入手 atongyeye jbpm
工作流在工作中变得越来越重要，很多朋友想学工作流却不知如何入手。很多朋友习惯性的这看一点，那了解一点，既不系统，也容易半途而废。好比学武功，最好的办法是有一本武功秘籍。研究明白，则犹如打通任督二脉。系统学习工作流，很重要的一本书《JBPM工作流开发指南》。本人苦苦学习两个月，基本上可以解决大部分流程问题。整理一下学习思路，有兴趣的朋友可以参考下。 1 首先要
Context和SQLiteOpenHelper创建数据库百合不是茶 android Context创建数据库
一直以为安卓数据库的创建就是使用SQLiteOpenHelper创建,但是最近在android的一本书上看到了Context也可以创建数据库,下面我们一起分析这两种方式创建数据库的方式和区别,重点在SQLiteOpenHelper 一:SQLiteOpenHelper创建数据库: 1,SQLi
浅谈group by和distinct bijian1013 oracle 数据库 group by distinct
group by和distinct只了去重意义一样，但是group by应用范围更广泛些，如分组汇总或者从聚合函数里筛选数据等。譬如：统计每id数并且只显示数大于3 select id ,count(id) from ta
vi opertion 征客丶 mac opration vi
进入 command mode （命令行模式）按 esc 键再按 shift + 冒号注：以下命令中带 $ 【在命令行模式下进行】，不带 $ 【在非命令行模式下进行】一、文件操作 1.1、强制退出不保存 $ q! 1.2、保存 $ w 1.3、保存并退出 $ wq 1.4、刷新或重新加载已打开的文件 $ e 二、光标移动 2.1、跳到指定行数字
【Spark十四】深入Spark RDD第三部分RDD基本API bit1129 spark
对于K/V类型的RDD,如下操作是什么含义？ val rdd = sc.parallelize(List(("A",3),("C",6),("A",1),("B",5)) rdd.reduceByKey(_+_).collect reduceByKey在这里的操作，是把
java类加载机制 BlueSkator java 虚拟机
java类加载机制 1.java类加载器的树状结构引导类加载器 ^ | 扩展类加载器 ^ | 系统类加载器 java使用代理模式来完成类加载，java的类加载器也有类似于继承的关系，引导类是最顶层的加载器，它是所有类的根加载器，它负责加载java核心库。当一个类加载器接到装载类到虚拟机的请求时，通常会代理给父类加载器，若已经是根加载器了，就自己完成加载。虚拟机区分一个Cla
动态添加文本框 BreakingBad 文本框
<script> var num=1; function AddInput() { var str=""; str+="<input
读《研磨设计模式》-代码笔记-单例模式 bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ public class Singleton { } /* * 懒汉模式。注意，getInstance如果在多线程环境中调用，需要加上synchronized，否则存在线程不安全问题 */ class LazySingleton
iOS应用打包发布常见问题 chenhbc ios iOS发布 iOS上传 iOS打包
这个月公司安排我一个人做iOS客户端开发，由于急着用，我先发布一个版本，由于第一次发布iOS应用，期间出了不少问题，记录于此。 1、使用Application Loader 发布时报错：Communication error.please use diagnostic mode to check connectivity.you need to have outbound acc
工作流复杂拓扑结构处理新思路 comsci 设计模式工作算法企业应用 OO
我们走的设计路线和国外的产品不太一样，不一样在哪里呢？国外的流程的设计思路是通过事先定义一整套规则(类似XPDL)来约束和控制流程图的复杂度(我对国外的产品了解不够多，仅仅是在有限的了解程度上面提出这样的看法)，从而避免在流程引擎中处理这些复杂的图的问题，而我们却没有通过事先定义这样的复杂的规则来约束和降低用户自定义流程图的灵活性，这样一来，在引擎和流程流转控制这一个层面就会遇到很
oracle 11g新特性Flashback data archive daizj oracle
1. 什么是flashback data archive Flashback data archive是oracle 11g中引入的一个新特性。Flashback archive是一个新的数据库对象，用于存储一个或多表的历史数据。Flashback archive是一个逻辑对象，概念上类似于表空间。实际上flashback archive可以看作是存储一个或多个表的所有事务变化的逻辑空间。
多叉树:2-3-4树 dieslrae 树
平衡树多叉树,每个节点最多有4个子节点和3个数据项,2,3,4的含义是指一个节点可能含有的子节点的个数,效率比红黑树稍差.一般不允许出现重复关键字值.2-3-4树有以下特征: 1、有一个数据项的节点总是有2个子节点(称为2-节点) 2、有两个数据项的节点总是有3个子节点(称为3-节
C语言学习七动态分配 malloc的使用 dcj3sjt126com c language malloc
/* 2013年3月15日15:16:24 malloc 就memory(内存) allocate(分配)的缩写本程序没有实际含义，只是理解使用 */ # include <stdio.h> # include <malloc.h> int main(void) { int i = 5; //分配了4个字节静态分配 int * p
Objective-C编码规范[译] dcj3sjt126com 代码规范
原文链接 : The official raywenderlich.com Objective-C style guide 原文作者 : raywenderlich.com Team 译文出自 : raywenderlich.com Objective-C编码规范译者 : Sam Lau
0.性能优化-目录 frank1234 性能优化
从今天开始笔者陆续发表一些性能测试相关的文章，主要是对自己前段时间学习的总结，由于水平有限，性能测试领域很深，本人理解的也比较浅，欢迎各位大咖批评指正。主要内容包括：一、性能测试指标吞吐量、TPS、响应时间、负载、可扩展性、PV、思考时间 http://frank1234.iteye.com/blog/2180305 二、性能测试策略生产环境相同基准测试预热等 htt
Java父类取得子类传递的泛型参数Class类型 happyqing java 泛型父类子类 Class
import java.lang.reflect.ParameterizedType; import java.lang.reflect.Type; import org.junit.Test; abstract class BaseDao<T> { public void getType() { //Class<E> clazz =
跟我学SpringMVC目录汇总贴、PDF下载、源码下载 jinnianshilongnian springMVC
----广告-------------------------------------------------------------- 网站核心商详页开发掌握Java技术，掌握并发/异步工具使用，熟悉spring、ibatis框架；掌握数据库技术，表设计和索引优化，分库分表/读写分离；了解缓存技术，熟练使用如Redis/Memcached等主流技术；了解Ngin
the HTTP rewrite module requires the PCRE library 流浪鱼 rewrite
./configure: error: the HTTP rewrite module requires the PCRE library. 模块依赖性Nginx需要依赖下面3个包 1. gzip 模块需要 zlib 库 ( 下载: http://www.zlib.net/ ) 2. rewrite 模块需要 pcre 库 ( 下载: http://www.pcre.org/ ) 3. s
第12章 Ajax（中） onestopweb Ajax
index.html <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/
Optimize query with Query Stripping in Web Intelligence blueoxygen BO
http://wiki.sdn.sap.com/wiki/display/BOBJ/Optimize+query+with+Query+Stripping+in+Web+Intelligence and a very straightfoward video http://www.sdn.sap.com/irj/scn/events?rid=/library/uuid/40ec3a0c-936
Java开发者写SQL时常犯的10个错误 tomcat_oracle java sql
1、不用PreparedStatements 　　有意思的是，在JDBC出现了许多年后的今天，这个错误依然出现在博客、论坛和邮件列表中，即便要记住和理解它是一件很简单的事。开发者不使用PreparedStatements的原因可能有如下几个：　　他们对PreparedStatements不了解　　他们认为使用PreparedStatements太慢了　　他们认为写Prepar
世纪互联与结盟有感阿尔萨斯
10月10日，世纪互联与（Foxcon）签约成立合资公司，有感。全球电子制造业巨头（全球500强企业）与世纪互联共同看好IDC、云计算等业务在中国的增长空间，双方迅速果断出手，在资本层面上达成合作，此举体现了全球电子制造业巨头对世纪互联IDC业务的欣赏与信任，另一方面反映出世纪互联目前良好的运营状况与广阔的发展前景。众所周知，精于电子产品制造（世界第一），对于世纪互联而言，能够与结盟