DS Wannabe之5-AM Project: DS 30day int prep day12

Q1. Where is the confusion matrix used? Which module would you use to show it?

Confusion Matrix

A confusion matrix is commonly used to evaluate the performance of a classification model, especially in binary and multi-class problems. It shows the relationship between the actual classes and the classes predicted by the model. In Python, the confusion_matrix function from the sklearn.metrics module can be used to produce it.

Creating a confusion matrix for a test dataset with scikit-learn
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split

# Load the data
df = pd.read_csv('https://bit.ly/3cManTi', delimiter=",")

# Extract input variables (all rows, all columns but last column)
X = df.values[:, :-1]

# Extract output column (all rows, last column)
Y = df.values[:, -1]

model = LogisticRegression(solver='liblinear')

X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size=.33,
    random_state=10)
model.fit(X_train, Y_train)
prediction = model.predict(X_test)

"""
The confusion matrix evaluates accuracy within each category.
[[truepositives falsenegatives]
 [falsepositives truenegatives]]

The diagonal represents correct predictions,
so we want those to be higher
"""
matrix = confusion_matrix(y_true=Y_test, y_pred=prediction)
print(matrix)
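To actually show the matrix as a plot rather than a printed array, here is a minimal sketch, assuming matplotlib and a recent scikit-learn (>= 1.0) are installed and reusing Y_test and prediction from the script above:

import matplotlib.pyplot as plt
from sklearn.metrics import ConfusionMatrixDisplay

# Draws the confusion matrix as a labelled heatmap.
ConfusionMatrixDisplay.from_predictions(Y_test, prediction)
plt.show()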

Q2: What is Accuracy?

Accuracy

Accuracy measures the performance of a classification model as the proportion of samples predicted correctly out of all samples. The formula is: (correctly predicted positives + correctly predicted negatives) / total number of samples.

It is the most intuitive performance measure: simply the ratio of correctly predicted observations to the total number of observations. It is tempting to say that high accuracy means the model is best, but accuracy is only a reliable measure when the dataset is symmetric, i.e. when false positives and false negatives occur at roughly the same rate.

Accuracy = (True Positives + True Negatives) / (True Positives + False Positives + False Negatives + True Negatives)
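As a quick sketch, reusing matrix, Y_test and prediction from Q1, accuracy can be computed either from the confusion matrix or directly with scikit-learn:

from sklearn.metrics import accuracy_score

# Correct predictions sit on the diagonal of the confusion matrix.
accuracy_from_matrix = matrix.diagonal().sum() / matrix.sum()

# The same value, computed directly from labels and predictions.
print(accuracy_from_matrix, accuracy_score(Y_test, prediction))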

Q3: What is Precision?

Precision

Precision measures how accurate the model's predictions of the positive class are.

The formula is: correctly predicted positives / (correctly predicted positives + negatives incorrectly predicted as positive).

It is also called the positive predictive value: the number of correct positive predictions compared to the total number of positive predictions the model makes.

Precision = True Positives / (True Positives + False Positives)
Precision = True Positives / Total Predicted Positives

In other words, it is the number of positive elements predicted correctly divided by the total number of elements predicted as positive.

Precision is a measure of exactness or quality. High precision means that most or all of the results you predicted as positive are actually correct.

Q4: What is Recall?

Recall is also called sensitivity or the true positive rate.
It is the number of positives our model predicts compared to the actual number of positives in the data.

Recall = True Positives / (True Positives + False Negatives)
Recall = True Positives / Total Actual Positives

Recall is a measure of completeness. High recall means that our model classified most or all of the actual positive elements as positive.

Q5: What is F1 Score?

F1 Score

The F1 score is the harmonic mean of precision and recall. It measures how well a model balances the accuracy of its positive predictions against its ability to find all the positives, which makes it especially useful for imbalanced classes.

The formula is: 2 * (precision * recall) / (precision + recall).

We use precision and recall together because they complement each other in describing the effectiveness of a model. The F1 score combines the two as the weighted harmonic mean of precision and recall.
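A minimal sketch computing all three metrics with scikit-learn, again reusing Y_test and prediction from Q1 (the 'macro' average is just one choice; for binary 0/1 labels the default 'binary' average also works):

from sklearn.metrics import classification_report, f1_score, precision_score, recall_score

# Per-class precision, recall and F1 in one summary table.
print(classification_report(Y_test, prediction))

# Individual scores, averaged across classes.
print(precision_score(Y_test, prediction, average='macro'))
print(recall_score(Y_test, prediction, average='macro'))
print(f1_score(Y_test, prediction, average='macro'))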


Q6: What is Bias and Variance trade-off?

Bias-Variance Trade-off

Bias is the difference between the model's predictions and the true values; variance is how sensitive the model is to small changes in the training set. The bias-variance trade-off is a core concept in machine learning: reducing bias tends to increase variance, and vice versa. The goal is to find the best balance between the two so that the model generalizes well.

Bias

Bias describes how far the predicted values are from the actual values. If the average of the predicted values is far off from the actual values, we say the model has high bias.

When our model has high bias, it means the model is too simple and does not capture the complexity of the data, thus underfitting it.

Variance

Variance shows up when our model performs well on the training set but does not do well on data it was not trained on, such as a test or validation dataset. It tells us how scattered the predicted values are around the actual values.

High variance causes overfitting, which implies that the algorithm is modelling the random noise present in the training data.

When a model has high variance, it becomes very flexible and tunes itself to the data points of the training set.

[Figure: prediction error versus model complexity, with curves for training (cyan) and test (red) error.]

This figure shows the relationship between model complexity and prediction error and illustrates the bias-variance trade-off.

  • Red curve: prediction error on the test sample.
  • Cyan curve: prediction error on the training sample.

When model complexity is low (left side of the figure), both the training error and the test error are high. The model is too simple to capture the complexity of the data, which results in high bias. This situation is usually called underfitting.

As model complexity increases (moving to the right), the training error keeps falling because the model fits the training data better and better. The test error, however, falls only up to a point and then starts to rise again: the model has become so complex that it begins to learn the noise in the training data rather than just the underlying pattern. This results in high variance, a situation usually called overfitting.

In the middle of the figure, the gap between training and test error is smallest. This is typically the "sweet spot" of model complexity: the model is neither too simple nor too complex and generalizes well to unseen data.

  • High bias, low variance (top left of the figure): the model is oversimplified and cannot capture the true structure of the data, but it behaves consistently across different training sets.
  • Low bias, high variance (top right of the figure): the model is highly complex and very sensitive to small changes in the training data, so it behaves very differently across different training sets.

Overall, the figure conveys that model selection requires balancing model complexity so as to find the best trade-off between bias and variance and minimize the overall prediction error.
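As a rough illustration of that curve, here is a sketch on synthetic data (not the dataset behind the figure): sweep a complexity parameter such as polynomial degree and compare training and test error.

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

# Synthetic noisy sine curve, for illustration only.
rng = np.random.RandomState(0)
X = np.sort(rng.uniform(0, 6, 120)).reshape(-1, 1)
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=120)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# A low degree tends to underfit (high bias, both errors high); a very
# high degree tends to fit the noise (high variance), so test error rises.
for degree in [1, 3, 15]:
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    train_err = mean_squared_error(y_train, model.predict(X_train))
    test_err = mean_squared_error(y_test, model.predict(X_test))
    print(degree, round(train_err, 3), round(test_err, 3))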


Q7. What is data wrangling? Mention three points to consider in the process.

Data wrangling is the process by which we convert and map data, changing it from its raw form into a format that is far more valuable.

Data wrangling is the first step for machine learning and deep learning. The end goal is to provide data that is actionable and to provide it as fast as possible.

There are three major things to focus on while talking about data wrangling –

1. Acquiring data

The first and probably most important step in data science is acquiring, sorting and cleaning data. This is an extremely tedious process and requires the most time.

One needs to:

  • Check if the data is valid and up-to-date.

  • Check if the data acquired is relevant for the problem at hand.

Sources for data collection: data is publicly available on various websites such as kaggle.com, data.gov, the World Bank, FiveThirtyEight datasets, AWS datasets and Google datasets.

2. Data cleaning

Data cleaning is an essential component of data wrangling and requires a lot of patience. To make the job easier, it is essential to first format the data so that it is readable by humans.

The essentials involved are:

  • Format the data to make it more readable

  • Find outliers (data points that do not match the rest of the dataset) in the data

  • Find missing values and remove them from the dataset (without this, any model being trained becomes incomplete and useless)
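A minimal pandas sketch of these cleaning steps; the file name and the salary column are hypothetical, so adapt them to your own data:

import pandas as pd

# Hypothetical file and column names, for illustration only.
df = pd.read_csv('employees.csv')

# Make the data more readable: tidy up the column names.
df.columns = [c.strip().lower().replace(' ', '_') for c in df.columns]

# Flag outliers in a numeric column with the 1.5 * IQR rule.
q1, q3 = df['salary'].quantile([0.25, 0.75])
iqr = q3 - q1
outliers = df[(df['salary'] < q1 - 1.5 * iqr) | (df['salary'] > q3 + 1.5 * iqr)]
print(len(outliers), 'potential outliers')

# Handle missing values; here we simply drop incomplete rows.
df = df.dropna()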

3. Data Computation

At times, your machine may not have enough resources to run your algorithm; for example, you might not have a GPU. In these cases, you can use publicly available services to run your algorithm. These are standard endpoints found on the web which let you use computing power remotely and process data without having to rely on your own system. An example is the Google Colab platform.

 

Q8. Why is normalization required before applying any machine learning model? What module can you use to perform normalization?

Normalization is a process that is required when an algorithm relies on something like a distance measure. Examples include clustering data, computing cosine similarities, and building recommender systems.

Normalization is not always required; it is done to prevent variables measured on a larger scale from dominating those measured on a smaller scale. For example, consider a dataset that includes employees' incomes alongside other features: the features won't be on the same scale, so clustering them directly would give distorted results. Hence, we normalize the data to prevent incorrect clustering.

A key point to note is that normalization does not distort the differences in the range of values.

A problem we might face if we don't normalize the data is that gradient descent can take a very long time to converge to the minimum.

For numerical data, normalization generally rescales values to the range 0 to 1.

The general formula is:
X_new = (x - x_min) / (x_max - x_min)

Why is normalization required before applying a machine learning model, and which module performs it?

Normalization rescales data of different magnitudes onto a common scale. This matters for many machine learning models because they are sensitive to the magnitude of the input features, and it can speed up convergence and improve performance. In Python, you can use MinMaxScaler or StandardScaler from the sklearn.preprocessing module.
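A minimal sketch of both scalers (the income values are made up for illustration):

import numpy as np
from sklearn.preprocessing import MinMaxScaler, StandardScaler

# Illustrative incomes on a much larger scale than other features.
income = np.array([[25000.0], [48000.0], [72000.0], [300000.0]])

# Min-max normalization: X_new = (x - x_min) / (x_max - x_min), range 0 to 1.
print(MinMaxScaler().fit_transform(income).ravel())

# Standardization (zero mean, unit variance) is a common alternative.
print(StandardScaler().fit_transform(income).ravel())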

Q9. What is the difference between feature selection and feature extraction?

What is the difference between feature selection and feature extraction?

Feature selection picks a subset of important features from the original set, reducing the number of features without changing their meaning. Feature extraction transforms or compresses the original data into a new feature set (often of lower dimension); the new features are transformations or combinations of the originals and may no longer carry their original meaning.

Feature selection and feature extraction are the two major ways of addressing the curse of dimensionality.

1. Feature selection:

Feature selection is used to filter a subset of input variables on which the attention should focus. Every other variable is ignored. This is something which we, as humans, tend to do subconsciously.

Many domains have tens of thousands of variables out of which most are irrelevant and redundant.

Feature selection reduces the size of the training data and the amount of computational resources used. It can significantly improve a learning algorithm's performance.

In summary, the goal of feature selection is to find an optimal feature subset. Finding the truly optimal subset is rarely feasible, but methods for estimating the importance of features exist; some Python modules, such as XGBoost, expose feature-importance scores that help with this.

2. Feature extraction

Feature extraction involves transforming the raw data into new features, which in turn supports the feature-selection step. For example, extracting bigrams from a text or extracting contours from an image, both typically unsupervised steps, are examples of feature extraction.

The general workflow is to apply feature extraction to the given data to obtain features, and then apply feature selection with respect to the target variable to pick a subset of those features. In effect, this helps improve the accuracy of a model.
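A minimal sketch of that workflow on made-up text data: bigram extraction with CountVectorizer, then chi-squared feature selection against the target (assumes scikit-learn >= 1.0 for get_feature_names_out):

from sklearn.feature_extraction.text import CountVectorizer
from sklearn.feature_selection import SelectKBest, chi2

# Toy corpus and labels, for illustration only.
texts = ["great movie loved it", "terrible movie hated it",
         "loved the acting", "hated the plot"]
labels = [1, 0, 1, 0]

# Feature extraction: turn raw text into bigram counts.
vectorizer = CountVectorizer(ngram_range=(2, 2))
X = vectorizer.fit_transform(texts)

# Feature selection: keep the k bigrams most associated with the label.
selector = SelectKBest(chi2, k=3)
X_selected = selector.fit_transform(X, labels)
print(vectorizer.get_feature_names_out()[selector.get_support()])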

Q10. Why is polarity and subjectivity an issue?

Polarity and subjectivity are terms which are generally used in sentiment analysis.

Polarity is the variation of emotion in a sentence, from negative to positive. Since sentiment analysis depends heavily on emotions and their intensity, polarity turns out to be an extremely important factor.

In most cases, opinions and sentiment analysis are evaluations. They fall under the categories of emotional and rational evaluations.

Rational evaluations, as the name suggests, are based on facts and rationality, while emotional evaluations are based on intangible responses, which are not always easy to detect.

Subjectivity in sentiment analysis is a matter of personal feelings and beliefs, which may or may not be based on fact. When there is a lot of subjectivity in a text, it must be explained and analysed in context. Polarity, by contrast, can be expressed directly as a positive, negative or neutral emotion.
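A minimal sketch using the TextBlob library (assuming it is installed), which reports both scores for a sentence: polarity in [-1, 1] from negative to positive, and subjectivity in [0, 1] from objective to subjective:

from textblob import TextBlob

# Hypothetical review sentence, for illustration only.
review = TextBlob("The battery life is disappointing, but I love the screen.")

# polarity: -1 (negative) to 1 (positive);
# subjectivity: 0 (objective) to 1 (subjective).
print(review.sentiment.polarity, review.sentiment.subjectivity)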

Q11. When would you use ARIMA?

ARIMA is a widely used statistical method which stands for Auto-Regressive Integrated Moving Average. It is generally used for analysing time-series data and for time-series forecasting. Auto-regression (AR) is a model that uses the relationship between an observation and some number of lagged observations.

A quick overview of each component:


  • Auto-Regressive (AR): a model that uses the relationship between the current observation and its own values at a number of earlier time steps. In short, it describes how the current value is influenced by its history.

  • Integrated (I): the differencing step that makes the time series stationary, usually by taking differences between consecutive observations. This helps remove trend and seasonal components so the data is better suited to model fitting.

  • Moving Average (MA): the part of the model that uses past forecast errors to predict future values. It captures the relationship between the current value and historical prediction errors.

You would typically use an ARIMA model in situations such as:

  1. Time-series forecasting: when you have a time-series dataset and want to predict future data points. This is useful in economics, stock-market prediction, meteorology, and any field that needs to forecast future trends.

  2. Data analysis: ARIMA can help you understand patterns in the data, such as seasonal variation, trends, or cyclical fluctuations.

  3. De-trending and de-seasonalizing: the differencing part of the model converts the data into a more stable form, which helps remove trend and seasonal components.

  4. Long-term forecasting: although ARIMA is more effective for short-term forecasts, it can also be used for longer horizons, especially when the data shows a clearly stable pattern.

The key to using ARIMA is making sure the time series is stationary, or can be made stationary through methods such as differencing. The model is particularly effective for non-seasonal, trended time series.

Integrated refers to differencing the raw observations, which helps make the time series stationary.

Moving Average is a model that uses the dependency between an observation and the residual errors from a moving-average model applied to lagged observations.

Note that each of these components is supplied as a parameter of the model, conventionally written as the order (p, d, q). Once fitted, the model is essentially a linear equation in the lagged values and lagged forecast errors.

The data is prepared by:

  • Differencing the observations

  • Removing trends and structures that will negatively affect the model

  • Finally, making the series stationary
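A minimal sketch with statsmodels; the series is synthetic and the (1, 1, 1) order is just an illustrative choice that in practice you would pick from ACF/PACF plots or an automated search:

import numpy as np
import pandas as pd
from statsmodels.tsa.arima.model import ARIMA

# Synthetic trending series, for illustration only.
rng = np.random.RandomState(0)
values = np.cumsum(rng.normal(loc=0.5, scale=1.0, size=100))
series = pd.Series(values, index=pd.date_range('2020-01-01', periods=100, freq='D'))

# order=(p, d, q): AR lags, differencing steps, MA lags.
model = ARIMA(series, order=(1, 1, 1))
fitted = model.fit()

# Forecast the next 10 periods.
print(fitted.forecast(steps=10))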

Exercise Solutions - spot-check Q&A from earlier study sessions


  1. How would you define machine learning?

    Machine Learning is about building systems that can learn from data. Learning means getting better at some task, given some performance measure.
  2. Can you name four types of applications where it shines?

    Machine Learning is great for complex problems for which we have no algorithmic solution, to replace long lists of hand-tuned rules, to build systems that adapt to fluctuating environments, and finally to help humans learn (e.g., data mining).
  3. What is a labeled training set?

    A labeled training set is a training set that contains the desired solution (a.k.a. a label) for each instance.
  4. What are the two most common supervised tasks?

    The two most common supervised tasks are regression and classification.
  5. Can you name four common unsupervised tasks?

    Common unsupervised tasks include clustering, visualization, dimensionality reduction, and association rule learning.
  6. What type of algorithm would you use to allow a robot to walk in various unknown terrains?

    Reinforcement Learning is likely to perform best if we want a robot to learn to walk in various unknown terrains.
  7. What type of algorithm would you use to segment your customers into multiple groups?

    If you don't know how to define the groups, then you can use a clustering algorithm (unsupervised learning) to segment your customers into clusters of similar customers.
  8. Would you frame the problem of spam detection as a supervised learning problem or an unsupervised learning problem?

    Spam detection is a typical supervised learning problem: the algorithm is fed many emails along with their labels (spam or not spam).
  9. What is an online learning system?

    An online learning system can learn incrementally, as opposed to a batch learning system.
  10. What is out-of-core learning?

    Out-of-core algorithms can handle vast quantities of data that cannot fit in a computer's main memory.
  11. What type of algorithm relies on a similarity measure to make predictions?

    An instance-based learning system learns the training data by heart; then, when given a new instance, it uses a similarity measure to find the most similar learned instances and uses them to make predictions.
  12. What is the difference between a model parameter and a model hyperparameter?

    A model parameter determines what the model will predict given a new instance, while a hyperparameter is a parameter of the learning algorithm itself, not of the model.
  13. What do model-based algorithms search for? What is the most common strategy they use to succeed? How do they make predictions?

    Model-based learning algorithms search for an optimal value for the model parameters such that the model will generalize well to new instances. They usually minimize a cost function and make predictions by feeding the new instance's features into the model's prediction function.
  14. Can you name four of the main challenges in machine learning?

    Some of the main challenges are the lack of data, poor data quality, nonrepresentative data, and excessively complex models that overfit the data.
  15. If your model performs great on the training data but generalizes poorly to new instances, what is happening? Can you name three possible solutions?

    The model is likely overfitting the training data. Possible solutions include getting more data, simplifying the model, or reducing the noise in the training data.
