临风而眠

Hands-On Machine Learning, Chap1: Machine Learning Overview

Chap1 The Machine Learning Landscape

The third edition of this book has also been published: https://github.com/ageron/handson-ml3

Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow
Introduction
- An early application: optical character recognition (OCR)
- 1990s: the spam filter
- … Now: countless Machine Learning applications

I am starting to build the habit of reading the English original,
reading it side by side with the translation…

Contents

Hands-On Machine Learning, Chap1: Machine Learning Overview
1.1 What is Machine Learning
1.2 Why Use Machine Learning?
1.3 Types of Machine Learning Systems
- 1.3.1 Supervised/Unsupervised Learning
  - Supervised Learning
  - Unsupervised learning
  - Semisupervised learning
  - Reinforcement learning
- 1.3.2 Batch and Online Learning
  - Batch learning
  - Online learning
- 1.3.3 Instance-Based Versus Model-Based Learning
  - Instance-based learning
  - Model-based learning
1.4 Main Challenges of Machine Learning
- Insufficient Quantity of Training Data
- Nonrepresentative Training Data
- Poor-Quality Data
- Irrelevant Features
- Overfitting the Training Data
- Underfitting the Training Data
1.5 Testing and Validating
Exercises
References

Examples of Applications: I did not take notes on the examples given in this part.

1.1 What is Machine Learning

Definitions
- a slightly more general definition:
  - [Machine Learning is the] field of study that gives computers the ability to learn without being explicitly programmed. —Arthur Samuel, 1959
- a more engineering-oriented one:
  - A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E. —Tom Mitchell, 1997
For example, your spam filter is a Machine Learning program that can learn to flag spam given examples of spam emails (e.g., flagged by users) and examples of regular (nonspam, also called “ham”) emails.
The examples that the system uses to learn are called the training set. Each training example is called a training instance (or sample). In this case, the task T is to flag spam for new emails, the experience E is the training data, and the performance measure P needs to be defined; for example, you can use the ratio of correctly classified emails. This particular performance measure is called accuracy, and it is often used in classification tasks.
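As a minimal sketch of the accuracy measure, here is how the ratio of correctly classified emails could be computed. The function name `accuracy` and the five example labels are illustrative assumptions, not taken from the book.

```python
def accuracy(predicted, actual):
    """Ratio of correctly classified emails (the performance measure P)."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


# Hypothetical labels for five emails: what the filter said vs. the truth.
predicted = ["spam", "ham", "spam", "ham", "ham"]
actual = ["spam", "ham", "ham", "ham", "spam"]

print(accuracy(predicted, actual))  # 3 of 5 emails are classified correctly
```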

1.2 Why Use Machine Learning?

The author again uses the spam filter example, and I think the explanation is excellent.
Machine Learning is great for:
- Problems for which existing solutions require a lot of hand-tuning or long lists of rules: one Machine Learning algorithm can often simplify code and perform better.
- Complex problems for which there is no good solution at all using a traditional approach: the best Machine Learning techniques can find a solution.
- Fluctuating environments: a Machine Learning system can adapt to new data.
- Getting insights about complex problems and large amounts of data.
An expression from the text that I want to remember: the use of "shine" below.

Another area where Machine Learning shines is for problems that either are too complex for traditional approaches or have no known algorithm.
Machine Learning is about making machines get better at some task by learning from data, instead of having to explicitly code rules.

1.3 Types of Machine Learning Systems

Haha, I have noticed lately that the term "state-of-the-art" is used a lot.

There are so many different types of Machine Learning systems that it is useful to classify them in broad categories based on:

- Whether or not they are trained with human supervision (supervised, unsupervised, semisupervised, and Reinforcement Learning)
- Whether or not they can learn incrementally on the fly (online versus batch learning)
- Whether they work by simply comparing new data points to known data points, or instead detect patterns in the training data and build a predictive model, much like scientists do (instance-based versus model-based learning)

These criteria are not exclusive; you can combine them in any way you like. For example, a state-of-the-art spam filter may learn on the fly using a deep neural network model trained using examples of spam and ham; this makes it an online, model-based, supervised learning system.

1.3.1 Supervised/Unsupervised Learning

Machine Learning systems can be classified according to the amount and type of supervision they get during training.
There are four major categories: supervised learning, unsupervised learning, semisupervised learning, and Reinforcement Learning.

Supervised Learning

typical supervised learning tasks
- classification
  - Using the spam filter again as the example: it is trained with many example emails along with their class (spam or ham), i.e., with a labeled training set.
- regression
  - predict a target numeric value, such as the price of a car, given a set of features (mileage, age, brand, etc.) called predictors.
  - To train the system, you need to give it many examples of cars, including both their predictors and their labels (i.e., their prices).
  - Fun fact: this odd-sounding name is a statistics term introduced by Francis Galton while he was studying the fact that the children of tall people tend to be shorter than their parents. Since children were shorter, he called this regression to the mean. This name was then applied to the methods he used to analyze correlations between variables.
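To make the regression task concrete, here is a small sketch that fits a straight line to labeled car examples and predicts a price for a new car. It assumes a single predictor (mileage) and made-up prices that lie exactly on a line; the function name `fit_line` and all numbers are hypothetical, and ordinary least squares is just one possible choice of method.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one predictor: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical labeled training set: predictor = mileage (thousand km),
# label = price. The values are invented for illustration.
mileage = [10, 50, 90, 130]
price = [19000, 15000, 11000, 7000]

slope, intercept = fit_line(mileage, price)

# Predict the target numeric value for a car the system has never seen.
predicted_price = slope * 100 + intercept
print(slope, intercept, predicted_price)
```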
Some of the most important supervised learning algorithms (covered in this book):
- k-Nearest Neighbors
- Linear Regression
- Logistic Regression
- Support Vector Machines (SVMs)
- Decision Trees and Random Forests
- Neural networks

Some regression algorithms can be used for classification as well, and vice versa. For example, Logistic Regression is commonly used for classification.

Unsupervised learning

The training data is unlabeled. The system tries to learn without a teacher.

Typical tasks and some of the important algorithms for each:

Clustering
- K-Means
- DBSCAN
- Hierarchical Cluster Analysis (HCA)
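As an illustration of clustering, here is a bare-bones K-Means sketch in plain Python. The points, the choice of the first two points as initial centroids, and the fixed iteration count are simplifying assumptions for this example; library implementations choose initial centroids more carefully.

```python
def kmeans(points, centroids, iterations=10):
    """Plain K-Means: alternate between assigning points and moving centroids."""
    labels = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(
                range(len(centroids)),
                key=lambda c: sum((p - q) ** 2 for p, q in zip(point, centroids[c])),
            )
            for point in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c, old in enumerate(centroids):
            members = [pt for pt, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(tuple(sum(v) / len(members) for v in zip(*members)))
            else:
                new_centroids.append(old)  # an empty cluster keeps its centroid
        centroids = new_centroids
    return labels, centroids


# Hypothetical unlabeled data: two visually separate groups of 2D points.
points = [(1, 1), (1.5, 2), (1, 0.5), (8, 8), (9, 9.5), (8.5, 9)]
labels, centroids = kmeans(points, points[:2])
print(labels)
print(centroids)
```

No labels are given to the algorithm; the group index of each point is discovered from the distances alone.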
Anomaly detection and novelty detection

Anomaly detection: the system is shown mostly normal instances during training, so it learns to recognize them; when it sees a new instance, it can tell whether it looks like a normal one or whether it is likely an anomaly.

Novelty detection: similar to anomaly detection. The difference is that novelty detection algorithms expect to see only normal data during training, while anomaly detection algorithms are usually more tolerant; they can often perform well even with a small percentage of outliers in the training set.
- One-class SVM
- Isolation Forest
Visualization and dimensionality reduction

Visualization: you feed these algorithms a lot of complex and unlabeled data, and they output a 2D or 3D representation of your data that can easily be plotted.

Dimensionality reduction: the goal is to simplify the data without losing too much information.

It is often a good idea to try to reduce the dimension of your training data using a dimensionality reduction algorithm before you feed it to another Machine Learning algorithm (such as a supervised learning algorithm). It will run much faster, the data will take up less disk and memory space, and in some cases it may also perform better.
- Principal Component Analysis (PCA)
- Kernel PCA
- Locally-Linear Embedding (LLE)
- t-distributed Stochastic Neighbor Embedding (t-SNE)
Association rule learning

The goal is to dig into large amounts of data and discover interesting relations between attributes.

- Apriori
- Eclat

Semisupervised learning

Semisupervised algorithms deal with partially labeled training data, usually a lot of unlabeled data and a little bit of labeled data.
Most semisupervised learning algorithms are combinations of unsupervised and supervised algorithms.

Reinforcement learning

The learning system, called an agent in this context, can observe the environment, select and perform actions, and get rewards in return (or penalties in the form of negative rewards). It must then learn by itself what is the best strategy, called a policy, to get the most reward over time. A policy defines what action the agent should choose when it is in a given situation.
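The agent, action, reward, and policy vocabulary can be sketched with a deliberately tiny example. It assumes an environment with a single situation and two actions with fixed rewards (a two-armed bandit), and uses an epsilon-greedy rule; the function name `run_agent`, the action names, and the reward values are all hypothetical, and real Reinforcement Learning problems involve many situations.

```python
import random


def run_agent(rewards, steps=100, epsilon=0.1, seed=0):
    """Learn which action pays best by trying actions and observing rewards."""
    rng = random.Random(seed)
    actions = list(rewards)
    value = {a: 0.0 for a in actions}  # running average reward per action
    count = {a: 0 for a in actions}
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.choice(actions)          # explore: try something at random
        else:
            action = max(actions, key=value.get)  # exploit: best action so far
        reward = rewards[action]                  # the environment responds
        count[action] += 1
        value[action] += (reward - value[action]) / count[action]
    # The learned policy: the action to choose in this (single) situation.
    return max(actions, key=value.get)


# Hypothetical environment: "left" is penalized (negative reward), "right" is rewarded.
policy = run_agent({"left": -1.0, "right": 1.0})
print(policy)
```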

1.3.2 Batch and Online Learning

I do not fully understand this subsection yet.

Another criterion used to classify Machine Learning systems is whether or not the system can learn incrementally from a stream of incoming data.

Batch learning

In batch learning, the system is incapable of learning incrementally: it must be trained using all the available data.
This will generally take a lot of time and computing resources, so it is typically done offline. First the system is trained, and then it is launched into production and runs without learning anymore; it just applies what it has learned. This is called offline learning.

Online learning

In online learning, you train the system incrementally by feeding it data instances sequentially, either individually or by small groups called mini-batches. Each learning step is fast and cheap, so the system can learn about new data on the fly, as it arrives.

on the fly: (computing) while the system is running
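The difference from batch learning may be easier to see in code. The sketch below assumes a linear model updated by one gradient step per mini-batch; the class name `OnlineLinearModel`, the method name `partial_fit`, the learning rate, and the simulated stream are illustrative assumptions rather than the book's code.

```python
class OnlineLinearModel:
    """Linear model y = w * x + b that learns one mini-batch at a time."""

    def __init__(self, learning_rate=0.05):
        self.w = 0.0
        self.b = 0.0
        self.lr = learning_rate

    def predict(self, x):
        return self.w * x + self.b

    def partial_fit(self, batch):
        """One cheap gradient step on a single mini-batch of (x, y) pairs."""
        n = len(batch)
        grad_w = sum((self.predict(x) - y) * x for x, y in batch) / n
        grad_b = sum((self.predict(x) - y) for x, y in batch) / n
        self.w -= self.lr * grad_w
        self.b -= self.lr * grad_b


def stream(passes):
    """Simulated data stream: mini-batches of (x, y) pairs drawn from y = 2x + 1."""
    for _ in range(passes):
        yield [(0, 1), (1, 3)]
        yield [(2, 5), (3, 7)]


model = OnlineLinearModel()
for batch in stream(500):
    model.partial_fit(batch)  # the model is usable after every single step

print(model.w, model.b)
```

Each mini-batch can be discarded once it has been used, which is also why the same idea works for data too large to fit in memory.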

1.3.3 Instance-Based Versus Model-Based Learning

One more way to categorize Machine Learning systems is by how they generalize. Most Machine Learning tasks are about making predictions. This means that given a number of training examples, the system needs to be able to generalize to examples it has never seen before. Having a good performance measure on the training data is good, but insufficient; the true goal is to perform well on new instances.
There are two main approaches to generalization: instance-based learning and model-based learning.

Instance-based learning

The system learns the examples by heart, then generalizes to new cases by comparing them to the learned examples (or a subset of them), using a similarity measure.
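A k-Nearest Neighbors classifier is a common example of instance-based learning. In the sketch below, Euclidean distance stands in for the similarity measure (a smaller distance means more similar), and the two email features (number of links, number of exclamation marks) and their values are made up for illustration.

```python
import math
from collections import Counter


def knn_predict(training, new_point, k=3):
    """training: list of (features, label) pairs that were learned by heart."""
    # Compare the new case to the stored examples and keep the k most similar.
    nearest = sorted(training, key=lambda item: math.dist(item[0], new_point))[:k]
    # Generalize by majority vote among those neighbors.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical emails described by (number of links, number of exclamation marks).
training = [
    ((8, 5), "spam"), ((7, 6), "spam"), ((9, 4), "spam"),
    ((1, 0), "ham"), ((0, 1), "ham"), ((2, 1), "ham"),
]

print(knn_predict(training, (6, 5)))
print(knn_predict(training, (1, 1)))
```

There is no training step here beyond storing the examples; all the work happens at prediction time.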

Model-based learning

Build a model of these examples, then use that model to make predictions.
In summary:
- You studied the data.
- You selected a model.
- You trained it on the training data (i.e., the learning algorithm searched for the model parameter values that minimize a cost function).
- Finally, you applied the model to make predictions on new cases (this is called inference), hoping that this model will generalize well.
This is what a typical Machine Learning project looks like.
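The four steps in the summary can be sketched end to end. This example assumes a linear model, mean squared error as the cost function, and plain gradient descent as the search procedure; the data, learning rate, and step count are hypothetical choices for illustration.

```python
def cost(w, b, xs, ys):
    """Mean squared error of the model y = w * x + b on the training data."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def train(xs, ys, learning_rate=0.05, steps=2000):
    """Search for the parameter values that minimize the cost function."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        grad_w = 2 * sum((w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = 2 * sum((w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


# 1. Study the data (hypothetical points that follow y = 2x + 1).
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]

# 2. Select a model: here, a line with parameters w and b.
# 3. Train it on the training data.
initial_cost = cost(0.0, 0.0, xs, ys)
w, b = train(xs, ys)
final_cost = cost(w, b, xs, ys)

# 4. Inference: apply the trained model to a new case.
prediction = w * 5 + b
print(initial_cost, final_cost, prediction)
```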

There are many different types of ML systems: supervised or not, batch or online, instance-based or model-based, and so on.
In an ML project you gather data in a training set, and you feed the training set to a learning algorithm. If the algorithm is model-based, it tunes some parameters to fit the model to the training set (i.e., to make good predictions on the training set itself), and then hopefully it will be able to make good predictions on new cases as well. If the algorithm is instance-based, it just learns the examples by heart and uses a similarity measure to generalize to new instances.

1.4 Main Challenges of Machine Learning

In short, since your main task is to select a learning algorithm and train it on some data, the two things that can go wrong are “bad algorithm” and “bad data.” Let’s start with examples of bad data.

Insufficient Quantity of Training Data

The author's writing here is quite fun, haha.

For a toddler to learn what an apple is, all it takes is for you to point to an apple and say “apple” (possibly repeating this procedure a few times). Now the child is able to recognize apples in all sorts of colors and shapes. Genius.

Machine Learning is not quite there yet; it takes a lot of data for most Machine Learning algorithms to work properly. Even for very simple problems you typically need thousands of examples, and for complex problems such as image or speech recognition you may need millions of examples (unless you can reuse parts of an existing model).

Nonrepresentative Training Data

In order to generalize well, it is crucial that your training data be representative of the new cases you want to generalize to. This is true whether you use instance-based learning or model-based learning.

It is crucial to use a training set that is representative of the cases you want to generalize to. This is often harder than it sounds: if the sample is too small, you will have sampling noise (i.e., nonrepresentative data as a result of chance), but even very large samples can be nonrepresentative if the sampling method is flawed. This is called sampling bias.
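Sampling bias can be shown with a toy calculation. The population below (the integers 1 to 100) and both sampling methods are hypothetical; the point is only that a large sample drawn with a flawed method can be further from the truth than a small sample drawn evenly.

```python
from statistics import mean

# Hypothetical population: the values 1..100.
population = list(range(1, 101))

# Flawed sampling method: it only ever reaches the upper half of the population.
# The sample is large (50 instances) but not representative.
biased_sample = [v for v in population if v > 50]

# Small but evenly spread sample: every 10th value (1, 11, ..., 91).
spread_sample = population[::10]

true_mean = mean(population)
biased_error = abs(mean(biased_sample) - true_mean)
spread_error = abs(mean(spread_sample) - true_mean)
print(true_mean, biased_error, spread_error)
```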

Poor-Quality Data

Obviously, if your training data is full of errors, outliers, and noise (e.g., due to poor-quality measurements), it will make it harder for the system to detect the underlying patterns, so your system is less likely to perform well. It is often well worth the effort to spend time cleaning up your training data. The truth is, most data scientists spend a significant part of their time doing just that.

Irrelevant Features

As the saying goes: garbage in, garbage out. Your system will only be capable of learning if the training data contains enough relevant features and not too many irrelevant ones. A critical part of the success of a Machine Learning project is coming up with a good set of features to train on. This process, called feature engineering, involves:

- Feature selection: selecting the most useful features to train on among existing features.
- Feature extraction: combining existing features to produce a more useful one (as we saw earlier, dimensionality reduction algorithms can help).
- Creating new features by gathering new data.

The challenges above are mainly about bad data; the ones below are mainly about bad algorithms.

Overfitting the Training Data

Say you are visiting a foreign country and the taxi driver rips you off. You might be tempted to say that all taxi drivers in that country are thieves. Overgeneralizing is something that we humans do all too often, and unfortunately machines can fall into the same trap if we are not careful.

This passage is well written too, haha. Thinking about it, we humans really do make the overfitting mistake all the time.

In Machine Learning this is called overfitting: it means that the model performs well on the training data, but it does not generalize well.
Overfitting happens when the model is too complex relative to the amount and noisiness of the training data. The possible solutions are:
- To simplify the model by selecting one with fewer parameters (e.g., a linear model rather than a high-degree polynomial model), by reducing the number of attributes in the training data, or by constraining the model
- To gather more training data
- To reduce the noise in the training data (e.g., fix data errors and remove outliers)
Constraining a model to make it simpler and reduce the risk of overfitting is called regularization.
The amount of regularization to apply during learning can be controlled by a hyperparameter. A hyperparameter is a parameter of a learning algorithm (not of the model). As such, it is not affected by the learning algorithm itself; it must be set prior to training and remains constant during training. If you set the regularization hyperparameter to a very large value, you will get an almost flat model (a slope close to zero); the learning algorithm will almost certainly not overfit the training data, but it will be less likely to find a good solution. Tuning hyperparameters is an important part of building a Machine Learning system.
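The effect of the regularization hyperparameter on the slope can be checked with a short sketch. It assumes ridge (L2) regularization on a one-predictor linear model, where the penalty is `alpha * slope**2` and the intercept is not penalized; the text does not name a specific regularization method, so this choice, the name `alpha`, and the data are assumptions.

```python
def fit_ridge_line(xs, ys, alpha):
    """Fit y = slope * x + intercept while penalizing alpha * slope**2."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / (sxx + alpha)  # alpha = 0 gives ordinary least squares
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying on the line y = 2x.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 10]

# alpha is the hyperparameter: it is fixed before training, not learned.
slopes = {}
for alpha in (0, 10, 1_000_000):
    slope, intercept = fit_ridge_line(xs, ys, alpha)
    slopes[alpha] = slope
    print(alpha, slope, intercept)
```

As `alpha` grows, the slope shrinks toward zero, which is the almost flat model described above.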

Underfitting the Training Data

Underfitting is the opposite of overfitting: it occurs when your model is too simple to learn the underlying structure of the data.
The main options to fix this problem are:
- Selecting a more powerful model, with more parameters
- Feeding better features to the learning algorithm (feature engineering)
- Reducing the constraints on the model (e.g., reducing the regularization hyperparameter)

The system will not perform well if your training set is too small, or if the data is not representative, noisy, or polluted with irrelevant features (garbage in, garbage out). Lastly, your model needs to be neither too simple (in which case it will underfit) nor too complex (in which case it will overfit).

Once you have trained a model, you don’t want to just “hope” it generalizes to new cases. You want to evaluate it, and fine-tune it if necessary. Let’s see how.

1.5 Testing and Validating

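The training set, test set, validation set, and cross-validation mentioned in this part were explained very clearly in Professor Zhou Zhihua's course when I studied it. My notes from then are here: Zhou Zhihua, "Introduction to Machine Learning": Model Evaluation and Selection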

This part also mentions the No Free Lunch (NFL) theorem, haha!

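NFL theorem: if an algorithm $\mathfrak{L}_{a}$ is better than another algorithm $\mathfrak{L}_{b}$ on some problems, then there must exist other problems on which $\mathfrak{L}_{b}$ is better than $\mathfrak{L}_{a}$.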

In a famous 1996 paper (“The Lack of A Priori Distinctions Between Learning Algorithms”), David Wolpert demonstrated that if you make absolutely no assumption about the data, then there is no reason to prefer one model over any other. This is called the No Free Lunch (NFL) theorem. For some datasets the best model is a linear model, while for other datasets it is a neural network. There is no model that is a priori guaranteed to work better (hence the name of the theorem). The only way to know for sure which model is best is to evaluate them all. Since this is not possible, in practice you make some reasonable assumptions about the data and you evaluate only a few reasonable models. For example, for simple tasks you may evaluate linear models with various levels of regularization, and for a complex problem you may evaluate various neural networks.

Exercises

1. How would you define Machine Learning?

Machine Learning is about building systems that can learn from data. Learning means getting better at some task, given some performance measure.
2. Can you name four types of problems where it shines?

Machine Learning is great for complex problems for which we have no algorithmic solution, to replace long lists of hand-tuned rules, to build systems that adapt to fluctuating environments, and finally to help humans learn (e.g., data mining).
3. What is a labeled training set?

A labeled training set is a training set that contains the desired solution (a.k.a. a label) for each instance.

“a.k.a.” stands for “also known as.” It is used to introduce an alternative name for someone or something. For example, “nonspam email, a.k.a. ham” means that nonspam email is also known as “ham.”
4. What are the two most common supervised tasks?

Regression and classification.
5. Can you name four common unsupervised tasks?

Clustering, visualization, dimensionality reduction, and association rule learning.
6. What type of Machine Learning algorithm would you use to allow a robot to walk in various unknown terrains?

Reinforcement Learning is likely to perform best if we want a robot to learn to walk in various unknown terrains, since this is typically the type of problem that Reinforcement Learning tackles. It might be possible to express the problem as a supervised or semi-supervised learning problem, but it would be less natural.
7. What type of algorithm would you use to segment your customers into multiple groups?

If you don’t know how to define the groups, then you can use a clustering algorithm (unsupervised learning) to segment your customers into clusters of similar customers. However, if you know what groups you would like to have, then you can feed many examples of each group to a classification algorithm (supervised learning), and it will classify all your customers into these groups.
8. Would you frame the problem of spam detection as a supervised learning problem or an unsupervised learning problem?

Spam detection is a typical supervised learning problem: the algorithm is fed many emails along with their labels (spam or not spam).
9. What is an online learning system?

An online learning system can learn incrementally, as opposed to a batch learning system. This makes it capable of adapting rapidly to both changing data and autonomous systems, and of training on very large quantities of data.
10. What is out-of-core learning?

I was lazy and skipped this topic in the notes above.

Out-of-core algorithms can handle vast quantities of data that cannot fit in a computer’s main memory. An out-of-core learning algorithm chops the data into mini-batches and uses online learning techniques to learn from these mini-batches.
11. What type of learning algorithm relies on a similarity measure to make predictions?

An instance-based learning system learns the training data by heart; then, when given a new instance, it uses a similarity measure to find the most similar learned instances and uses them to make predictions.
12. What is the difference between a model parameter and a learning algorithm’s hyperparameter?

A model has one or more model parameters that determine what it will predict given a new instance (e.g., the slope of a linear model). A learning algorithm tries to find optimal values for these parameters such that the model generalizes well to new instances. A hyperparameter is a parameter of the learning algorithm itself, not of the model (e.g., the amount of regularization to apply).
13. What do model-based learning algorithms search for? What is the most common strategy they use to succeed? How do they make predictions?

Model-based learning algorithms search for an optimal value for the model parameters such that the model will generalize well to new instances. We usually train such systems by minimizing a cost function that measures how bad the system is at making predictions on the training data, plus a penalty for model complexity if the model is regularized. To make predictions, we feed the new instance’s features into the model’s prediction function, using the parameter values found by the learning algorithm.
14. Can you name four of the main challenges in Machine Learning?

Some of the main challenges in Machine Learning are the lack of data, poor data quality, nonrepresentative data, uninformative features, excessively simple models that underfit the training data, and excessively complex models that overfit the data.
15. If your model performs great on the training data but generalizes poorly to new instances, what is happening? Can you name three possible solutions?

If a model performs great on the training data but generalizes poorly to new instances, the model is likely overfitting the training data (or we got extremely lucky on the training data). Possible solutions to overfitting are getting more data, simplifying the model (selecting a simpler algorithm, reducing the number of parameters or features used, or regularizing the model), or reducing the noise in the training data.
16. What is a test set and why would you want to use it?

A test set is used to estimate the generalization error that a model will make on new instances, before the model is launched in production.
17. What is the purpose of a validation set?

A validation set is used to compare models. It makes it possible to select the best model and tune the hyperparameters.
18. What is the train-dev set, when do you need it, and how do you use it?

The train-dev set is used when there is a risk of mismatch between the training data and the data used in the validation and test datasets (which should always be as close as possible to the data used once the model is in production). The train-dev set is a part of the training set that’s held out (the model is not trained on it). The model is trained on the rest of the training set, and evaluated on both the train-dev set and the validation set. If the model performs well on the training set but not on the train-dev set, then the model is likely overfitting the training set. If it performs well on both the training set and the train-dev set, but not on the validation set, then there is probably a significant data mismatch between the training data and the validation + test data, and you should try to improve the training data to make it look more like the validation + test data.

The train-dev set should not be confused with the development (dev) set. The train-dev set is a slice of the training data that is held out from training, so it comes from the same source as the training set. It is used to diagnose whether poor validation performance comes from overfitting or from a data mismatch, not to tune hyperparameters. Hyperparameter tuning is done on the validation (dev) set, and a separate test set is used only to evaluate the final performance of the model once training is complete.

In the context of machine learning, a development set (also known as a validation set) is a set of data that is used to evaluate the performance of a model during the training process. It is called a “development set” because it is used to develop the model, i.e., to tune the hyperparameters and ensure that the model is not overfitting the training data.

The development set is different from the training set, which is used to fit the model to the data, and the test set, which is used to evaluate the final performance of the model on unseen data.

Some people use the terms “development set” and “validation set” interchangeably, while others use them to refer to slightly different things. In general, the term “validation set” is used more commonly, and it usually refers to a set of data that is used to evaluate the model during training and fine-tune the hyperparameters.
19. What can go wrong if you tune hyperparameters using the test set?

If you tune hyperparameters using the test set, you risk overfitting the test set, and the generalization error you measure will be optimistic (you may launch a model that performs worse than you expect).
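The roles of the training, validation, and test sets discussed in the last few answers can be sketched as follows. The split ratios, the seed, the function names `split` and `mse`, and the three hand-written candidate models are all hypothetical; the candidates stand in for models that would normally be trained on the training set with different hyperparameter values.

```python
import random


def split(data, val_ratio=0.2, test_ratio=0.2, seed=42):
    """Shuffle once, then cut into training, validation, and test sets."""
    rows = list(data)
    random.Random(seed).shuffle(rows)
    n_test = int(len(rows) * test_ratio)
    n_val = int(len(rows) * val_ratio)
    test = rows[:n_test]
    val = rows[n_test:n_test + n_val]
    train = rows[n_test + n_val:]
    return train, val, test


def mse(model, rows):
    """Mean squared error of a model (a function of x) on (x, y) rows."""
    return sum((model(x) - y) ** 2 for x, y in rows) / len(rows)


# Hypothetical noiseless data following y = 2x + 1.
data = [(x, 2 * x + 1) for x in range(50)]
train, val, test = split(data)

# Hypothetical candidates, standing in for models trained on `train`.
candidates = {
    "slope 1": lambda x: 1 * x + 1,
    "slope 2": lambda x: 2 * x + 1,
    "slope 3": lambda x: 3 * x + 1,
}

# Compare candidates on the validation set only.
best = min(candidates, key=lambda name: mse(candidates[name], val))

# The test set is used once, at the end, to estimate the generalization error.
test_error = mse(candidates[best], test)
print(best, test_error)
```

Because the test set plays no part in choosing `best`, the error measured on it is not biased by the selection step.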

References

https://github.com/ageron/handson-ml3
https://github.com/ageron/handson-ml2

你可能感兴趣的:(机器学习,深度学习,人工智能)

AI写代码工具赋能前端开发：提升开发者解决问题能力 bd_ming 人工智能前端
近年来，人工智能（AI）技术在各个领域都取得了显著进展，前端开发领域也不例外。AI的快速发展为前端开发者带来了前所未有的机遇，同时也带来了新的挑战。开发者需要不断学习新的技术和工具，以适应快速变化的开发环境。而AI写代码工具的出现，为开发者提升解决问题的能力提供了强有力的支持。本文将探讨AI前端开发工具如何帮助开发者更高效地解决问题，并以ScriptEcho为例进行说明。……传统的Web前端开发工
AI写代码工具赋能前端开发：ScriptEcho 如何激发创新？ 2501_90335205 人工智能前端
近年来，人工智能技术飞速发展，深刻地改变着各个行业，前端开发领域也不例外。借助AI写代码工具，开发者们能够以前所未有的速度和效率构建复杂的应用程序，从而释放出更多的时间和精力专注于创新。本文将以ScriptEcho为例，深入探讨AI如何赋能前端开发，提升创新能力。……AI赋能前端创新：效率与创意的平衡传统的前端开发流程往往充满了重复性的工作，例如编写大量的样板代码、处理复杂的布局以及调试各种兼容性
DeepSeek预测2030年：全球 50% 的白领工作将由 AI Agent 辅助完成，金融、医疗等专业渗透率超 70% 未来AI编程 DeepSeek入门到精通人工智能金融
基于当前技术趋势、行业动态及搜索结果中的关键信息，对未来的发展进行多维度预测，涵盖人工智能、搜索行业、全球经济格局等领域：一、人工智能技术的革命性突破低成本高性能模型的普及DeepSeek-R1等国产大模型通过混合专家架构（MoE）和算法优化，以OpenAI1/70的训练成本实现同等性能，推动AI开发从“重训练”向“重推理”转型。这一模式将加速中小企业和新兴国家进入AI赛道，形成“算力平权”效应。
网关类设备技术演进思路看兵马俑的程序员网闸安全
1.新技术采纳5G和物联网技术：支持更快的数据传输和更多连接。人工智能（AI）和机器学习：用于数据分析、用户行为预测和自动化决策。边缘计算：在设备端进行数据处理，减少对云服务的依赖，提高响应速度。区块链技术：用于确保数据安全和网络安全。2.安全性和隐私数据加密和隐私保护：采用最新的加密技术保护数据传输和存储。身份验证和访问控制：强化用户身份验证，确保只有授权用户可以访问网关。固件和软件安全更新：支
国产替代 | 星环科技Sophon替代SAS，助力大型国有银行智能化营销星环科技数据库架构数据挖掘
分布式架构的｜国产智能分析工具在银行交易中，20%的头部优质客户会给银行贡献80%的利润，而赢得一个新客户的成本是保留一个老客户的5至6倍。某大型国有银行在面临此类数据挖掘的业务时，使用的是SAS产品。由于SAS是集中式的，对单台服务器要求太高，算力无法支撑需求，且无法支持可视化的机器学习，对于业务人员来说使用门槛过高。在经过产品选型后，决定采用星环科技的智能分析工具Sophon替换原有SAS，用
自然语言处理(NLP)：文本向量化从文字到数字的原理全栈你个大西瓜人工智能自然语言处理人工智能文本向量化 NLP
在人工智能领域，尤其是自然语言处理（NLP）中，将文本信息转化为机器可以理解的形式是一个至关重要的步骤。本文探讨如何将文本转换为向量表示的过程，包括分词、ID映射、One-hot编码以及最终的词嵌入（Embedding），并通过具体的案例代码来辅助解释这些概念。处理字符还是数字人工智能算法只能处理数字形式的数据，特别是浮点数。这意味着任何非数字的信息，如汉字、字母等，都需要被转换成数值形式才能用于
PyTorch知识点总结之一 Rain松机器学习与深度学习 pytorch 深度学习 python
PyTorch知识点总结之一1.什么是PyTorch？它有什么特点和优势？PyTorch是一个基于Python的科学计算库，它是用于机器学习和深度学习的框架之一。它由Facebook的人工智能研究团队开发和维护，是一个开源的软件包，可以帮助开发者构建各种深度学习模型。PyTorch的特点和优势如下：易于使用和学习：PyTorch采用了类似于Python的语法，使得它容易上手和学习。它还提供了丰富的
51、深度学习-自学之路-自己搭建深度学习框架-12、使用我们自己建的架构重写RNN预测网络小宇爱深度学习-自学之路深度学习 rnn 人工智能
importnumpyasnpclassTensor(object):def__init__(self,data,autograd=False,creators=None,creation_op=None,id=None):self.data=np.array(data)self.autograd=autogradself.grad=Noneif(idisNone):self.id=np.rand
大语言模型能否完全替代人类？——技术、能力与未来的思考 Hello kele 人工智能
随着人工智能技术的迅猛发展，尤其是大语言模型（如DeepSeek、GPT系列、Grok等）的出现，人们开始探讨一个引人深思的问题：这些智能系统是否有一天能完全替代人类？本文从技术现状、能力边界以及未来趋势三个方面，分析这个问题，并试图给出一种平衡的视角。一、技术现状：大语言模型的能力与局限大语言模型在过去几年中取得了显著进步。可以理解复杂的自然语言，生成连贯的文本，甚至完成编程、分析和创意任务。例
使用 ML.NET 开发工业预测系统：从数据到智能决策威哥说编程 c#AI编程人工智能 microsoft
在现代工业领域，随着生产设备和环境传感器的大量部署，生成了海量的实时数据。这些数据不仅可以帮助我们监控设备的健康状况，还能够通过智能分析实现预测性维护、故障检测和生产效率优化等目标。而机器学习技术，尤其是ML.NET，提供了一种高效、灵活的方式来挖掘这些数据背后的潜在价值。本文将带领大家通过使用ML.NET来开发一个简单的工业预测系统，帮助企业提高生产效率，降低故障风险。1.机器学习在工业中的应用
初学者推荐学习AI的路径 ProgramHan 学习人工智能
学习人工智能的路径可以分为基础知识、编程技能、机器学习、深度学习、数据处理与可视化、自然语言处理（NLP）、计算机视觉（CV）、强化学习、实践项目和持续学习几个阶段。以下是一个简要的路径：1️⃣基础知识数学基础（线性代数、微积分、概率统计）编程基础（Python/R等语言）算法与数据结构2️⃣机器学习基础理解监督学习（如回归、分类）、无监督学习（如聚类、PCA）掌握机器学习库（如scikit-le
机器学习实战：从理论到实践静默.\\ 机器学习人工智能
随着人工智能技术的迅猛发展，机器学习作为其核心部分，已经广泛应用于各个领域。它不仅在科技公司中扮演着关键角色，在医疗、金融、零售等行业也展现了巨大的潜力。然而，对于许多初学者来说，如何将理论知识转化为实际操作是一个挑战。本文旨在通过一个具体的案例——预测房价，来介绍机器学习的基本流程和具体操作步骤。我们将使用Python编程语言及其相关的科学计算库，如NumPy、Pandas、Scikit-Lea
云原生时代的分布式文件系统设计与实现 ITPUB-微风云原生
在云计算和大数据时代，高效的数据管理和访问对于企业来说至关重要。Alluxio，一个开源的分布式文件系统，应运而生，为大数据和人工智能应用提供了革命性的解决方案。由HaoyuanLi在加州大学伯克利分校AMPLab启动，Alluxio如今已成为全球众多大型科技公司（如Facebook、Uber、Microsoft等）的关键组件。Alluxio的历史与发展Alluxio最初是一个名为Tachyon的
开源模型应用落地-Qwen1.5-MoE-1/3的激活参数量达到7B模型的性能开源技术探险家开源模型-实际应用落地 #深度学习语言模型自然语言处理
一、前言2024.03.28阿里推出Qwen系列的首个MoE模型，Qwen1.5-MoE-A2.7B。它仅拥有27亿个激活参数，但其性能却能与当前最先进的70亿参数模型，如Mistral7B和Qwen1.5-7B相媲美。但是目前只有HFtransformers和vLLM支持该模型。二、术语介绍2.1.混合专家(MoE)架构是一种机器学习模型的结构设计,它将一个复杂的任务分解成多个相对简单的子任务,
选择 websim网站：一个用自然语言快速构建生成功能齐全的网站喜好儿网 AI网站 ai 人工智能 aigc
WebsimAI是一个前沿的网站创建平台，旨在通过人工智能技术彻底改变网页设计流程。用户只需用自然语言描述他们的愿景，即可快速生成功能齐全的网站。该工具非常适合从初学者到经验丰富的开发人员使用，可以快速生成应用程序、网站原型或试验网页设计。喜好儿网WebsimAI的主要优势之一是其用户友好的界面，让用户可以专注于创意和内容，而无需深入了解复杂的编程知识。此外，它还支持多种用途，比如生成游戏（如俄罗
PyTorch实战：手把手教你完成MNIST手写数字识别任务吴师兄大模型 PyTorch pytorch 人工智能 python 手写数字数别 MNIST 深度学习开发语言
系列文章目录Pytorch基础篇01-PyTorch新手必看：张量是什么？5分钟教你快速创建张量！02-张量运算真简单！PyTorch数值计算操作完全指南03-Numpy还是PyTorch？张量与Numpy的神奇转换技巧04-揭秘数据处理神器：PyTorch张量拼接与拆分实用技巧05-深度学习从索引开始：PyTorch张量索引与切片最全解析06-张量形状任意改！PyTorchreshape、tra
时序大模型：技术需求、现有成果及主流模型、模型架构、数据处理方式、优势、缺点及未来展望 xl.liu 架构人工智能
时序大模型：技术需求、现有成果及主流模型、模型架构、数据处理方式、优势、缺点及未来展望时序大模型如何保证数据的完整性和准确性时序大模型的性能高度依赖于数据的质量和完整性。为了确保模型的预测和分析结果准确可靠，需要采取一系列措施来保证数据的完整性和准确性。数据清洗：去除异常值：通过统计方法或机器学习算法检测并去除异常值，确保数据的合理性。填补缺失值：使用插值方法、均值填充、中位数填充或基于模型的预测
机器学习数学基础：36.φ相关系数分析 @心都机器学习人工智能