ljtyxl

（MLR）Learning Piece-wise Linear Models from Large Scale Data for Ad Click Prediction

Learning Piece-wise Linear Models from Large Scale Data for Ad Click Prediction

Kun Gai1, Xiaoqiang Zhu1, Han Li1, Kai Liu2†, Zhe Wang3†

Alibaba Inc.

[email protected], {xiaoqiang.zxq, lihan.lh}@alibaba-inc.com

Beijing Particle Inc. [email protected]
University of Cambridge. [email protected]

† contribute to this paper while worked at Alibaba

April 19, 2017

Abstract

CTR prediction in real-world business is a difficult machine learning problem with large scale nonlinear sparse data. In this paper, we introduce an industrial strength solution with model named Large Scale Piece-wise Linear Model (LS-PLM). We formulate the learning problem with L1 and L2,1 regularizers, leading to a non-convex and non-smooth optimization problem. Then, we propose a novel algorithm to solve it efficiently, based on directional derivatives and quasi-Newton method. In addition, we design a distributed system which can run on hundreds of machines parallel and provides us with the industrial scalability. LS-PLM model can capture nonlinear patterns from massive sparse data, saving us from heavy feature engineering jobs. Since 2012, LS-PLM has become the main CTR prediction model in Alibaba’s online display advertising system, serving hundreds of millions users every day.

ctr预测是一个具有大规模非线性稀疏数据的机器学习难题。本文介绍了一种工业强度的求解方法，即大型分段线性模型（LS-PLM）。我们用l1和l2，1正则化器构造了学习问题，得到了一个非凸非光滑的优化问题。然后，基于方向导数和拟牛顿法，提出了一种新的求解方法。此外，我们还设计了一个分布式系统，可以在数百台机器上并行运行，并为我们提供了工业可扩展性。LS-PLM模型可以从大量稀疏数据中捕获非线性模式，从而将我们从繁重的功能工程工作中拯救出来。自2012年以来，LS-PLM已经成为阿里巴巴在线展示广告系统中的主要ctr预测模型，每天为数亿用户提供服务。

Introduction

Click-through rate (CTR) prediction is a core problem in the multi-billion dollar online advertising industry. To improve the accuracy of CTR prediction, more and more data are involved, making CTR prediction a large scale learning problem, with massive samples and high dimension features.

Traditional solution is to apply a linear logistic regression (LR) model, trained in a parallel manner (Brendan et al. 2013, Andrew & Gao 2007). LR model with L1 regularization can generate sparse solution, making it fast for online prediction. Unfortunately, CTR prediction problem is a highly nonlinear problem. In particular, user-click generation involves many complex factors, like ad quality, context information, user interests, and complex interactions of these factors. To help LR model catch the nonlinearity, feature engineering technique is explored, which is both time and humanity consuming.

Another direction, is to capture the nonlinearity with well-designed models. Facebook (He et al. 2014) uses a hybrid model which combines decision trees with logistic regression. Decision tree plays a nonlinear feature transformation role, whose output is fed to LR model. However, tree-based method is not suitable for very sparse and high dimensional data (Safavian S. R. & Landgrebe D. 1990). (Rendle S. 2010) introduces Factorization Machines(FM), which involves interactions among features using 2nd order functions (or using other given-number-order functions). However, FM can not fit all general nonlinear patterns in data (like other higher order patterns).

In this paper, we present a piece-wise linear model and its training algorithm for large scale data. We name it Large Scale Piecewise Linear Model (LS-PLM). LS-PLM follows the divide-and-conquer strategy, that is, first divides the feature space into several local regions, then fits a linear model in each region, resulting in the output with combinations of weighted linear predictions. Note that, these two steps are learned simultaneously in a supervised manner, aiming to minimize the prediction loss. LS-PLM is superior for web-scale data mining in three aspects:

Nonlinearity. With enough divided regions, LS-PLM can fit any complex nonlinear function.
Scalability. Similar to LR model, LS-PLM is scalable both to massive samples and high dimensional features. We design a distributed system which can train the model on hundreds of machines parallel. In our online product systems, dozens of LS-PLM models with tens of million parameters are trained and deployed everyday.
Sparsity. As pointed in (Brendan et al. 2013), model sparsity is a practical issue for online serving in industrial setting. We show LS-PLM with L1 and L2,1 regularizer can achieve good sparsity.

The learning of LS-PLM with sparsity regularizer can be transformed to be a non-convex and nondifferential optimization problem, which is difficult to be solved. We propose an efficient optimization method for such problems, based on directional derivatives and quasi-Newton method. Due to the ability of capturing nonlinear patterns and scalability to massive data, LS-PLMs have become main CTR prediction models in the online display advertising system in alibaba, serving hundreds of millions users since 2012 early. It is also applied in recommendation systems, search engines and other product systems.

The paper is structured as follows. In Section 2, we present LS-PLM model in detail, including formulation, regularization and optimization issues. In Section 3 we introduce our parallel implementation structure. in Section 4, we evaluate the model carefully and demonstrate the advantage of LS-PLM compared with LR. Finally in Section 5, we close with conclusions.

Figure 1: A demo illustration of LS-PLM model. Figure A is the demo dataset. It is a binary classification problem, with red points belong to positive class and blue points belong to negative class. Figure B shows the classification result using LR model. Figure C shows the classification result using LS-PLM model. It’s clear that LS-PLM can capture the nonlinear distribution of data.

2. Method

We focus on the large scale CTR prediction application. It is a binary classification problems, with dataset and xt ∈ Rd is usually high dimensional and sparse.

2.1 Formulation

To model the nonlinearity of massive scale data, we employ a divide-and-conquer strategy, similar with

(Jordan & Jacobs 1994). We divide the whole feature space into some local regions. For each region we employ an individual generalized linear-classification model. In this way, we tackle the nonlinearity with a piece-wise linear model. We give our model as follows:

（1）

Here Θ = {u1,··· ,um,w1,··· ,wm} ∈ Rd×2m denote the model parameters. {u1,··· ,um} is the parameters for dividing function σ(·), and {w1,··· ,wm} for fitting function η(·). Given instance x, our predicating model p(y|x) consists of two parts: the first part σ(uTj x) divides feature space into m (hyper-parameter) different regions, the second part η(wjTx) gives prediction in each region. Function g(·) ensures our model to satisfy the definition of probability function.

Special Case. Take softmax ( Kivinen & Warmuth 1998) as dividing function σ(x) and sigmoid (Hilbe 2009) as fitting function η(x) and g(x) = x, we get a specific formulation:

(2)

In this case, our mixture model can be seen as a FOE model (Jordan & Jacobs 1994, Wang & Puterman 1998) as follows:

(3)

Eq.(2) is the most common used formulation in our real applications. In the reminder of the paper, without special declaration, we take Eq.(2) as our prediction model. Figure 1 illustrates the model compared with LR in a demo dataset, which shows clearly LS-PLM can capture the nonlinear pattern of data. The objective function of LS-PLM model is formalized as Eq.(4):

arg minΘf(Θ) = loss(Θ) + λkΘk2,1 + βkΘk1 (4)

loss(Θ)=−Xhyt log(p(yt=1|xt,Θ)) + (1 − yt)log(p(yt=0|xt,Θ))i (5)

t=1

Here loss(Θ) defined in Eq.(5) is the neg-likelihood loss function and kΘ2,1k and kΘ1k are two regulariza-

tion terms providing different properties. First, L2,1 regularization () is employed for feature selection. As in our model, each dimension of feature is associated with 2m parameters. L2,1 regularization are expected to push all the 2m parameters of one dimension of feature to be zero, that is, to suppress those less important features. Second, L1 regularization (kΘk1 = Pij |θij|) is employed for sparsity. Except with the feature selection property, L1 regularization can also force those parameters of left features to be zero as much as possible, which can help improve the interpretation ability as well as generalization performance of the model.

However, both L1 norm and L2,1 norm are non-smooth functions. This causes the objective function of Eq.(4) to be non-convex and non-smooth, making it difficult to employ those traditional gradient-descent optimization methods (Andrew & Gao 2007, Zhang 2004, Bertsekas 2003) or EM method (Wang & Puterman

1998).

Note that, while (Wang & Puterman 1998) gives the same mixture model formulation as Eq.(3), our model is more general for employing different kinds of prediction functions. Besides, we propose a different objective function for large scale industry data, taking the feature sparsity into consideration explicitly. This is crucial for real-world applications, as prediction speed and memory usage are two key indicators for online model serving. Furthermore, we give a more efficient optimization method to solve the large-scale non-convex problem, which is described in the following section.

1. Optimization

Before we present our optimization method, we establish some notations and definitions that will be used in the reminder of the paper. Let ∂ij+f(Θ) denote the right partial derivative of f at Θ with respect to Θij:

(6)

where eij is the ijth standard basis vector. The directional derivative of f as Θ in direction d is denoted as f0(Θ;d), which is defined as:

(7)

A vector d is regarded as a descent direction if f0(Θ;d) < 0. sign(·) is the sign function takes value {−1,0,1}. The projection function

(

Θij , sign(Θij) = sign(Ωij)

πij(Θ;Ω) = (8)

0 , otherwise

means projecting Θ onto the orthant defined by Ω.

1. 1. Choose descent direction

As discussed above, our objective function for large scale CTR prediction problem is both non-convex and non-smooth. Here we propose a general and efficient optimization method to solve such kind of non-convex problems. Since the negative-gradients of our objective function do not exists for all Θ, we take the direction d which minimizes the directional derivative of f with Θ as a replace. The directional derivative f0(Θ;d) exists for any Θ and direction d, whic is declared as Lemma 1.

Lemma 1. When an objective function f(Θ) is composed by a smooth loss function with L1 and L2,1 norm, for example the objective function given in Eq. (4), the directional derivative f0(Θ;d) exists for any Θ and direction d.

We leave the proof in Appendix A. Since the directional derivative f0(Θ;d) always exists, we choose the direction as the descent direction which minimizes the directional derivative f0(Θ;d) when the negative gradient of f(Θ) does not exist. The following proposition 2 explicitly gives the direction.

Proposition 2. Given a smooth loss function loss(Θ) and an objective function f(Θ) = loss(Θ)+λkΘk2,1+ βkΘk1, the bounded direction d which minimizes the directional derivative f0(Θ;d) is denoted as follows:

(9)

where and v = max{| − ∇loss(Θ)ij| − β,0}sign(−∇loss(Θ)ij).

More details about the proof can be found in Appendix B. According to the proof, we can see that the negative pseudo-gradient defined in Gao’s work (Andrew & Gao 2007) is a special case of our descent direction. Our proposed method is more general to find the descent direction for those non-smooth and non-convex objective functions.

Based on the direction dk in Eq.(9), we update the model parameters along a descent direction calculated by limited-memory quasi-newton method (LBFGS) (Wang & Puterman 1998), which approximates the inverse Hessian matrix of Eq.(4) on the given orthant. Motivated by the OWLQN method (Andrew & Algorithm 1 Optimize problem Eq.(4)

Input:Choose initial point Θ(0)

S ← {},Y ← {}

for k = 0 to MaxIters do

Compute d(k) with Eq. (9).
Compute pk with Eq. (11) using S and Y .
Find Θ(k+1) with constrained line search (12).
If termination condition satisfied then stop and return Θ(k+1) End if
Update S with s(k) = Θ(k) − Θ(k−1)
Update Y with y(k) = −d(k) − (−d(k−1)) end for

Gao 2007), we also restrict the signs of model parameters not changing in each iteration. Given the chosen direction dk and the old Θ(k), we constrain the orthant of current iteration as follows:

. (10)

(k)

When Θ = 0, the new Θij would not change sign in current iteration. When Θij = 0, we choose the

(k) orthant decided by the selected directionfor the new Θij .

1. 1. Update direction constraint and line search

Given the descent direction dk, we approximate the inverse-Hessian matrix Hk using LBFGS method with a sequence of y(k),s(k). Then the final update direction is Hkd(k). Here we give two tricks to adjust the update direction. First, we constrain the update direction in the orthant with respect to d(k). Second, as our objective function is non-convex, we cannot guarantee that Hk is positive-definite. We use y(k)Ts(k) > 0 as a condition to ensure Hk is a positive-definite matrix. If y(k)Ts(k) ≤ 0, we switch to d(k) as the update direction. The final update direction pk is defined as follows:

Hkd(k);d(k)), y(k)Ts(k) > 0

(11) d( ), otherwise

Given the update direction, we use backtracking line search to find the proper step size α. Same as OWLQN, we project the new Θ(k+1) onto the given orthant decided by the Eq. (10).

Θ(k+1) = π(Θ(k) + αpk;ξ(k)) (12)

1. Algorithm

A pseudo-code description of optimization is given in Algorithm 1. In fact, only a few steps of the standard LBFGS algorithm need to change. These modifications are:

The direction d(k) which minimizes the direction derivative of the non-convex objective is used in replace of negative gradient.
The update direction is constrained to the given orthant defined by the chosen direction d(k). Switch to d(k) when the Hk is not positive-definite.
During the line search, each search point is projected onto the orthant of the previous point.

Implementation

In this section, we first provide our parallel implementation of LS-PLM model for large-scale data, then introduce an important trick which helps to accelerate the training procedue greatly.

Figure 2: The architecture of parallel implementation. Figure A illustrates the physical distributed topology. It’s a variant of parameter server, where each computation node runs with both a server and a worker, aiming to maximize the utility of computation power as well as memory usuage. Figure B illustrates the parameter server structure in model-parallelism and data-parallelism manner.

1. Parallel implementation

To apply Algorithm 1 in large-scale settings, we implement it with a distributed learning framework, as illustrated in Figure 2. It’s a variant of parameter server. In our implementation, each computation node runs with both a server node and a worker node, aiming to

Maximize the utility of CPU’s computation power. In traditional parameter server setting, server nodes work as a distributed KV storer with interfaces of push and pull operations, which are low computation costly. Running with worker nodes can make full use of the computation power.
Maximize the utility of memory. Machines today usually have big memory, for example 128GB. Running on the same computation node, server node and worker node can share and utilize the big memory better.

In brief, there are two roles in the framework. The first role is the work node. Each node stores a part of training data and a local model, which only saves the model parameters used for the local training data. The second role is the server node. Each node stores a part of global model, which is mutually-exclusive. In each iteration, all of the worker nodes first calculate the loss and the descent direction with local model and local data in parallel(data parallelism). Then server nodes aggregate the loss and the direction d(k) as well as the corresponding entries of Θ needed to calculate the revised gradient (model parallelism). After finishing calculating the steepest descent direction in Step 1, workers synchronize the corresponding entries of Θ, and then, perform Step 2–6 locally.

1. Common Feature Trick

Figure 3: Common feature pattern in display advertising. Usually in each page view, a user will see several different ads at the same time. In this situation, user features can be shared across these samples.

In addition to the general-purpose parallel implementation, we also optimized the implementation in online advertising context. Training samples in CTR prediction tasks usually have similar common feature pattern. Take display advertising as an example, as illustrated in Figure 3, during each page view, a user will see several different ads at the same time. For example, user U1 in Figure 3 sees three ads in one visit session, thus generates three samples. In this situation, features of user U1 can be shared across these three samples. These features include user profiles (sex, age, etc.) and user behavior histories during visits of Alibabas e-commerce websites, for example, his/her shopping item IDs, preferred brands or favorite shop IDs.

Remember the model defined in Eq. 2, most computation cost focus on µTi x and wiTx. By employing the common feature trick, we can split the calculation into common and non-common parts and rewrite them as follows:

µTi x = µTi,cxc + µTi,ncxnc (13)

wiTx = wi,cT xc + wi,ncT xnc

Hence, for the common feature part, we need just calculate once and then index the result, which will be utilized by the following samples.

Specifically, we optimize the parallel implementation with common features trick in the following three aspects:

Group training samples with common features and make sure these samples are stored in the same worker.
Save memory by storing common features shared by multiple samples only once.
Speed up iteration by updating loss and gradient w.r.t. common features only once.

Due to the common feature pattern of our production data, employing the common feature trick improves the performance of training procedure greatly, which will be shown in the following section 4.3.

Experiments

In this session, we evaluate the performance of LS-PLM. Our datasets are generated from Alibaba’s mobile display advertising product system. As shown in Table 1, we collect seven datasets in continuous sequential Table 1: Alibaba’s mobile display advertising CTR prediction datasets

Dataset	#features	#samples (training/validation/testing)
1	3.04 × 106	1.34/0.25/0.26 × 109
2	3.27 × 106	1.44/0.26/0.26 × 109
3	3.49 × 106	1.56/0.26/0.25 × 109
4	3.67 × 106	1.62/0.25/0.26 × 109
5	3.82 × 106	1.69/0.26/0.26 × 109
6	3.95 × 106	1.74/0.26/0.26 × 109
7	4.07 × 106	1.78/0.26/0.26 × 109

periods, aiming to evaluate the consist performance of the proposed model, which is important for online product serving. In each dataset, training/validation/testing samples are disjointly collected from different days, with proportion about 7:1:1. AUC (Fawcett 2006) metric is used to evaluate the model performance.

1. Effectiveness of division number

LS-PLM is a piece-wise linear model, with division number m controlling the model capacity. We evaluate the division effectiveness on model’s performance. It is executed on dataset 1 and results are shown in Figure

Generally speaking, larger m means more parameters and thus leads to larger model capacity. But the training cost will also increase, both time and memory. Hence, in real applications we have to balance the model performance with the training cost.

Figure 4: Model performance with different divisions.

Figure 4 shows the training and testing AUC with different division number m. We try m = 6,12,24,36, the testing AUC for m = 12 is markedly better than m = 6, and improvement for m = 24,36 is relatively gentle. Thus in all the following experiments , the parameter m is set as 12 for LS-PLM model.

1. Effectiveness of regularization

As stated in Session 2, in order to keep our model simpler and more generalized, we prefer to constrain the model parameters sparse by both L1 and L2,1 norms. Here we evaluate the strength of both the regularization terms.

Table 2 gives the results. As expected, both L1 and L2,1 norms can push our model to be sparse. Model trained with L2,1 norm has only 9.4% non-zero parameters left and 18.7% features are kept back. While in L1 norm case, there are only 1.9% non-zero parameters left. Combining them together, we get the sparsest Table 2: Regularization effects on model sparsity and performance β/λ(L1/L2,1) #features #non-zero parameters testing auc

Table 3: Training cost comparision with/without common feature trick

Cost	Without CF.	With CF.	Cost Saving
Memory cost/node	89.2 GB	31 GB	65.2%
Time cost/iteration	121s	10s	91.7%

result. Meanwhile, models trained with different norm get different AUC performance. Again adding the two norms together the model reaches the best AUC performance.

In this experiment, the hyper-parameter m is set to be 12. Parameters β and λ are selected by grid search. {0.01,0.1,1,10} are tried for both norms in the all cases. The model with β = 1 and λ = 1 preforms best.

1. Effectiveness of common feature trick

We prove the effectiveness of common features trick. Specifically, we set up the experiments with 100 workers, each of which uses 12 CPU cores, with up to 110 GB memory totally. As shown in Table 3, compressing instances with common feature trick does not affect the actual dimensions of feature space. However, in practice we can significantly reduce memory usage (reduce to about 1/3) and speed up the calculation (around 12 times faster) compared to the training without common feature trick.

Figure 5: Model performance comparison on 7 different test datasets. LS-PLM owns consistent and markable promotion compared with LR.

1. Comparison with LR

We now compare LS-PLM with LR, the widely used CTR prediction model in product setting. The two models are trained using our distributed implementation architecture, running with hundreds of machines for speed-up. The choice of the L1 and L2,1 parameters for LS-PLM and the L1 parameter for LR are based on grid search. β = 0.01,0.1,1,10 and λ = 0.01,0.1,1,10 are tried. The best parameters are β = 1 and λ = 1 for LS-PLM, and β = 1 for LR.

As shown in Figure 5, LS-PLM outperforms LR clearly. The average improvement of AUC for LR is 1.44%, which has significant impact to the overall online ad system performance. Besides, the improvement is stable. This ensures LS-PLM can be safely deployed for daily online production system.

Conclusions

In this paper, a piece-wise linear model, LS-PLM for CTR prediction problem is proposed. It can capture the nonlinear pattern from sparse data and save us from heavy feature engineering jobs, which is crucial for real industry applications. Additionally, powered by our distributed and optimized implementation, our algorithm can handle problems of billion samples with ten million parameters, which is the typical industrial data volume. Regularization terms of L1 and L2,1 are utilized to keep the model sparse. Since 2012, LS-PLM has become the main CTR prediction model in alibaba’s online display advertising system, serving hundreds of millions users every day.

Acknowledgments

We would like to thank Xingya Dai and Yanghui Yan for their help for this work.

Appendix

A Proof of Lemma 2.1

Proof. the definition of f0(Θ;d) is given as follows:

(14)

As the gradient of loss function exists for any Θ, the directional derivative for the first part is

loss(Θ + αd) − loss(Θ) T

lim = ∇loss(Θ) d

(15)

loss(Θ + αd) − loss(Θ) =lim

α↓0 α

For the second part, we know if kΘi·k2,1 6= 0, the L2,1 norm’s partial derivative exists. So the directional derivative is

. (16)

However, when kΘi·k2,1 = 0, it means Θij = 0,1 ≤ j ≤ 2m. Then its directional derivative can be denoted as follows:

(17)

So combine the above cases in Eq. (16) and Eq. (17), we get the directional derivative for the second part:

(18)

Same as the second part, the direction derivative for the third part is:

(19)

Based on Eq. (15), Eq. (18) and Eq. (19), we get that for any Θ and direction d, f0(Θ;d) exists.

B Proof of Proposition 2.2

Proof. Finding the expected direction turns to an optimization problem, which is formulated as follows:

minf0(Θ;d) s.t. kdk2 ≤ C. (20) d

Here the direction d is bounded by a constant scalar C. To solve this problem, we employ lagrange function to combine the objective function and inequality function together:

L(d,µ) = f0(Θ;d) + µ(kdk2 − C). (21)

Here µ ≥ 0 is the lagrange multiplier. Setting the partial derivative of d with respect to L(d,µ) to zero has three cases.

Define

when Θij 6= 0, it implies that

2µdij = s − βsign(Θij)

when Θij = 0 and kΘi·k2,1 > 0, it is easy to have

2µdij = max{|s| − β,0}sign(s)

We give more details when Θij = 0 and kΘi·k2,1 = 0. For di· we have

loss(Θ).

Here we use sign(di·) = [sign(di1),...,sign(di2m)]T for simplicity. Then we get

loss(Θ)i· − βsign(di·)

which implies that sign(di·) = sign(−∇loss(Θ)i· − βsign(di·)). When dij ≥ 0, it implies −∇loss(Θ)ij −

. βsign(dij) ≥ 0. Inversely, we have −∇loss(Θ)ij−βsign(dij) ≤ 0 when dij ≤ 0. So we define v = −∇loss(Θ)i·− βsign(di·) and vj = max{| − ∇loss(Θ)ij| − β,0}sign(−∇loss(Θ)ij). So

As kdi·k ≥ 0, we have 2µkdi·k = max(kvk − λ,0). Thus 2

The lagrange multiplier u is a scalar, and it has equivalent influence for all dij. We can see that the optimal direction which is bounded by C is the same direction as we defined in Eq. (9) without considering the constant scalar µ. Here we finish our proof.

References

Andrew G. and Gao J. (2007) Scalable Training of L1-Regularized Log-Linear Models. Proceedings of the 24-th International Conference on Machine Learning.
Bertsekas, D. (2003) Nonlinear Programming. Springer US, 51–88.
Brendan H., Holt G., Sculley D., Young M., Ebner D., Grady J., Nie L., Phillips. T, Davydov E., Golovin D., Chikkerur S., Liu D., Wattenberg M., Hrafnkelsson A., Boulos T., Kubica J. (2013) Ad Click Prediction: a View from the Trenches. Proceedings of the 19-th KDD.
Fawcett T. (2006) An introduction to ROC analysis. Pattern Recognition Letters, 27, 861–874.
Friedman J. (1999) Greedy Function Approximation: A Gradient Boosting Machine. Technical Report, Dept. of Statistics, Stanford University.
Hilbe M. (2009) Logistic regression models. CRC Press.
He X., Pan J., Jin O., Xu T., Liu B., Xu T, Shi Y., Atallah A, Herbrich R., Bowers S., Candela J. (2014) Practical Lessons from Predicting Clicks on Ads at Facebook. Proceedings of the 20-th KDD.
Jordan I., Jacobs A (1994) Hierarchical mixtures of experts and the EM algorithm. Neural computation, 6(2): 181-214.
Kivinen J., Warmuth M K. (1998) Relative Loss Bounds for Multidimensional Regression Problems. Machine Learning, 45(3):301-329.
Rendle S. (2010) Factorization Machines. Proceedings of the 10th IEEE International Conference on Data Mining.
Roth S, Black M J. (2009) Fields of experts. International Journal of Computer Vision, 82(2): 205–229.
Safavian S. R., Landgrebe D. (1990) A survey of decision tree classifier methodology[J].
Wang P.-M and Puterman M. (1998) Mixed Logistic Regression Models. Journal of Agricultural, Biological, and Environmental Statistics, 3(2), 175–200.
Zhang T. (2004) Solving large scale linear prediction problems using stochastic gradient descent algorithms. Proceedings of the twenty-first international conference on Machine learning. ACM, 116.
Gai K. http://club.alibabatech.org/resource_detail.htm?topicId=106

你可能感兴趣的:(算法实现)

C# 代码（`Hashtable` 和 `SortedList`）张謹礧 c#哈希算法开发语言
一、Hashtable（哈希表）1.基本概念非泛型集合：存储键值对（object类型），通过哈希算法实现快速查找。线程安全：默认非线程安全，可通过Hashtable.Synchronized创建线程安全版本。键的唯一性：键必须唯一，且不可为null（值可为null）。2.创建与初始化//创建空的HashtableHashtablehashtable=newHashtable();//创建并初始化
Java 递归方法详解：从基础语法到实战应用，彻底掌握递归编程思想大葱白菜 java合集 java 开发语言个人开发后端学习
作为一名Java开发工程师，你一定在开发中遇到过需要重复调用自身逻辑的问题，比如：树形结构处理、文件夹遍历、斐波那契数列、算法实现（如DFS、回溯、分治）等。这时候，递归方法（RecursiveMethod）就成为你不可或缺的工具。本文将带你全面掌握：什么是递归方法？递归的三要素（边界条件、递归公式、递归方向）递归与循环的对比常见递归问题与实现（阶乘、斐波那契、汉诺塔、树遍历等）递归在真实项目中的
【图像分割】基于模糊聚类FCM和改进的模糊聚类算法实现CT图像分割matlab代码天天Matlab科研工作室图像处理 Matlab各类代码算法聚类 matlab
1简介医学影像分割的基本目标是将图像分割成不同的解剖组织，从而可以从背景中提取出感兴趣区域。因为图像的低分辨率和弱对比度，实现医学影像分割是一件具有挑战的任务。而且，这个任务由于噪声和伪阴影变得更加困难，这些干扰项可能是因器材限制、重建算法和患者移动等原因造成的。目前还没有通用的医学图像分割算法，算法的优点和缺点经常根据所研究的问题而变化。将分割概念具体到颅内出血CT图像上，就是将颅腔中的出血病灶
手绘电路图的节点和端点检测一个简化版的算法实现框架 zhangfeng1133 算法
于论文描述，我将提供一个简化版的算法实现框架，用于手绘电路图的节点和端点检测，并整合生成电路原理图。以下代码结合了YOLOv5目标检测和传统图像处理技术，符合论文中提到的98.2%mAP和92%节点识别准确率的关键指标。核心算法实现（Python+OpenCV+YOLOv5）importcv2importnumpyasnpimporttorchfromyolov5importYOLOv5#需要安装
大规模图计算引擎的分区与通信优化：负载均衡与网络延迟的解决方案 LCG元系统服务架构负载均衡网络运维
目录一、系统架构设计与核心流程1.1原创架构图解析1.2双流程对比分析二、分区策略优化实践2.1动态权重分区算法实现（Python）三、通信优化机制实现3.1基于RDMA的通信层实现（TypeScript）四、性能对比与调优4.1分区策略基准测试五、生产级部署方案5.1Kubernetes部署配置（YAML）5.2安全审计配置六、技术前瞻与演进附录：完整技术图谱一、系统架构设计与核心流程1.1原创
探索OpenCV 3.2源码：计算机视觉的架构与实现轩辕姐姐
本文还有配套的精品资源，点击获取简介：OpenCV是一个全面的计算机视觉库，提供广泛的功能如图像处理、对象检测和深度学习支持。OpenCV3.2版本包含了改进的深度学习和GPU加速特性，以及丰富的示例程序。本压缩包文件提供了完整的OpenCV3.2源代码，对于深入学习计算机视觉算法和库实现机制十分宝贵。源码的模块化设计、C++接口、算法实现、多平台支持和性能优化等方面的深入理解，都将有助于开发者的
面试高频题力扣 130. 被围绕的区域洪水灌溉(FloodFill) 深度优先遍历(dfs) 暴力搜索 C++解题思路每日一题 Q741_147 C/C++每日一题：从语法到算法面试 leetcode 深度优先 c++洪水灌溉
目录零、题目描述一、为什么这道题值得你花时间掌握？二、题目拆解：提取核心关键点三、解题思路：从边界入手，反向标记四、算法实现：深度优先遍历（DFS）+两次遍历五、C++代码实现：一步步拆解代码拆解时间复杂度空间复杂度七、坑点总结八、举一反三九、总结零、题目描述题目链接：被围绕的区域题目描述：示例1：输入：board=[[“X”,“X”,“X”,“X”],[“X”,“O”,“O”,“X”],[“X”
Java设计模式之行为型模式（策略模式）介绍与说明爪哇手记 #Java知识点 java 设计模式策略模式
一、策略模式简介策略模式（StrategyPattern）是一种行为型设计模式，它定义了一系列算法，并将每个算法封装起来，使它们可以相互替换，且算法的变化不会影响使用算法的客户。策略模式让算法独立于使用它的客户而变化，属于对象行为型模式。其核心思想是将算法的定义与使用分离，通过接口或抽象类来定义算法族，具体算法实现由具体策略类完成，客户端可以根据需要选择合适的策略。二、策略模式的结构抽象策略（St
在IDEA中无缝接入DeepSeek：智能编程助手指南摆烂大大王 deepseek intellij-idea java ide deepseek AIGC
一、为什么要在IDEA中使用DeepSeek？DeepSeek作为先进的AI编程助手，能提供：智能代码补全与建议实时错误检测与修复方案代码解释与文档生成复杂算法实现建议多语言支持（Python/Java/JS等）二、接入前准备获取API密钥访问DeepSeek官网注册账号在控制台创建APIKey并保存IDEA环境要求IntelliJIDEA2020.3+安装HTTPClient插件（已内置）三、两
LeetCode题目（Python实现）：课程表 II RexT1 LeetCode题目列表队列数据结构 leetcode python
文章目录题目拓扑序列：入度表（广度优先遍历）算法实现执行结果复杂度分析拓扑序列：深度优先搜索算法实现执行结果复杂度分析题目现在你总共有n门课需要选，记为0到n-1。在选修某些课程之前需要一些先修课程。例如，想要学习课程0，你需要先完成课程1，我们用一个匹配来表示他们:[0,1]给定课程总量以及它们的先决条件，返回你为了学完所有课程所安排的学习顺序。可能会有多个正确的顺序，你只要返回一种就可以了。如
AI人工智能遇上TensorFlow：技术融合新趋势 AI大模型应用之禅人工智能 tensorflow python ai
AI人工智能遇上TensorFlow：技术融合新趋势关键词：人工智能、TensorFlow、深度学习、神经网络、机器学习、技术融合、AI开发摘要：本文深入探讨了人工智能技术与TensorFlow框架的融合发展趋势。我们将从基础概念出发，详细分析TensorFlow在AI领域的核心优势，包括其架构设计、算法实现和实际应用。文章包含丰富的技术细节，如神经网络原理、TensorFlow核心算法实现、数学
基于大模型的急性出血坏死性胰腺炎预测技术方案 LCG元人工智能 python
目录一、算法实现伪代码1.数据预处理与特征工程2.大模型训练（以Transformer为例）3.实时预测与动态调整二、模块流程图1.术前预测流程2.术中动态决策流程3.术后护理流程三、系统集成方案1.系统架构图2.核心模块交互流程四、系统部署拓扑图1.物理部署拓扑2.部署说明五、技术验证方案1.交叉验证流程2.实验验证设计六、健康教育模块示例一、算法实现伪代码1.数据预处理与特征工程#数据清洗与归
大数据领域数据产品的零售行业应用创新模式大数据洞察大数据与AI人工智能大数据零售单例模式 ai
大数据领域数据产品的零售行业应用创新模式关键词：大数据、零售行业、数据产品、应用创新、客户洞察、智能决策、数字化转型摘要：本文深入探讨了大数据技术在零售行业中的应用创新模式。我们将从零售行业数字化转型的背景出发，分析大数据产品如何重塑零售价值链，包括客户洞察、供应链优化、精准营销和智能决策等方面。文章将详细介绍相关技术原理、算法实现和实际应用案例，为零售企业提供可操作的大数据应用框架和创新思路。1
Python核心基础DAY1--Python的基础变量类型之字符串和数字类型
一、引言Python作为一种功能强大且广泛应用的编程语言，其基础变量类型是构建各种复杂程序的基石。在Python中，字符串和数字类型是最常用的基础变量类型之一。对于初学者来说，深入理解这两种类型是掌握Python编程的关键第一步。无论是数据处理、算法实现还是构建Web应用程序，对字符串和数字类型的熟练运用都至关重要。二、变量变量是代数的思想，是用来引用数据和功能占位的，具备动态性和可变性；使用的变
AIGC领域AI作画：在数字雕塑中的应用实践 AI原生应用开发 AI 原生应用开发 AIGC AI作画 ai
AIGC领域AI作画：在数字雕塑中的应用实践关键词：AIGC、AI作画、数字雕塑、生成对抗网络、3D建模、艺术创作、深度学习摘要：本文深入探讨了AIGC(人工智能生成内容)技术在数字雕塑领域的创新应用。我们将从技术原理、算法实现到实际案例，全面解析AI如何赋能传统数字雕塑创作流程。文章首先介绍AIGC在艺术创作中的背景和发展现状，然后详细讲解核心算法原理和数学模型，接着通过实际项目案例展示AI作画
OpenCV实战之二 | 基于哈希算法比较图像的相似性 w94ghz OpenCV实战笔记 opencv 哈希算法人工智能
前言☘️本章节主要介绍常用的图像相似性评价算法：图像哈希算法。图像哈希算法通过获取图像的哈希值并比较两幅图像的哈希值的汉明距离来衡量两幅图像是否相似。两幅图像越相似，其哈希值的汉明距离越小。图像哈希算法可以用于图片检索，重复图片剔除，以图搜图以及图片相似度比较。目录一、汉明距离二、img_hash模块三、哈希算法哈希算法实现步骤：代码实现一、汉明距离汉明距离（HammingDistance）是用于
鸿蒙安全实战：三步实现AES加密，让你的用户密码坚不可摧！前端世界 harmonyos harmonyos 安全华为
摘要在鸿蒙应用中，数据加密是保护敏感信息（如用户密码）的核心手段。本文通过一个用户登录系统的实际场景，详细解析如何使用AES对称加密算法实现密码的安全存储与验证。我们将从密钥生成、加密存储到解密验证逐步展开，并提供完整代码实现和性能分析。描述当用户注册时，系统需将密码加密后存储；登录时需解密验证。直接存储明文密码存在严重安全隐患，而AES-256作为行业标准对称加密算法，能有效解决这一问题。鸿蒙通
直线插补动画引擎：从数学原理到C#实现——用代码绘制动态几何艺术墨夶 C#学习资料 c#算法开发语言
一、直线插补核心算法解析1.1DDA算法数学原理//////DDA算法实现直线插补///publicclassLineInterpolator{privatePointF_currentPoint;privatePointF_endPoint;privatefloat_stepSize;privatefloat_dx,_dy;privatefloat_xIncrement,_yIncrement;
基于 STM32+FPGA 的快速傅里叶频域图像在 TFT 中显示的设计与实现(项目资料)（ID:8）嵌入式资料库嵌入式项目合集 fpga开发 stm32 嵌入式硬件单片机
目录摘要1绪论1.1研究背景与意义1.2国内外研究现状1.3研究内容与目标2系统方案设计2.1总体架构设计2.2硬件方案设计2.2.1主控模块选型2.2.2FPGA模块选型2.2.3TFT显示模块选型2.2.4通信方案设计2.3软件方案设计2.3.1FFT算法实现方案2.3.2频域图像渲染方案3硬件电路设计3.1STM32最小系统电路3.2FPGA模块电路3.3TFT显示模块电路3.4软件IIC通
串---暴力字符串匹配算法实现 KYGALYX 数据结构算法数据结构
暴力字符串匹配算法详解暴力字符串匹配算法（BruteForceStringMatchingAlgorithm）是一种简单的字符串匹配算法，它通过逐个比较主串中的字符与模式串中的字符来进行匹配。虽然这种方法简单直观，但在最坏情况下可能需要多次比较，导致效率较低。本文档将详细介绍暴力字符串匹配算法的原理、步骤以及如何在C语言中实现。1.暴力字符串匹配算法原理1.1主串与模式串主串：待搜索的字符串。模式
力扣网编程121题：买卖股票的最佳时机之动态规划（简单）魏劭逻辑编程题 C语言 leetcode 动态规划算法
一.简介前一篇文章使用贪心算法实现了力扣网上121题：买卖股票的最佳时机，文章如下：力扣网编程189题：买卖股票的最佳时机之贪心算法（简单）-CSDN博客本文使用动态规划实现该题目。二.力扣网编程189题：买卖股票的最佳时机之动态规划（简单）给定一个数组prices，它的第i个元素prices[i]表示一支给定股票第i天的价格。你只能选择某一天买入这只股票，并选择在未来的某一个不同的日子卖出该股票
水下目标检测：突破与创新加油吧zkf 目标跟踪人工智能计算机视觉
水下目标检测技术背景水下环境带来独特挑战：光线衰减导致对比度降低，散射引发图像模糊，色偏使颜色失真。动态水流造成目标形变，小目标（如10×10像素海胆）检测困难。声呐与光学数据融合可提升精度，但多模态对齐仍是技术难点。核心算法实现要点图像预处理直方图均衡化与Retinex算法结合改善对比度和色偏：defsingle_scale_retinex(img,sigma):retinex=np.log10
用Python解锁图像处理之力：从基础到智能应用的深度探索熊猫钓鱼>_> python 图像处理开发语言
在像素构成的数字世界里，Python已成为解码图像奥秘的核心引擎。一、为何选择Python处理图像？超越工具的本质思考当人们谈论图像处理时，往往会陷入工具对比的漩涡（PythonvsMATLABvsC++）。但Python的真正价值在于其构建的完整生态闭环：科学计算基石：NumPy的ndarray结构完美对应图像的多维矩阵本质算法实现自由：从传统算子到深度学习模型的无缝衔接可视化即战力：Matpl
分布式领域后端服务的限流算法实现大厂资深架构师 Spring Boot 开发实战分布式算法 wpf ai
分布式领域后端服务的限流算法实现关键词：分布式系统、限流算法、令牌桶、漏桶、滑动窗口、Redis、高并发摘要：本文深入探讨分布式系统中后端服务的限流算法实现。我们将从基础概念出发，详细分析各种限流算法的原理和适用场景，包括计数器算法、滑动窗口算法、令牌桶算法和漏桶算法。文章将提供Python实现代码和数学建模，并通过实际案例展示如何在分布式环境中使用Redis实现高效的限流机制。最后，我们将讨论限
【数据挖掘】支持向量机（SVM）大雨淅淅大数据数据挖掘支持向量机算法大数据回归
目录一、支持向量机（SVM）算法概述二、支持向量机（SVM）算法优缺点和改进2.1支持向量机（SVM）算法优点2.2支持向量机（SVM）算法缺点2.3支持向量机（SVM）算法改进三、支持向量机（SVM）算法实现3.1支持向量机（SVM）算法C语言实现3.2支持向量机（SVM）算法JAVA实现3.3支持向量机（SVM）算法python实现四、支持向量机（SVM）算法应用五、支持向量机（SVM）算法发
RabbitMQ消息队列在大数据系统中的实战应用案例 AI天才研究院 AI大模型企业级应用开发实战 Agentic AI 实战 AI人工智能与大数据 rabbitmq 分布式 ai
RabbitMQ消息队列在大数据系统中的实战应用案例关键词：RabbitMQ、消息队列、大数据系统、实战案例、高并发处理、分布式架构、数据管道摘要：本文深入探讨RabbitMQ消息队列在大数据系统中的核心应用场景，结合具体技术实现和实战案例，详细解析其在数据采集、实时处理、异步解耦等关键环节的技术优势。通过架构设计原理、核心算法实现、数学模型分析和项目实战，展示如何利用RabbitMQ构建高可靠、
深度探索：机器学习中的条件生成对抗网络（Conditional GAN, CGAN）算法原理及其应用
目录1.引言与背景2.CGAN定理3.算法原理4.算法实现5.优缺点分析优点：缺点：6.案例应用7.对比与其他算法8.结论与展望1.引言与背景生成对抗网络（GenerativeAdversarialNetworks,GANs）作为一种深度学习框架，在无监督学习领域展现出强大的能力，特别在图像、音频、文本等复杂数据的生成任务中取得了显著成果。然而，原始GAN模型在生成过程中缺乏对生成样本特定属性的直
【基于C# + HALCON的工业视系统开发实战】十七、航空级精度！涡轮叶片三维型面检测：激光扫描与CAD模型比对技术 AI_DL_CODE c#halcon 三维检测涡轮叶片点云配准型面偏差激光扫描
摘要：涡轮叶片是航空发动机的核心部件，其型面精度直接影响发动机效率与安全性。传统三坐标测量存在效率低（单叶片需40分钟）、覆盖率不足（仅检测关键截面）等问题。本文基于C#.NETCore6与HALCON24.11，构建三维型面检测系统：通过激光线扫描（每秒2000线）获取百万级点云，经MLS滤波降噪（保留0.03mm细节）与快速采样（0.1mm间隔）优化数据；采用ICP算法实现点云与CAD模型配准
Python开发从新手到专家：第三章列表、元组和集合 caifox菜狐狸 Python开发从新手到专家 python 元素集合列表元组数据结构字典
在Python开发的旅程中，数据结构是每一位开发者必须掌握的核心知识。它们是构建程序的基石，决定了代码的效率、可读性和可维护性。本章将深入探讨Python中的三种基本数据结构：列表、元组和集合。这三种数据结构在实际开发中有着广泛的应用，从简单的数据存储到复杂的算法实现，它们都扮演着不可或缺的角色。无论你是刚刚接触Python的新手，还是希望进一步提升编程技能的开发者，本章都将是你的宝贵指南。我们将
C语言教学大变革！DeepSeek如何改变高职院校编程课堂？武汉唯众智创 c语言开发语言程序设计 Deepseek
一、引言在当今数字化转型的浪潮中，程序设计与分析能力已成为高职教育中不可或缺的核心竞争力。作为编程语言的基础，C语言不仅训练学生的计算思维，还培养其算法实现能力。然而，当前高职院校的C语言教学面临诸多挑战，如实践环节薄弱、学生创新能力不足等。DeepSeek等新一代智能编码支持系统的出现，为这一现状带来了转机。该系统融合了深度神经网络与语义解析技术，能够智能生成代码、优化缺陷检测、解构程序逻辑，并
java数字签名三种方式知了ing java jdk
以下3钟数字签名都是基于jdk7的 1，RSA String password="test"; // 1.初始化密钥 KeyPairGenerator keyPairGenerator = KeyPairGenerator.getInstance("RSA"); keyPairGenerator.initialize(51
Hibernate学习笔记 caoyong Hibernate
1>、Hibernate是数据访问层框架，是一个ORM(Object Relation Mapping)框架，作者为:Gavin King 2>、搭建Hibernate的开发环境 a>、添加jar包: aa>、hibernatte开发包中/lib/required/所
设计模式之装饰器模式Decorator（结构型）漂泊一剑客 Decorator
1. 概述若你从事过面向对象开发，实现给一个类或对象增加行为，使用继承机制，这是所有面向对象语言的一个基本特性。如果已经存在的一个类缺少某些方法，或者须要给方法添加更多的功能（魅力），你也许会仅仅继承这个类来产生一个新类—这建立在额外的代码上。
读取磁盘文件txt，并输入String 一炮送你回车库 String
public static void main(String[] args) throws IOException { String fileContent = readFileContent("d:/aaa.txt"); System.out.println(fileContent);
js三级联动下拉框 3213213333332132 三级联动
//三级联动省/直辖市<select id="province"></select> 市/省直辖<select id="city"></select> 县/区 <select id="area"></select>
erlang之parse_transform编译选项的应用 616050468 parse_transform 游戏服务器属性同步 abstract_code
最近使用erlang重构了游戏服务器的所有代码，之前看过C++/lua写的服务器引擎代码，引擎实现了玩家属性自动同步给前端和增量更新玩家数据到数据库的功能，这也是现在很多游戏服务器的优化方向，在引擎层面去解决数据同步和数据持久化，数据发生变化了业务层不需要关心怎么去同步给前端。由于游戏过程中玩家每个业务中玩家数据更改的量其实是很少
JAVA JSON的解析 darkranger java
// { // “Total”：“条数”， // Code: 1, // // “PaymentItems”:[ // { // “PaymentItemID”:”支款单ID”, // “PaymentCode”:”支款单编号”, // “PaymentTime”:”支款日期”, // ”ContractNo”:”合同号”， //
POJ-1273-Drainage Ditches aijuans ACM_POJ
POJ-1273-Drainage Ditches http://poj.org/problem?id=1273 基本的最大流，按LRJ的白书写的 #include<iostream> #include<cstring> #include<queue> using namespace std; #define INF 0x7fffffff int ma
工作流Activiti5表的命名及含义 atongyeye 工作流 Activiti
activiti5 - http://activiti.org/designer/update在线插件安装 activiti5一共23张表 Activiti的表都以ACT_开头。第二部分是表示表的用途的两个字母标识。用途也和服务的API对应。 ACT_RE_*: 'RE'表示repository。这个前缀的表包含了流程定义和流程静态资源（图片，规则，等等）。 A
android的广播机制和广播的简单使用百合不是茶 android 广播机制广播的注册
Android广播机制简介在Android中，有一些操作完成以后，会发送广播，比如说发出一条短信，或打出一个电话，如果某个程序接收了这个广播，就会做相应的处理。这个广播跟我们传统意义中的电台广播有些相似之处。之所以叫做广播，就是因为它只负责“说”而不管你“听不听”，也就是不管你接收方如何处理。另外，广播可以被不只一个应用程序所接收，当然也可能不被任何应
Spring事务传播行为详解 bijian1013 java spring 事务传播行为
在service类前加上@Transactional，声明这个service所有方法需要事务管理。每一个业务方法开始时都会打开一个事务。 Spring默认情况下会对运行期例外(RunTimeException)进行事务回滚。这
eidtplus operate 征客丶 eidtplus
开启列模式: Alt+C 鼠标选择 OR Alt+鼠标左键拖动列模式替换或复制内容(多行): 右键-->格式-->填充所选内容-->选择相应操作 OR Ctrl+Shift+V(复制多行数据,必须行数一致) -------------------------------------------------------
【Kafka一】Kafka入门 bit1129 kafka
这篇文章来自Spark集成Kafka(http://bit1129.iteye.com/blog/2174765)，这里把它单独取出来，作为Kafka的入门吧下载Kafka http://mirror.bit.edu.cn/apache/kafka/0.8.1.1/kafka_2.10-0.8.1.1.tgz 2.10表示Scala的版本，而0.8.1.1表示Kafka
Spring 事务实现机制 BlueSkator spring 代理事务
Spring是以代理的方式实现对事务的管理。我们在Action中所使用的Service对象，其实是代理对象的实例，并不是我们所写的Service对象实例。既然是两个不同的对象，那为什么我们在Action中可以象使用Service对象一样的使用代理对象呢？为了说明问题，假设有个Service类叫AService，它的Spring事务代理类为AProxyService，AService实现了一个接口
bootstrap源码学习与示例：bootstrap-dropdown（转帖） BreakingBad bootstrap dropdown
bootstrap-dropdown组件是个烂东西，我读后的整体感觉。一个下拉开菜单的设计： <ul class="nav pull-right"> <li id="fat-menu" class="dropdown">
读《研磨设计模式》-代码笔记-中介者模式-Mediator bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ /* * 中介者模式（Mediator）：用一个中介对象来封装一系列的对象交互。 * 中介者使各对象不需要显式地相互引用，从而使其耦合松散，而且可以独立地改变它们之间的交互。 * * 在我看来，Mediator模式是把多个对象（
常用代码记录 chenjunt3 UI Excel J#
1、单据设置某行或某字段不能修改 //i是行号,"cash"是字段名称 getBillCardPanelWrapper().getBillCardPanel().getBillModel().setCellEditable(i, "cash", false); //取得单据表体所有项用以上语句做循环就能设置整行了 getBillC
搜索引擎与工作流引擎 comsci 算法工作搜索引擎网络应用
最近在公司做和搜索有关的工作，(只是简单的应用开源工具集成到自己的产品中)工作流系统的进一步设计暂时放在一边了，偶然看到谷歌的研究员吴军写的数学之美系列中的搜索引擎与图论这篇文章中的介绍，我发现这样一个关系(仅仅是猜想) -----搜索引擎和流程引擎的基础--都是图论，至少像在我在JWFD中引擎算法中用到的是自定义的广度优先
oracle Health Monitor daizj oracle Health Monitor
About Health Monitor Beginning with Release 11g, Oracle Database includes a framework called Health Monitor for running diagnostic checks on the database. About Health Monitor Checks Health M
JSON字符串转换为对象 dieslrae java json
作为前言,首先是要吐槽一下公司的脑残编译部署方式,web和core分开部署本来没什么问题,但是这丫居然不把json的包作为基础包而作为web的包,导致了core端不能使用,而且我们的core是可以当web来用的(不要在意这些细节),所以在core中处理json串就是个问题.没办法,跟编译那帮人也扯不清楚,只有自己写json的解析了.
C语言学习八结构体，综合应用，学生管理系统 dcj3sjt126com C语言
实现功能的代码： # include <stdio.h> # include <malloc.h> struct Student { int age; float score; char name[100]; }; int main(void) { int len; struct Student * pArr; int i,
vagrant学习笔记 dcj3sjt126com vagrant
想了解多主机是如何定义和使用的, 所以又学习了一遍vagrant 1. vagrant virtualbox 下载安装 https://www.vagrantup.com/downloads.html https://www.virtualbox.org/wiki/Downloads 查看安装在命令行输入vagrant 2.
14.性能优化-优化-软件配置优化 frank1234 软件配置性能优化
1.Tomcat线程池修改tomcat的server.xml文件： <Connector port="8080" protocol="HTTP/1.1" connectionTimeout="20000" redirectPort="8443" maxThreads="1200" m
一个不错的shell 脚本教程入门级 HarborChung linux shell
一个不错的shell 脚本教程入门级建立一个脚本　　Linux中有好多中不同的shell，但是通常我们使用bash (bourne again shell) 进行shell编程，因为bash是免费的并且很容易使用。所以在本文中笔者所提供的脚本都是使用bash（但是在大多数情况下，这些脚本同样可以在 bash的大姐，bourne shell中运行）。　　如同其他语言一样
Spring4新特性——核心容器的其他改进 jinnianshilongnian spring 动态代理 spring4 依赖注入
Spring4新特性——泛型限定式依赖注入 Spring4新特性——核心容器的其他改进 Spring4新特性——Web开发的增强 Spring4新特性——集成Bean Validation 1.1(JSR-349)到SpringMVC Spring4新特性——Groovy Bean定义DSL Spring4新特性——更好的Java泛型操作API Spring4新
Linux设置tomcat开机启动 liuxingguome tomcat linux 开机自启动
执行命令sudo gedit /etc/init.d/tomcat6 然后把以下英文部分复制过去。（注意第一句#!/bin/sh如果不写，就不是一个shell文件。然后将对应的jdk和tomcat换成你自己的目录就行了。 #!/bin/bash # # /etc/rc.d/init.d/tomcat # init script for tomcat precesses
第13章 Ajax进阶（下） onestopweb Ajax
index.html <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/
Troubleshooting Crystal Reports off BW blueoxygen BO
http://wiki.sdn.sap.com/wiki/display/BOBJ/Troubleshooting+Crystal+Reports+off+BW#TroubleshootingCrystalReportsoffBW-TracingBOE Quite useful, especially this part: SAP BW connectivity For t
Java开发熟手该当心的11个错误 tomcat_oracle java jvm 多线程单元测试
#1、不在属性文件或XML文件中外化配置属性。比如，没有把批处理使用的线程数设置成可在属性文件中配置。你的批处理程序无论在DEV环境中，还是UAT（用户验收测试）环境中，都可以顺畅无阻地运行，但是一旦部署在PROD 上，把它作为多线程程序处理更大的数据集时，就会抛出IOException，原因可能是JDBC驱动版本不同，也可能是#2中讨论的问题。如果线程数目可以在属性文件中配置，那么使它成为
正则表达式大全 yang852220741 html 编程正则表达式
今天向大家分享正则表达式大全，它可以大提高你的工作效率正则表达式也可以被当作是一门语言，当你学习一门新的编程语言的时候，他们是一个小的子语言。初看时觉得它没有任何的意义，但是很多时候，你不得不阅读一些教程，或文章来理解这些简单的描述模式。一、校验数字的表达式数字：^[0-9]*$ n位的数字：^\d{n}$ 至少n位的数字：^\d{n,}$ m-n位的数字：^\d{m,n}$