What is LightGBM, How to implement it? How to fine tune the parameters?

Hello,

Data science is one of the fastest growing fields in the world. New algorithms are launched all the time; some of them fail and some reach the peak of success. Today, I am covering one of the most successful machine learning algorithms, Light GBM.

What motivated me to write a blog on LightGBM?

While working on Kaggle data science competitions I came across multiple powerful algorithms, and LightGBM is one of them. LightGBM is a relatively new algorithm, and there are not many reading resources about it on the internet apart from its documentation. It is difficult for a beginner to choose parameters from the long list given there, so I am writing this blog to help newcomers get started.

I will try my best to keep this blog short and simple; piling on pages of irrelevant information would only confuse you.

What is Light GBM?

Light GBM is a gradient boosting framework that uses tree-based learning algorithms.

How does it differ from other tree-based algorithms?

Light GBM grows trees vertically while other algorithms grow trees horizontally, meaning that Light GBM grows trees leaf-wise while other algorithms grow level-wise. It chooses the leaf with the maximum delta loss to grow. When growing the same leaf, the leaf-wise algorithm can reduce more loss than the level-wise algorithm.

The diagrams below show how LightGBM and other boosting algorithms grow their trees.

[Diagram: LightGBM grows trees leaf-wise]
[Diagram: other boosting algorithms grow trees level-wise]
Why is Light GBM gaining so much popularity?

The size of datasets is increasing day by day, and it is becoming difficult for traditional data science algorithms to deliver results quickly. Light GBM is prefixed with 'Light' because of its high speed: it can handle large datasets while using relatively little memory. Another reason for its popularity is its focus on the accuracy of results. LGBM also supports GPU learning, so data scientists widely use it for data science application development.

Can we use Light GBM everywhere?

No, it is not advisable to use LGBM on small datasets. Light GBM is sensitive to overfitting and can easily overfit small data. There is no hard threshold on the number of rows, but in my experience it works best on data with 10,000+ rows.

We have briefly discussed the concept of Light GBM; now what about its implementation?

Implementing Light GBM is easy; the only complicated part is parameter tuning. Light GBM has more than 100 parameters, but don't worry, you don't need to learn all of them.

It is very important for an implementer to know at least the basic parameters of Light GBM. If you carefully go through the following parameters, I bet you will find this powerful algorithm a piece of cake.

Let’s start discussing parameters.

Parameters

Control Parameters

max_depth: The maximum depth of a tree. This parameter is used to control overfitting; any time you feel that your model is overfitting, my first advice is to lower max_depth.

min_data_in_leaf: The minimum number of records a leaf may have. The default value is 20, which is often a good starting point. It is also used to deal with overfitting.

feature_fraction: The fraction of features selected randomly in each iteration for building trees; a feature_fraction of 0.8 means LightGBM selects 80% of the features before building each tree. It is especially relevant when your boosting type (discussed later) is random forest.

bagging_fraction: Specifies the fraction of data to be used in each iteration; it is generally used to speed up training and avoid overfitting.

early_stopping_round: This parameter can help you speed up your analysis. The model will stop training if one metric of one validation set does not improve in the last early_stopping_round rounds, which avoids excessive iterations.

lambda: Specifies regularization (lambda_l1 for L1, lambda_l2 for L2). Typical values range from 0 to 1.

min_gain_to_split: The minimum gain required to make a split. It can be used to control the number of useful splits in a tree.

max_cat_group: When the number of categories is large, finding a split point on them easily overfits, so LightGBM merges categories into 'max_cat_group' groups and finds split points on the group boundaries. Default: 64.
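As a quick reference, here is an illustrative sketch of how the control parameters above might look together in a parameter dictionary; the values are placeholders, not recommendations, and bagging_freq (which pairs with bagging_fraction) is covered again in the tuning tips later.

control_params = {
    'max_depth': 7,              # cap tree depth to fight overfitting
    'min_data_in_leaf': 20,      # minimum records per leaf (default is 20)
    'feature_fraction': 0.8,     # randomly use 80% of features per iteration
    'bagging_fraction': 0.8,     # randomly use 80% of data per iteration
    'bagging_freq': 5,           # perform bagging every 5 iterations
    'lambda_l2': 0.2,            # regularization strength
    'min_gain_to_split': 0.01,   # minimum gain required to make a split
    'max_cat_group': 64,         # merge categories into at most 64 groups
}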

Core Parameters

task: Specifies the task you want to perform on the data; it may be either train or predict.

application: This is the most important parameter; it specifies the application of your model, i.e. whether it is a regression problem or a classification problem. By default LightGBM treats the model as a regression model.

  • regression: for regression
  • binary: for binary classification
  • multiclass: for multiclass classification problem

boosting: Defines the type of algorithm you want to run. Default: gbdt.

  • gbdt: traditional Gradient Boosting Decision Tree
  • rf: random forest
  • dart: Dropouts meet Multiple Additive Regression Trees
  • goss: Gradient-based One-Side Sampling

num_boost_round: Number of boosting iterations, typically 100+

learning_rate: This determines the impact of each tree on the final outcome. GBM works by starting with an initial estimate which is updated using the output of each tree. The learning rate controls the magnitude of this change in the estimates. Typical values: 0.1, 0.001, 0.003…

num_leaves: The maximum number of leaves in one tree. Default: 31.

device: Default: cpu; you can also pass gpu.
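To see how the core parameters fit together, here is a rough, illustrative sketch; it assumes you already have an lgb.Dataset called train_data (as built in the implementation section below), and the values are not recommendations.

import lightgbm as lgb

core_params = {
    'application': 'binary',   # 'objective' is an alias; 'regression' and 'multiclass' also work
    'boosting': 'gbdt',        # or 'rf', 'dart', 'goss'
    'learning_rate': 0.1,      # contribution of each tree to the final outcome
    'num_leaves': 31,          # default number of leaves
    'device': 'cpu',           # 'gpu' if LightGBM was built with GPU support
}
model = lgb.train(core_params, train_data, num_boost_round=100)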

Metric parameter

metric: Again one of the important parameters, as it specifies the loss used for model building. Below are a few common losses for regression and classification, followed by a small example of passing more than one metric.

  • mae: mean absolute error
  • mse: mean squared error
  • binary_logloss: loss for binary classification
  • multi_logloss: loss for multi classification
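You can track more than one metric at once by passing a list; a tiny sketch for a regression problem, using the metric names from the list above:

params['metric'] = ['mae', 'mse']  # track both mean absolute and mean squared error
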
IO parameter

max_bin: The maximum number of bins that feature values will be bucketed into.

categorical_feature: The indices of the categorical features. If categorical_feature=0,1,2 then columns 0, 1 and 2 are treated as categorical variables.

ignore_column: Works the same way as categorical_feature, but instead of treating the specified columns as categorical, it ignores them completely.

save_binary: If memory or loading time is a concern for your data file, set this parameter to 'true'. The dataset will then be saved to a binary file, which speeds up reading the data the next time.
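As a rough sketch of the IO parameters in use; the column indices and file name below are hypothetical, purely for illustration.

import lightgbm as lgb

# Suppose columns 0 and 1 of x_train were categorical
d_train = lgb.Dataset(x_train, label=y_train,
                      categorical_feature=[0, 1],
                      params={'max_bin': 255})
d_train.save_binary('train.bin')  # the saved binary file speeds up future loading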

Knowing and using the above parameters will definitely help you implement the model. Remember, I said that implementing LightGBM is easy but parameter tuning is difficult, so let's start with the implementation and then move on to parameter tuning.

Implementation

Installing LightGBM:

Installing LightGBM is a crucial first step. I found this to be the best resource to guide you through LightGBM installation.

I am using Anaconda, and installing LightGBM on Anaconda is a cinch. Just run the following command in your Anaconda command prompt and, whoosh, LightGBM is on your PC.

conda install -c conda-forge lightgbm
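If you are not using Anaconda, installing with pip also works, and a quick import afterwards confirms the installation:

pip install lightgbm

# then, in Python:
import lightgbm as lgb
print(lgb.__version__)  # prints the installed LightGBM version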

Dataset:

This dataset is very small, just 400 rows and 5 columns (chosen specifically for learning purposes). It is a classification problem where we have to predict whether a customer will buy the product from an advertisement shown on the website. I am not explaining the dataset further as it is self-explanatory. You can download the dataset from my drive.

Note: The dataset is clean and has no missing values. The main aim behind choosing such a small dataset is to keep things simple and understandable.

I am assuming that you all know the basics of Python. Go through the data preprocessing steps; they are fairly easy, but if you have any doubt then ask me in the comments and I will get back to you as soon as possible.

Data preprocessing:

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
# Importing the dataset
dataset = pd.read_csv('...input\\Social_Network_Ads.csv')
X = dataset.iloc[:, [2, 3]].values
y = dataset.iloc[:, 4].values
# Splitting the dataset into the Training set and Test set
from sklearn.model_selection import train_test_split  # cross_validation was replaced by model_selection in newer scikit-learn versions
x_train, x_test, y_train, y_test = train_test_split(X, y, test_size = 0.25, random_state = 0)
# Feature Scaling
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
x_train = sc.fit_transform(x_train)
x_test = sc.transform(x_test)

Model building and training:

We need to convert our training data into LightGBM's Dataset format (this is mandatory for training with lgb.train).

After converting the dataset, I created a Python dictionary with parameters and their values. The accuracy of your model depends heavily on the values you provide for these parameters.

In the last line of the code block, I simply train the model for 100 iterations.

import lightgbm as lgb
d_train = lgb.Dataset(x_train, label=y_train)
params = {}
params['learning_rate'] = 0.003
params['boosting_type'] = 'gbdt'
params['objective'] = 'binary'         # binary classification
params['metric'] = 'binary_logloss'
params['sub_feature'] = 0.5            # alias for feature_fraction
params['num_leaves'] = 10
params['min_data'] = 50                # alias for min_data_in_leaf
params['max_depth'] = 10
clf = lgb.train(params, d_train, 100)  # train for 100 boosting iterations
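If you also hold out a validation set, you can use the early_stopping_round behaviour discussed earlier to cut training short. A rough sketch follows; the callback API shown is for recent LightGBM versions (older versions take an early_stopping_rounds argument to lgb.train instead), and it reuses the test split as a validation set purely for illustration.

d_valid = lgb.Dataset(x_test, label=y_test, reference=d_train)
clf = lgb.train(params, d_train, num_boost_round=1000,
                valid_sets=[d_valid],
                callbacks=[lgb.early_stopping(stopping_rounds=50)])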

Few things to notice in parameters:

  • Used 'binary' as the objective (remember, this is a classification problem)
  • Used 'binary_logloss' as the metric (same reason: binary classification)
  • num_leaves = 10 (as it is a small dataset)
  • boosting_type is gbdt: we are implementing gradient boosting (you can try random forest; see the sketch below)
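If you do try random forest, note that, as far as I know, LightGBM's rf mode requires bagging to be enabled, roughly like this:

params['boosting_type'] = 'rf'
params['bagging_fraction'] = 0.8  # rf mode needs a bagging fraction below 1.0
params['bagging_freq'] = 1        # and a non-zero bagging frequency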

Model prediction:

We just need one line of code for predictions.

The output will be a list of probabilities. I converted the probabilities to binary predictions using a threshold of 0.5.

#Prediction
y_pred = clf.predict(x_test)
#convert probabilities into binary values
for i in range(len(y_pred)):
    if y_pred[i] >= .5:  # setting threshold to .5
        y_pred[i] = 1
    else:
        y_pred[i] = 0
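As an aside, since clf.predict returns a NumPy array, the same thresholding can be written in one vectorized line:

y_pred = (clf.predict(x_test) >= 0.5).astype(int)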

Results:

We can check the results either by using a confusion matrix or by directly calculating the accuracy.

Code:

#Confusion matrix
from sklearn.metrics import confusion_matrix
cm = confusion_matrix(y_test, y_pred)
#Accuracy
from sklearn.metrics import accuracy_score
accuracy = accuracy_score(y_test, y_pred)

Screenshots of the results:

[Screenshot: confusion matrix]
[Screenshot: accuracy score]

Many of you must be thinking: I used a small dataset, yet the model reaches 92% accuracy. Why is there no overfitting? The simple reason is that I fine-tuned the model parameters.

So now let’s jump into parameter fine tuning.

Parameter Tuning:

Data scientists always struggle with deciding when to use which parameter, and what its ideal value should be.

The following set of practices can be used to improve your model's efficiency.

  1. num_leaves: This is the main parameter to control the complexity of the tree model. Ideally, the value of num_leaves should be less than or equal to 2^(max_depth). A value higher than this will result in overfitting.
  2. min_data_in_leaf: Setting it to a large value can avoid growing too deep a tree, but may cause under-fitting. In practice, setting it to hundreds or thousands is enough for a large dataset.
  3. max_depth: You can also use max_depth to limit the tree depth explicitly. A simple cross-validation sketch for trying such values follows below.
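One practical way to choose such values is cross-validation. Below is a minimal sketch using lgb.cv; the candidate values, fold count and seed are purely illustrative assumptions, and it reuses the params dictionary and x_train/y_train arrays from the implementation above.

import lightgbm as lgb

d_cv = lgb.Dataset(x_train, label=y_train)        # fresh Dataset just for cross-validation
best_leaves, best_loss = None, float('inf')
for leaves in [10, 20, 31]:                       # illustrative candidate values
    trial_params = dict(params, num_leaves=leaves)
    cv_results = lgb.cv(trial_params, d_cv, num_boost_round=100, nfold=5, seed=0)
    # the result key name varies slightly across LightGBM versions, so look up the '-mean' entry
    mean_key = [k for k in cv_results if k.endswith('-mean')][0]
    loss = min(cv_results[mean_key])
    if loss < best_loss:
        best_leaves, best_loss = leaves, loss
print('best num_leaves:', best_leaves)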

For Faster Speed:

  • Use bagging by setting bagging_fraction and bagging_freq
  • Use feature sub-sampling by setting feature_fraction
  • Use small max_bin
  • Use save_binary to speed up data loading in future learning
  • Use parallel learning, refer to parallel learning guide.

For better accuracy:

  • Use large max_bin (may be slower)
  • Use small learning_rate with large num_iterations
  • Use large num_leaves (may cause over-fitting)
  • Use bigger training data
  • Try dart
  • Try to use categorical feature directly

To deal with over-fitting:

  • Use small max_bin
  • Use small num_leaves
  • Use min_data_in_leaf and min_sum_hessian_in_leaf
  • Use bagging by setting bagging_fraction and bagging_freq
  • Use feature sub-sampling by setting feature_fraction
  • Use bigger training data
  • Try lambda_l1, lambda_l2 and min_gain_to_split for regularization
  • Try max_depth to avoid growing a deep tree (see the sketch after this list)
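For example, a starting point when a model is overfitting might look like the following sketch; the numbers are illustrative only, not recommendations, and it builds on the params dictionary used earlier.

anti_overfit_params = dict(params)  # copy the earlier parameter dictionary
anti_overfit_params.update({
    'num_leaves': 10,          # smaller trees
    'min_data_in_leaf': 100,   # require more records per leaf
    'max_depth': 5,            # cap tree depth
    'feature_fraction': 0.8,   # sub-sample features
    'bagging_fraction': 0.8,   # sub-sample rows
    'bagging_freq': 5,
    'lambda_l1': 0.1,          # L1 / L2 regularization
    'lambda_l2': 0.1,
})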

Conclusion:

I have implemented LightGBM on multiple datasets and found that its accuracy rivals other boosting algorithms. Based on my experience, I would always recommend trying this algorithm at least once.

I hope you enjoyed this blog and found it useful. I would request you to share suggestions that will help improve this blog.

Source: Microsoft LightGBM Documentation

Thanks,

Pushkar Mandot
