efficient approximate marginal inference of $x$: Appendix D
- allow us to perform all kinds of inference tasks where $p(x)$ is required, such as image denoising, inpainting, and super-resolution.
1. Derivation of the variational bound
$$\log p_\theta\left(x^{(1)},\cdots,x^{(N)}\right) = \sum_{i=1}^{N}\log p_\theta\left(x^{(i)}\right)$$
Here we use $x^i$ to denote $x^{(i)}$.
$$
\begin{aligned}
\log p(x^i) &= \int_z q(z|x^i)\log p(x^i)\,dz \quad\text{($q$ can be any distribution)}\\
&= \int_z q(z|x^i)\log\frac{p(z,x^i)}{p(z|x^i)}\,dz\\
&= \int_z q(z|x^i)\log\left[\frac{q(z|x^i)}{p(z|x^i)}\cdot\frac{p(z,x^i)}{q(z|x^i)}\right]dz\\
&= \int_z q(z|x^i)\log\frac{q(z|x^i)}{p(z|x^i)}\,dz + \int_z q(z|x^i)\log\frac{p(z,x^i)}{q(z|x^i)}\,dz\\
&= D_{KL}\left[q_\phi(z|x^i)\,\|\,p_\theta(z|x^i)\right] + \mathcal{L}_B(\theta,\phi;x^i)
\end{aligned}
$$
Because $D_{KL} \ge 0$, $\mathcal{L}_B$ is called the (variational) lower bound, and we have
$$\log p_\theta(x^i) \ge \mathcal{L}_B(\theta,\phi;x^i)$$
$$
\begin{aligned}
\mathcal{L}_B(\theta,\phi;x^i) &= \mathbb{E}_{q_\phi(z|x)}\left[\log p_\theta(x,z) - \log q_\phi(z|x)\right]\\
&= \int_z q(z|x^i)\log\frac{p(z,x^i)}{q(z|x^i)}\,dz\\
&= \int_z q(z|x^i)\log\frac{p(x^i|z)\,p(z)}{q(z|x^i)}\,dz\\
&= -D_{KL}\left[q(z|x^i)\,\|\,p(z)\right] + \int_z q(z|x^i)\log p(x^i|z)\,dz\\
&= -D_{KL}\left[q_\phi(z|x^i)\,\|\,p_\theta(z)\right] + \mathbb{E}_{q_\phi(z|x^i)}\left[\log p_\theta(x^i|z)\right]
\end{aligned}
$$
- $-D_{KL}\left[q_\phi(z|x^i)\,\|\,p_\theta(z)\right]$: acts as a regularizer
- $\mathbb{E}_{q_\phi(z|x^i)}\left[\log p_\theta(x^i|z)\right]$: an expected negative reconstruction error
- TARGET: differentiate and optimize $\mathcal{L}_B$ w.r.t. both $\phi$ and $\theta$. However, $\nabla_\phi \mathcal{L}_B$ is problematic.
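As a quick sanity check of the decomposition $\log p(x) = D_{KL}\left[q(z|x)\,\|\,p(z|x)\right] + \mathcal{L}_B$ derived above, here is a tiny discrete example in Python (all the probability values are arbitrary, chosen only for illustration):

```python
import numpy as np

# Toy check of log p(x) = D_KL[q(z|x) || p(z|x)] + L_B for one fixed x,
# with z taking 3 discrete values (all probabilities below are arbitrary).
p_zx = np.array([0.10, 0.25, 0.05])   # joint p(z, x) at this x
q = np.array([0.5, 0.3, 0.2])         # q(z|x) can be any distribution
p_x = p_zx.sum()                      # marginal p(x)
p_z_given_x = p_zx / p_x              # true posterior p(z|x)

kl = np.sum(q * np.log(q / p_z_given_x))   # D_KL[q || p(z|x)]
elbo = np.sum(q * np.log(p_zx / q))        # L_B = E_q[log p(z,x) - log q]
print(np.log(p_x), kl + elbo)              # both print the same number
```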
2. Solution 1: Naive Monte Carlo gradient estimator
Since I have not yet had time to study the theory of Monte Carlo gradient estimators, and judging from the formulas later in the VAE paper, I believe the estimator should read as follows (my math background is not deep enough to be sure the two forms are equivalent):

$$\nabla_\phi\mathbb{E}_{q_\phi(z)}\left[f(z)\right] = \mathbb{E}_{q_\phi(z)}\left[f(z)\,\nabla_\phi\log q_\phi(z)\right] \simeq \frac{1}{L}\sum_{l=1}^{L}f(z^l)\,\nabla_\phi\log q_\phi(z^l) \quad\text{where } z^l\sim q_\phi(z|x^i)$$

This estimator is unbiased but exhibits very high variance, which is what makes $\nabla_\phi\mathcal{L}_B$ problematic and motivates the SGVB estimator below.
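A minimal NumPy sketch illustrating this behavior, with a hypothetical $q = \mathcal{N}(\mu, 1)$ and $f(z) = z^2$, for which the exact gradient is known to be $2\mu$; averaging many trials shows the estimator is unbiased, but the spread across trials is what makes it impractical:

```python
import numpy as np

# Naive score-function estimate of d/dmu E_{z~N(mu,1)}[f(z)] with f(z) = z^2.
# Exact value: E[z^2] = mu^2 + 1, so the gradient w.r.t. mu is 2*mu.
rng = np.random.default_rng(0)
mu, L = 1.5, 100

def f(z):
    return z ** 2

estimates = []
for _ in range(1000):
    z = rng.normal(mu, 1.0, size=L)
    score = z - mu                    # d/dmu log N(z; mu, 1) = (z - mu)
    estimates.append(np.mean(f(z) * score))
print(np.mean(estimates), "vs exact", 2 * mu)   # unbiased on average
print(np.std(estimates))                        # but with high variance
```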
3. Solution 2: SGVB estimator
- reparameterize $\tilde{z} \sim q_\phi(z|x)$ using a differentiable transformation $g_\phi(\epsilon, x)$ of an (auxiliary) noise variable $\epsilon$
- form Monte Carlo estimates as follows:
$$q_\phi(z|x)\prod_i dz_i = p(\epsilon)\prod_i d\epsilon_i$$
$$\int q_\phi(z|x)f(z)\,dz = \int p(\epsilon)f(z)\,d\epsilon = \int p(\epsilon)f\left(g_\phi(\epsilon,x)\right)d\epsilon$$
$$\mathbb{E}_{q_\phi(z|x^i)}\left[f(z)\right] = \mathbb{E}_{p(\epsilon)}\left[f\left(g_\phi(\epsilon,x^i)\right)\right] \simeq \frac{1}{L}\sum_{l=1}^{L}f\left(g_\phi(\epsilon^l,x^i)\right)\quad\text{where } \epsilon^l\sim p(\epsilon)$$
- apply the MC estimator technique to $\mathcal{L}_B(\theta,\phi;x^i)$, yielding two SGVB estimators $\tilde{\mathcal{L}}^A$ and $\tilde{\mathcal{L}}^B$ (see the sketch after the equations):
$$\tilde{\mathcal{L}}^A(\theta,\phi;x^i) = \frac{1}{L}\sum_{l=1}^{L}\left[\log p_\theta(x^i,z^{i,l}) - \log q_\phi(z^{i,l}|x^i)\right]$$
$$\tilde{\mathcal{L}}^B(\theta,\phi;x^i) = -D_{KL}\left(q_\phi(z|x^i)\,\|\,p_\theta(z)\right) + \frac{1}{L}\sum_{l=1}^{L}\log p_\theta(x^i|z^{i,l})$$
$$\text{where } z^{i,l} = g_\phi(\epsilon^{i,l}, x^i)\text{ and }\epsilon^l\sim p(\epsilon)$$
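For contrast with the naive estimator above, here is a NumPy sketch of the reparameterized Monte Carlo estimate for a hypothetical diagonal-Gaussian $q$ and the same kind of test function (the $\mu$, $\sigma$ values are arbitrary):

```python
import numpy as np

# Reparameterized estimate of E_{q_phi(z|x)}[f(z)] for a diagonal Gaussian q:
# z = g_phi(eps, x) = mu + sigma * eps with eps ~ N(0, I).
rng = np.random.default_rng(0)
mu = np.array([0.5, -1.0])            # illustrative variational parameters
sigma = np.array([1.0, 0.3])
L = 10_000

def f(z):
    return (z ** 2).sum(axis=-1)

eps = rng.standard_normal((L, 2))     # eps^l ~ p(eps)
z = mu + sigma * eps                  # z^l = g_phi(eps^l, x)
print(f(z).mean(), "vs exact", (mu**2 + sigma**2).sum())  # E[z_j^2] = mu_j^2 + sigma_j^2
```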
- minibatch with size $M$:
$$\mathcal{L}_B(\theta,\phi;X) \simeq \tilde{\mathcal{L}}^M(\theta,\phi;X^M) = \frac{N}{M}\sum_{i=1}^{M}\tilde{\mathcal{L}}(\theta,\phi;x^i)$$
- the number of samples $L$ per datapoint can be set to 1 as long as the minibatch size $M$ is large enough, e.g. $M = 100$
3.1 Minibatch version of AEVB algorithm
- $\theta, \phi \leftarrow$ Initialize parameters
- repeat
- $X^M \leftarrow$ Random minibatch of $M$ datapoints
- $\epsilon \leftarrow$ Random samples from noise distribution $p(\epsilon)$
- $g \leftarrow \nabla_{\theta,\phi}\,\tilde{\mathcal{L}}^M(\theta,\phi;X^M,\epsilon)$
- $\theta, \phi \leftarrow$ Update parameters using gradients $g$ (e.g. SGD or Adagrad)
- until convergence of parameters $\theta, \phi$
- return $\theta, \phi$
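A minimal runnable PyTorch sketch of this loop, using the generic estimator $\tilde{\mathcal{L}}^A$ with a Gaussian $q_\phi(z|x)$, a Bernoulli decoder, and $L = 1$; the layer sizes, learning rate, and random stand-in minibatch are illustrative assumptions, not the paper's exact setup:

```python
import math
import torch
from torch import nn

# AEVB with the generic estimator L~A (Gaussian q, Bernoulli p(x|z), L = 1).
D, H, J, M = 784, 400, 20, 100                  # data/hidden/latent dims, batch
enc = nn.Sequential(nn.Linear(D, H), nn.Tanh(), nn.Linear(H, 2 * J))
dec = nn.Sequential(nn.Linear(J, H), nn.Tanh(), nn.Linear(H, D))
opt = torch.optim.Adagrad([*enc.parameters(), *dec.parameters()], lr=0.01)

for step in range(100):                          # repeat ... until convergence
    x = (torch.rand(M, D) > 0.5).float()         # stand-in minibatch X^M
    mu, logvar = enc(x).chunk(2, dim=1)          # q_phi(z|x) parameters
    eps = torch.randn_like(mu)                   # eps ~ p(eps) = N(0, I)
    z = mu + (0.5 * logvar).exp() * eps          # z = g_phi(eps, x)
    log_q = -0.5 * (math.log(2 * math.pi) + logvar + eps**2).sum(1)
    log_pz = -0.5 * (math.log(2 * math.pi) + z**2).sum(1)
    log_px_z = -nn.functional.binary_cross_entropy_with_logits(
        dec(z), x, reduction="none").sum(1)      # log p_theta(x|z)
    loss = -(log_pz + log_px_z - log_q).mean()   # -L~A; the N/M factor only rescales g
    opt.zero_grad(); loss.backward(); opt.step() # g, then update theta, phi
```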
Choice of $q_\phi(z|x)$, $p(\epsilon)$, and $g_\phi(\epsilon,x)$
There are three basic approaches (a sketch of all three follows this list):
- Tractable inverse CDF. Let $\epsilon \sim \mathcal{U}(0, I)$ and let $g_\phi(\epsilon, x)$ be the inverse CDF of $q_\phi(z|x)$.
- Examples: Exponential, Cauchy, Logistic, Rayleigh, Pareto, Weibull, Reciprocal, Gompertz, Gumbel and Erlang distributions.
- "Location-scale" family of distributions: choose the standard distribution (with location $= 0$, scale $= 1$) as the auxiliary variable $\epsilon$, and let $g(\cdot) = \text{location} + \text{scale}\cdot\epsilon$
- Examples: Laplace, Elliptical, Student’s t, Logistic, Uniform, Triangular and Gaussian distributions.
- The VAE introduced below falls into this case.
- Composition: It is often possible to express random variables as different transformations of auxiliary variables.
- Examples: Log-Normal (exponentiation of normally distributed variable), Gamma (a sum over exponentially distributed variables), Dirichlet (weighted sum of Gamma variates), Beta, Chi-Squared, and F distributions.
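A NumPy sketch of all three approaches, using one concrete (illustrative) distribution each: the Exponential via its inverse CDF, the Gaussian via location-scale, and the Log-Normal via composition (exponentiation of a Gaussian variate):

```python
import numpy as np

rng = np.random.default_rng(0)
u = rng.uniform(size=100_000)          # auxiliary eps ~ U(0, 1)
eps = rng.standard_normal(100_000)     # auxiliary eps ~ N(0, 1)

# 1. Tractable inverse CDF: Exponential(rate) via g(u) = -log(1 - u) / rate
rate = 2.0
z_exp = -np.log1p(-u) / rate

# 2. Location-scale: N(mu, sigma^2) via g(eps) = location + scale * eps
mu, sigma = 1.0, 0.5
z_gauss = mu + sigma * eps

# 3. Composition: Log-Normal = exponentiation of a Gaussian variate
z_lognorm = np.exp(mu + sigma * eps)

print(z_exp.mean(), "~", 1 / rate)                    # exponential mean
print(z_gauss.mean(), "~", mu)                        # Gaussian mean
print(z_lognorm.mean(), "~", np.exp(mu + sigma**2 / 2))  # log-normal mean
```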
Variational Auto-Encoder
- let $p_\theta(z) = \mathcal{N}(z; 0, I)$
- $p_\theta(z|x)$ is intractable
- use a neural network for $q_\phi(z|x)$
- $\phi$ and $\theta$ are optimized jointly with the AEVB algorithm
- the parameters of $p_\theta(x|z)$ are computed from $z$ with an MLP (multi-layer perceptron, a fully-connected neural network with a single hidden layer)
- Multivariate Gaussian: in case of real-valued data (a decoder sketch follows this list)
$$\log p(x|z) = \log\mathcal{N}(x;\mu,\sigma^2 I)$$
$$\text{where}\quad \mu = W_\mu h + b_\mu,\qquad \log\sigma^2 = W_\sigma h + b_\sigma,\qquad h = \tanh(W_h z + b_h)$$
- Bernoulli: in case of binary data
$$\log p(x|z) = \sum_{i=1}^{D} x_i\log y_i + (1-x_i)\log(1-y_i)$$
$$\text{where}\quad y = f_\sigma\left(W_y\tanh(W_h z + b_h) + b_y\right),\qquad f_\sigma(\cdot): \text{element-wise sigmoid activation function}$$
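A short PyTorch sketch of the Gaussian decoder MLP above (layer sizes are illustrative assumptions; the Bernoulli case is analogous, with a sigmoid output instead):

```python
import math
import torch
from torch import nn

# Gaussian decoder MLP matching the equations above (sizes illustrative).
J, H, D = 20, 400, 784
W_h, W_mu, W_sigma = nn.Linear(J, H), nn.Linear(H, D), nn.Linear(H, D)

def log_p_x_given_z(x, z):
    h = torch.tanh(W_h(z))             # h = tanh(W_h z + b_h)
    mu = W_mu(h)                       # mu = W_mu h + b_mu
    logvar = W_sigma(h)                # log sigma^2 = W_sigma h + b_sigma
    # log N(x; mu, sigma^2 I), summed over the D data dimensions
    return -0.5 * (math.log(2 * math.pi) + logvar
                   + (x - mu) ** 2 / logvar.exp()).sum(dim=1)
```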

From the above, we have:
$$\log q_\phi(z|x^i) = \log\mathcal{N}\left(z;\mu^i,(\sigma^i)^2 I\right)$$
where $\mu^i$ and $\sigma^i$ are outputs of the encoding MLP. We sample from $z^{i,l} \sim q_\phi(z|x^i)$ using $z^{i,l} = g_\phi(x^i,\epsilon^l) = \mu^i + \sigma^i \odot \epsilon^l$, where $\epsilon^l \sim \mathcal{N}(0,I)$ and $\odot$ denotes the element-wise product.
Here both $p_\theta(z)$ and $q_\phi(z|x)$ are Gaussian, so we can use the estimator $\tilde{\mathcal{L}}^B$, since the KL divergence term is analytical. Then we have:
$$
\begin{aligned}
\mathcal{L}(\theta,\phi;x^i) &\simeq -D_{KL}\left(q_\phi(z|x^i)\,\|\,p_\theta(z)\right) + \frac{1}{L}\sum_{l=1}^{L}\log p_\theta(x^i|z^{i,l})\\
&\simeq \frac{1}{2}\sum_{j=1}^{J}\left(1+\log\left((\sigma_j^i)^2\right) - (\mu_j^i)^2 - (\sigma_j^i)^2\right) + \frac{1}{L}\sum_{l=1}^{L}\log p_\theta(x^i|z^{i,l})
\end{aligned}
$$
$$\text{where } z^{i,l} = \mu^i + \sigma^i\odot\epsilon^l \text{ and } \epsilon^l\sim\mathcal{N}(0,I)$$
Solution of $-D_{KL}\left(q_\phi(z)\,\|\,p_\theta(z)\right)$, Gaussian case
Let $J$ be the dimensionality of $z$; then we have:
$$\int q_\theta(z)\log p_\theta(z)\,dz = \int\mathcal{N}(z;\mu,\sigma^2)\log\mathcal{N}(z;0,I)\,dz = -\frac{J}{2}\log(2\pi) - \frac{1}{2}\sum_{j=1}^{J}\left(\mu_j^2+\sigma_j^2\right)$$
And:
$$\int q_\theta(z)\log q_\theta(z)\,dz = \int\mathcal{N}(z;\mu,\sigma^2)\log\mathcal{N}(z;\mu,\sigma^2)\,dz = -\frac{J}{2}\log(2\pi) - \frac{1}{2}\sum_{j=1}^{J}\left(1+\log\sigma_j^2\right)$$
Therefore:
$$-D_{KL}\left(q_\phi(z)\,\|\,p_\theta(z)\right) = \int q_\theta(z)\left(\log p_\theta(z) - \log q_\theta(z)\right)dz = \frac{1}{2}\sum_{j=1}^{J}\left(1+\log\sigma_j^2 - \mu_j^2 - \sigma_j^2\right)$$
Proof sketch: both integrals follow from the Gaussian moments $\mathbb{E}_q[z_j] = \mu_j$ and $\mathbb{E}_q[z_j^2] = \mu_j^2 + \sigma_j^2$, applied term by term to $\log\mathcal{N}(z;0,I)$ and $\log\mathcal{N}(z;\mu,\sigma^2)$.
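As a quick numerical sanity check, the sketch below compares this closed form against a Monte Carlo estimate of $\mathbb{E}_q\left[\log p(z) - \log q(z)\right]$ (the $\mu$, $\sigma$ values are arbitrary):

```python
import numpy as np

# Compare the closed-form -KL(N(mu, sigma^2 I) || N(0, I)) with an MC estimate.
rng = np.random.default_rng(0)
mu = np.array([0.5, -1.0, 2.0])
sigma = np.array([1.2, 0.4, 0.9])

closed = 0.5 * np.sum(1 + np.log(sigma**2) - mu**2 - sigma**2)

z = mu + sigma * rng.standard_normal((200_000, 3))      # z ~ q
log_q = -0.5 * np.sum(np.log(2 * np.pi * sigma**2) + ((z - mu) / sigma)**2, axis=1)
log_p = -0.5 * np.sum(np.log(2 * np.pi) + z**2, axis=1)
print(closed, np.mean(log_p - log_q))    # agree up to Monte Carlo error
```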

Visualization
Since the prior of the latent space is Gaussian, linearly spaced coordinates on the unit square were transformed through the inverse CDF of the Gaussian to produce values of the latent variables $z$.
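A sketch of that mapping for a 2-D latent space, using SciPy's Gaussian inverse CDF (`norm.ppf`); `decode` in the comment stands in for the learned $p_\theta(x|z)$ decoder and is hypothetical:

```python
import numpy as np
from scipy.stats import norm

# Linearly spaced unit-square coordinates -> latent grid via the inverse CDF.
n = 20
grid = np.linspace(0.05, 0.95, n)       # stay away from the 0/1 endpoints
zs = np.array([[norm.ppf(u), norm.ppf(v)] for u in grid for v in grid])
# Each row of zs is a latent z; decoding them, e.g. images = decode(zs),
# renders the learned 2-D manifold (decode = the trained decoder, hypothetical).
```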