fuliang

NLP Resources

Tools : Machine Translation , POS Taggers , NP chunking , Sequence models , Parsers , Semantic Parsers/SRL , NER , Coreference , Language models , Concordances , Summarization , Other

Corpora : Large collections , Particular languages , Treebanks , Discourse , WSD , Literature , Acquisition

SGML/XML

Dictionaries

Lexical/morphological resources

Courses, Syllabi, and other Educational Resources

Mailing lists

Other stuff on the Web : General , IR , IE/Wrappers , People , Societies

Tools

Machine Translation systems

Instructions

Building a baseline statistical phrase MT system

Wonderful pages about how to download a bunch of tools and some data and put them together to build a very competent baseline statistical MT system: NAACL 2006 WMt or 2009 WMT .

Freely downloadable

EGYPT system

System from 1999 JHU workshop. Mainly of historical interest.

GIZA++ and mkcls

Franz Och. C++. GPL.

Thot

Phrase-based model building kit

Phramer

An Open-Source Java Statistical Phrase-Based MT Decoder

Moses

A new open-source phrase-based MT decoder with functionality beyond Pharaoh.

Syntax Augmented Machine Translation via Chart Parsing

Andreas Zollmann and Ashish Venugopal

Free, but getting them requires hassle

Pharaoh decoder

Philip Koehn, ISI.

MTTK

Machine Translation Tool Kit. Deng and Byrne.

Part of Speech Taggers

Freely downloadable

Stanford POS tagger

Loglinear tagger in Java (by Kristina Toutanova)

hunpos

An HMM tagger with models available for English and Hungarian. A reimplementation of TnT (see below) in OCaml. pre-compiled models. Runs on Linux, Mac OS X, and Windows.

MBT: Memory-based Tagger

Based on TiMBL

TreeTagger

A decision tree based tagger from the University of Stuttgart (Helmut Scmid). It's language independent, but comes complete with parameter files for English, German, Italian, Dutch, French, Old French, Spanish, Bulgarian, and Russian. (Linux, Sparc-Solaris, Windows, and Mac OS X versions. Binary distribution only.) Page has links to sites where you can run it online.

SVMTool

POS Tagger based on SVMs (uses SVMlight). LGPL.

ACOPOST (formerly ICOPOST)

Open source C taggers originally written by by Ingo Schröder. Implements maximum entropy, HMM trigram, and transformation-based learning. C source available under GNU public license.

MXPOST : Adwait Ratnaparkhi's Maximum Entropy part of speech tagger

Java POS tagger. A sentence boundary detector (MXTERMINATOR) is also included. Original version was only JDK1.1; later version worked with JDK1.3+. Class files, not source.

fnTBL

A fast and flexible implementation of Transformation-Based Learning in C++. Includes a POS tagger, but also NP chunking and general chunking models.

mu-TBL

An implementation of a Transformation-based Learner (a la Brill), usable for POS tagging and other things by Torbjörn Lager. Web demo also available. Prolog.

YamCha

SVM-based NP-chunker, also usable for POS tagging, NER, etc. C/C++ open source. Won CoNLL 2000 shared task. (Less automatic than a specialized POS tagger for an end user.)

QTAG Part of speech tagger

An HMM-based Java POS tagger from Birmingham U. (Oliver Mason). English and German parameter files. [Java class files, not source.]

The TOSCA/LOB tagger .

Currently available for MS-DOS only. But the decision to make this famous system available is very interesting from an historical perspective, and for software sharing in academia more generally. LOB tag set.

The venerable Brill's Transformation-based learning Tagger

A symbolic tagger, written in C. It's no longer available from a canonical location, but you might find a version from the Wikipedia page or you could try a reimplementation such as fnTBL .

Original Xerox Tagger

A common lisp HMM tagger available by ftp .

Lingua-EN-Tagger

Perl POS tagger by Maciej Ceglowski and Aaron Coburn. Version 0.11. (A bigram HMM tagger.)

Free, but require registration

TATOO

The ISSCO tagger. HMM tagger. Need to register to download.

PoSTech Korean morphological analyzer and tagger

Online registration.

TnT - A Statistical Part-of-Speech Tagger

Trainable for various languages, comes with English and German pre-compiled models. Runs on Solaris and Linux.

Usable by email or on the web, but not distributed freely

Memory-based tagger

From ILK group, Catholic University Brabant (Jakub Zavrel/Walter Daelemans). Does Dutch, English, Spanish, Swedish, Slovene. Other MBL demos are also available.

Birmingham tagger

Accepts only plain ASCII email message contents. The tagset used is similar to the Brown/LOB/Penn set.

CLAWS tagger

The UCREL CLAWS tagger is available for trial use on the web. (It's limited to 300 words though -- this site is more of an advertisement for licensing the real thing -- available as software for Suns or as a paid service.) You can also find info on CLAWS tagsets , though that page doesn't seem to link to the C7 tagset .

The AMALGAM tagger

The AMALGAM Project also has various other useful resources, in particular a web guide to different tag sets in common use . The tagging is actually done by a (retrained) version of the Brill tagger (q.v.).

Xerox XRCE MLTT Part Of Speech Taggers

Tags any of 14 languages (European and Arabic), online on the web.

Portuguese taggers on the web: Projecto Natura and a QTAG adaptation .

Not free

Lingsoft

Lingsoft in Finland has (symbolic) analysis tools for many European languages. More information can be obtained by emailing [email protected] . There is an online demo .

Conexor

Conexor in Finland has demonstrations of EngCG-style taggers and parsers, for English, Swedish, and Spanish.

Xerox

Xerox has morphological analyzers and taggers for many languages. There are demos of some of their tools on the web. More information can be obtained by contacting Daniella Russo .

Infogistics

Infogistics , an Edinburgh spinoff has a tagging and NP/Verb group chunker available commercially, including an evaluation version.

No longer available

LT POS and LT TTT

The Edinburgh Language Technology Group tagger and text tokenizer (and sentence splitter were binary-only Solaris tools which no longer seem to be available.

NP chunking

Downloadable

YamCha

SVM-based NP-chunker, also usable for POS tagging, NER, etc. C/C++ open source. Won CoNLL 2000 shared task. (Less automatic than a specialized POS tagger for an end user.)

Mark Greenwood's Noun Phrase Chunker

A Java reimplementation of Ramshaw and Marcus (1995).

fnTBL

A fast and flexible implementation of Transformation-Based Learning in C++. Includes a POS tagger, but also NP chunking and general chunking models.

Generic sequence models

Downloadable

CRF++

Generic CRF-based model in C++. Open source. By the author of YamCha.

Carafe

Generic CRF-based sequence models in O-CaML. Open source. By Ben Wellner.

FreeLing

A large suite of language analyzers. Written in C++. Covers text preprocessing, morphology, NER, POS tagging, parsing.

Parsers

Information on available probabilistic parsers can be found on the FSNLP: probabilistic parsing links page.

Semantic Parsers

Downloadable

ASSERT

PropBank semantic roles (and opinions, etc.) by Sameer Pradhan.

Shalmaneser

FrameNet-based by Katrin Erk.

Tree Kernels in SVMlight by Alessandro Moschitti.

A general package, but it has particularly been used for SRL.

Named Entity Recognition

Downloadable

Stanford Named Entity Recognizer

A Java Conditional Random Field sequence model with trained models for Named Entity Recognition. Java. GPL. By Jenny Finkel.

LingPipe

Tools include statistical named-entity recognition, a heuristic sentence boundary detector, and a heuristic within-document coreference resolution engine. Java. GPL. By Bob Carpenter, Breck Baldwin and co.

YamCha

SVM-based NP-chunker, also usable for POS tagging, NER, etc. C/C++ open source. Won CoNLL 2000 shared task. (Less automatic than a specialized POS tagger for an end user.)

Coreference (Anaphora) Resolution

Downloadable

BART

A Beautiful Anaphora Resolution Toolkit. Java. By Yannick Versley and many others. Java. Apache with GPL components.

Guitar

Java. GPL.

Language modeling toolkits

Downloadable

IRSTLM Toolkit Compatible with SRILM, suitable for very large language models. LGPL. By Marcello Federico, Nicola Bertoldi et al.

CMU-Cambridge Statistical Language Modeling toolkit

Downloadable, but requires registration

The SRI Language Modeling toolkit

by Andreas Stolcke is another good system for building language models, freely available for research purposes.

Not yet classified

Lextools

A package of tools for creating weighted finite-state transducers (WFST) from high-level linguistic descriptions. Lextools binaries are available free for non-commercial use at: http://www.research.att.com/sw/tools/lextools/ . Supported platforms are: linux (i686), sgi (mips2) and sun4. Lextools is built on top of, and requires, the AT&T WFST toolkit (version 3.6), available free for non-commercial use from: http://www.research.att.com/sw/tools/fsm/

Friendly concordancing and text analysis tools

Wordsmith Tools (Mike Scott)

The thing to get if you are working in the Windows world.

Text summarization tools

A prototype Java Summarisation applet (System Quirk)

MEAD

A public domain portable multi-document summarization system. (Dragomir Radev and others.)

Other

Downloadable

Tilburg University's TiMBL

Tilburg's Memory Based Learner by Walter Daelemans et al. A general near-neighbour-based machine learning package, but optimized for statistical NLP applications.

Time Expression taggers

TIMEX2 standard taggers (site at Mitre).

NLTK

An open source Python package for NLP application development with tools such as tokenization, POS TAGGING and parsers by Ed Loper and Steven Bird.

Ted Pedersen's code

Ngram Statistics Package: Perl code that implements: Fisher's exact test, the likelihood ratio, Pearson's chi squared test, the Dice Coefficient, and Mutual Information; Duluth Senseval-2 word sense disambiguation systems; Senseval-1 data in Senseval-2 format; various other WSD datasets in Senseval formats, and semantic distances derived via WordNet.

ISIP tools

The main aim is a publically available speech recognition system (alpha release available), but along the way there are also toolkits for discrete HMMs and statistical decision trees, and for various aspects of signal processing.

Mem . A Perl implementation of Generalized and Improved Iterative Scaling

by Hugo WL ter Doest.

Automorphology

A system (for Windows) for automatically learning the morphological forms of words in a corpus by John Goldsmith.

Wordnet

Wordnet is available by ftp , compiled for a variety of machine types. For money, one can also get EuroWordNet for various European languages, an Italian/English/Spanish MultiWordNet and there's now a site for Global Wordnet . (See also Mappings between WordNet versions and Perl WordNet-Similarity module by Ted Pedersen, and WordNet Domains (coarse-grained sense topic classifications).)

Penn XTAG project

A wide-coverage tree-adjoining grammar written in a mixture of C and Common Lisp. Also includes a large coverage morphological analyzer. Now includes more tools such as TCL/Tk tree viewer.

Dan Melamed's Assorted Tools

A collection of various tools including a simulated annealling program, a post-processor for English stemming for the Penn XTAG morphology system, Good-Turing smoothing software, general text processing tools, text statistics tools and bitext geometry tools (mainly written in Perl 5).

MULTEXT

Constructing corpora and tools for processing multilingual corpora. Contact: Jean Veronis [email protected] . Some stuff including a multilingual text editor is downloadable. MULTEXT EAST has parallel versions of Orwell's 1984 available free (upon registration) for a number of Central European languages.

Naive Bayes algorithm

Software from the Rainbow/Libbow software package that implements several algorithms for text categorization, including naive Bayes, TF.IDF, and probabilistic algorithms. Accompanies Tom Mitchell's ML text.

HDDI

Text Data Mining API from Lehigh University.

Emdros: a text database engine for linguistic analysis and research

Chasen

Japanese morphological analyzer. Descendent of JUMAN.

Free, but require registration

Stuttgart's IMS Corpus Workbench (CWB)

A workbench for full-text retrieval from large corpora (with a query language and corpus indexing). Includes the Corpus Query Processor (CQP) and xkwic. Available free for research groups (currently only as Solaris 1/2 or Linux binaries), on signing a license agreement.

Gate

University of Sheffield's General Architecture for Text Engineering. Primarily an Information Extraction system.

MITRE's Alembic Workbench

A workbench for the development of tagged corpora. Includes a tagger based on Brill's TBL approach.

SNoW

SNoW is a learning program that can be used as a general purpose multi-class classifier and is specifically tailored for learning in the presence of a very large number of features. The learning architecture is a sparse network of linear units over a pre-defined or incrementally acquired feature space (Dan Roth).

Unsure

INTEX

a finite-state transducer analysis system for English, French, and Italian that runs under NextStep. Contact: Max Silberztein [email protected]

The PennTools page collects information on a variety of NLP systems, many of which are available externally.

Corpora

Large collections aimed at the NLP community

LDC (Linguistic Data Consortium) and its catalogue by year .

Email: [email protected] . Provides the largest range of corpora on CD-ROM. Cost ranges from cheap (e.g., ACL-DCI disk) to pricey. CDs can be purchased individually; institutions can become members and receive discounts on CDs. There's an LDC Online service for searches over the web (mainly intended for members, but there are samplers available).

European Language Resources Association and its catalogue .

Distribution agency is ELDA . Rapidly growing collection of materials in European languages.

ICAME (International Computer Archive of Modern English)

Sells various corpora (including Brown and London-Lund). Information on corpora on the web , by sending the message help to [email protected] , by ftp to nora.hd.uib.no . Also, manuals for these corpora.

Reuters @ NIST

Reuters corpora are now distributed by NIST.

TRACTOR

TELRI Research Archive of Computational Tools and Resource. Corpora, many multilingual, in European community languages. Small fee for joining in order to be able to get corpora (unless you have contributed corpora).

CLR (Consortium for Lexical Research)

Email: [email protected] . Focuses more on language processing tools and lexicons, but does have some corpora. As of Feb 1996, you can get most of their stuff by anonymous ftp to clr.nmsu.edu . Their catalog is available as a postscript file.

OTA (Oxford Text Archive)

Provides mainly literary texts. Has a bright new web site. Email: [email protected] . Most materials are available on the web or by anonymous ftp to ota.ox.ac.uk . Some require negotiations with the providers.

Leipzig Corpora Collection

Sentence collections in MySQL database for 17 mainly European languages.

BNC (British National Corpus)

A 100 million word corpus of British English. You can search it online from their simple web interface or via View , a much better interface by Mark Davies, and there is an index to genres by David Lee. And now, an XML edition .

European Corpus Initiative Multilingual Corpus I (ECI/MCI)

A 98 million word corpus, covering most of the major European languages, as well as Turkish, Japanese, Russian, Chinese, and Malay. Cheap. Need to sign a license agreement available at either the WWW site. Also available from the LDC.

Survey of English Usage

At the Department of English Language and Literature at University College London. Includes the British part of ICE , the International Corpus of English project. Now available tagged, and parsed for function. 83,419 sentences. Includes ICECUP, dedicated retrieval software. Also, Diachronic Corpus of Present-Day Spoken English (800,000 words, tagged and parsed, half from ICE-GB and half from London-Lund).

International Corpus of English (ICE)

Million word collections of English from various world Englishes: ICE-NZ, ICE-HK, ICE-East Africa, etc. Several of them are downloadable from this site.

Corpora held by Lancaster University

This link provides its own annotations.

The European Language Activity Network

Promises a uniform query language for accessing corpora in all EU languages -- but isn't quite there yet.

Talkbank .

Rich video and transcripts.

Particular languages

English

English language corpora available from the sites above are not repeated here.

Corpora by Geoffrey Sampson's team

The SUSANNE corpus and the CHRISTINE corpus (SUSANNE markup of a speech corpus).

Michigan Corpus of Academic Spoken English (MICASE) . 1.7 million words from 1997-2001.

Penn-Helsinki Parsed Corpus of Middle English

A syntactically annotated corpus of the Middle English prose samples in the Helsinki Corpus of Historical English, with additions. 1.3 million words. $200.

Corpus of Professional, Spoken American-English (CPSA)

2 million words from faculty and committee meetings and White House press conferences (50K work sample free on internet).

Lancaster Parsed Corpus

Dialogue Diversity Corpus (Bill Mann)

American National Corpus

Chinese

English language corpora available from the sites above are not repeated here.

The Lancaster Corpus of Mandarin Chinese (LCMC)

By Tony McEnery and Richard Xiao. Distinguished by being a balanced corpus, and freely available.

Multilingual

JRC-Acquis

A parallel corpus of EU documents across all member states. 8 million words or more in each of 20 languages.

EMILLE/CIIL

Monolingual written corpus data for 14 South Asian languages (Assamese, Bengali, Gujarati, Hindi, Kannada, Kashmiri, Malayalam, Marathi, Oriya, Punjabi, Sinhala, Tamil, Telegu and Urdu). Orthographically transcribed spoken data and parallel corpus data for five South Asian languages (Bengali, Gujarati, Hindi, Punjabi and Urdu). In addition, the parallel corpus contains the English originals from which the translations stored in the corpus were derived. All data in the corpus is CES and Unicode compliant. The EMILLE corpus totals some 94 million words. Downloadable.

OPUS

An open source parallel corpus, aligned, in many languages, based on free Linux etc. manuals.

World Health Organization Computer Assisted Translation page .

Also includes a good selection of links on Computer Assisted Translation. (See also the copyright page .)

Searchable Canadian Hansard French-English parallel texts (1986-1993)

From the Laboratoire de Recherche Appliquée en Linguistique Informatique, Universite de Montréal

European Union web server

Parallel text in all EU languages. (In particular try European legislation .)

TELRI CD-ROMs

Parallel and other text in central and eastern european languages.

Bosnian

The Oslo Corpus of Bosnian Texts .

Czech

Parallel Czech-English

Literature translations in Czech and English

Czech National Corpus project: SYN2000

100 million words of contemporary Czech.

French

Association des Bibliophiles Universels

Various French literary works.

American and French Research on the Treasury of the French Language (ARTFL)

150 million word corpus of various genres of French. You have to be a member to use it (but membership is fairly cheap).

German

COSMAS Corpus

Large (over a billion words!) online-searchable German and Austrian corpora. This is the publically available part of the 1.85 billion word Mannheimer Corpus Collection

NEGRA Corpus

Saarland University Syntactically Annotated Corpus of German Newspaper Texts. Available free of charge to academics. 20,000 sentences, tagged, and with syntactic structures. Free for academic use.

Russian

Russian National Corpus

150 million words, 5 million words POS-tagged, some in dependency treebank.

Library of Russian Internet Libraries

Various literary works.

Slovene

Slovene-English parallel corpus

1 M words, free to download + on-line concordances.

Coming soon: Slovene reference corpus of 100 M words

Spanish and Portuguese

TychoBrahe Parsed Corpus of Historical Portuguese

Over a million words of Portuguese from different historical periods, some of it morphologically analyzed/tagged. Free.

Information about Mark Davies' collection of (mainly historical Spanish and Portuguese .

It's not clear what their availability is.

The CUMBRE corpus. Contact Professor Aquilino Sánchez

The CRATER Spanish corpus

Morphosyntactically tagged telecommunication manuals) is available by ftp .

Corpus resources for Portuguese

In total about 70 million words, available free, from various sources (newswire, etc.)

Folha de S. Paulo newspaper

4 annual CDROMs with full text.

COMPARA

Portuguese-English parallel corpus. (In general, various resources at Linguateca site.

Swedish

Spraakdata , Department of Swedish, Göteborgs University.

Has various searcable part of speech tagged Swedish corpora (Parole, Bank of Swedish, etc.), and some material in Zimbabwean languages.

Treebanks

Name Language Size Availability Comments

Penn Treebank	US English	2 million + words	Available (distributed by LDC)	1 million WSJ, 1 million speech, surface syntax (1970s TG)
BLLIP WSJ corpus	US English	30 million words	Available (distributed by LDC)	WSJ newswire. Automatically parsed, not hand checked. Same structure as Penn Treebank, except for some additional coreference marking
ICE-GB	UK English	1 million words (83,394 sentences)	Available; c. 500 pounds	British part of ICE, the International Corpus of English project. Tagged and parsed for function. Half spoken material.
NEGRA Corpus	German	20,000 sentences	Available free of charge to academics on completion of license agreement.	Saarland University Syntactically Annotated Corpus of German Newspaper Texts. Tagged, and with syntactic structures.
TIGER corpus	German	700,000 words	Available free of charge for research purposes on completion of license agreement.	German newspaper text (Frankfurter Rundschau). Semi-automatically parsed. They also have a good treebank search tool, TIGERSearch .
Alpino Dependency Treebank	Dutch	150,000 words	Freely downloadable	Assorted subcorpora. By far the largest is the full cdbl (newspaper) part of the Eindhoven corpus.
The Prague Dependency Treebank 1.0	Czech	500,000 words	Free on completion of license agreement (available through LDC).	Analyzed at the levels of parts of speech, syntactic functions (and, in the future, semantic roles) level in a dependency framework. Text from newspapers and weekly magazines.
TUT: Turin University Treebank	Italian	2,400 sentences	Free download.	Morhpological analysis and dependency analysis. Penn Treebank translation. Civil law and newspaper texts.
Bulgarian Treebank	Bulgarian	n/a	POS-tagged texts and dependencies analyses are available (some are free on the web, others via a license agreement)	An under construction Bulgarian HPSG treebank.
Penn Chinese Treebank	Chinese	100,000 words	Available (LDC )	Based on Xinhua news articles. 1980s-style GB syntax.
Danish Dependency Treebank 1.0	Danish	100,000 words	Available free under the GPL.	Built on a portion of the Parole corpus.
Floresta Sintá(c)tica	Portuguese	168,000 words hand-corrected; 1,000,000 words automatically parsed	Hand corrected part is free web download; automatically parsed part available through email contact	Text from CETEMPúblico corpus . Phrase structure and dependency representations. Available in several formats, including Penn Treebank format.
Talbanken05	Swedish	300,000 words	Free download	Resurrects and modernizes an early treebank from the 1970s.

Verbmobil Tübingen : under construction treebanked corpus of German, English, and Japanese sentences from Verbmobil (appointment scheduling) data

Syntactic Spanish Database (SDB) University of Santago de Compostela. 160,000 clauses / 1.5 million words.

CKIP Chinese Treebank (Taiwan) . Based on Academia Sinica corpus. (There's also a 100 sentence Chinese treebank at U. Maryland.)

LDC Korean Treebank .

Dublin-Essex Treebank project

Deriving Linguistic Resources from Treebanks.

Treebanks

CSTBank : Cross-document Structure Theory: marking sentence functional relationships across related documents.

Resources for Word Sense Disambiguation

The Senseval web site

Has a comprehensive selection of resources for WSD, including a good list of WSD data resources , but not yet the new SEMCOR .

Ted Pedersen's code

Includes various WSD systems.

SenseClusters

Open source package for unsupervised discovery of word senses by clustering together instances of a word (or words) that are used in similar contexts in raw text, supporting a wide range of clustering techniques based on both context vectors and similarity matrices, and including links to SVDPACKC and CLUTO. Ted Pedersen and Amruta Purandare.

Evocation WordNet synset similarity judgments

Judgments on how similar the meanings of synsets are and how common they are in the BNC from Jordan Boyd-Graber.

Literature

There are now quite large collections of online literature, available in various languages (though the majority are in English, of course). Below are pointers to some of the main collections:

Entirely or mainly English

Alex: A Catalogue of Electronic Texts on the Internet

Seems to have one of the largest collection. Searching and browsing facilities through gopher menus. Many languages.

Wiretap Electronic Text Archive

Extensive and good quality. Still in the gopher age, though.

The On-line Books Page

The index here only covers books in English, but there are lots of links to other collections of material in all languages.

Project Gutenberg

The oldest and largest project to get out of copyright literature online, freely available. (Or see the mirror, Sailor's Project Gutenberg site .)

The Electronic Text Center of the University of Virginia

Large collection of SGML text, mainly in English, but also in other major languages.

Center for Electronic Texts in the Humanities

Princeton/Rutgers collaboration. They didn't have it together with their web site when I stopped by, but they may soon.

Oxford Electronic Text Library Editions

Available from Oxford University Press, 200 Madison Ave, NY, NY 10016 212-679-7300. The Complete Works of Jane Austen is $95.00, and is reviewed in Computers and the Humanities , 28:4-5 (Aug/Oct, 1994), 317-321.

Coreference annotated texts

From University of Woverhampton (R. Mitkov, C. Barbu et al.).

Acquisition data

CHILDES database .

Database of child language transcriptions in English and many other languages. Texts are also available by ftp . Certain usage requirements. Manuals and programs for accessing the data (the CLAN concordancer) are also available online. Now in Unicode XML.

SGML/XML

Robin Cover's SGML/XML Web Page

This is a wonderful compendium of information on SGML and XML, including information on the Text Encoding Initiative (TEI) . This document is also a guide to many text collections (ones usi

你可能感兴趣的:(C++,c,linux,Web,C#)

基于Anaconda环境开发IntelliJ IDEA实用JSON转Java实体插件七夜zippoe 后端 #Java java json intellij-idea
在软件开发中，将JSON数据转换为Java实体类是常见需求。借助Anaconda环境强大的包管理能力与IntelliJIDEA的插件开发体系，我们可以打造一款高效实用的JSON转Java实体插件，显著提升开发效率。下面将从需求分析、技术选型、开发实现到优化部署，全方位阐述这款插件的开发过程。需求分析：明确痛点与功能方向在日常开发中，开发者经常需要根据JSON数据结构手动创建对应的Java实体类，这
庙算兵棋推演AI开发初探（支线-AI平台注意及tips）超自然祈祷智能决策人工智能
总是停留在stage阶段一的问题输出回放数据，在显示中发现一动不动，发现stage字段一直是1部署阶段……解决方法：代码层面需要有type=333的行为告诉引擎部署完毕。pip卸载重装兵棋引擎这个我每次关机后都得重新来一遍，很讨厌（经过试验，此举会重新复制一个.engine_config到python包的目录）删除某文件确定发出了部署命令还没效果，看看你的用户根目录(root或者用户名)下有没有.
OpenLayers 选中移动要素 GIS之路 OpenLayers WebGIS microsoft 前端信息可视化
前言页面交互的复杂度体现系统使用的难易程度，在开发WebGIS系统过程中，总会涉及要素操作，如何设计才能使交互操作变得简洁呢？OpenLayers提供了一些成熟的交互控件可以做到。1.选中和移动控件Select和Translate分别是选中控件、移动控件，它们都在ol.interaction包下。Select控件用于选中矢量要素，被选中的要素会进行默认会进行高亮显示，为选中默认样式，也可以自定义设
我的世界进阶模组开发教程——机械动力的数据生成（1） lemon_sjdk 我的世界
机械动力注册元素的方式是依赖registrateAPI来实现注册的，这个API和之前说的GlitchCore库所用的注册方式高效多了，不管是开发效率还是可维护性，都比bop式注册好多了，因此学习第三篇和第四篇文章是重中之重代码解析：Create模组主类（Create.java）核心字段解析基础标识字段ID="create"：模组唯一标识符，用于资源定位（如create:gear）。NAME="Cr
我的世界1.20.1forge模组开发进阶教程——Geckolib动画实体（3） lemon_sjdk java 我的世界模组开发
注意：本章涉及大量的geckolib底层代码，补充讲解了上一节没讲的，如果看不懂请去学习JavaGeoEntity////Sourcecoderecreatedfroma.classfilebyIntelliJIDEA//(poweredbyFernFlowerdecompiler)//packagesoftware.bernie.geckolib.animatable;importjavax
我的世界1.20.1forge模组开发进阶教程——序列化（1） lemon_sjdk java 我的世界 mc forge模组开发序列化
mc的序列化在《Minecraft》（MC）中，序列化指将游戏数据（如方块、实体、玩家状态等）转换为可存储或传输的格式。这是游戏运行、存档保存和网络通信的关键技术。以下是Minecraft中常见的序列化方式及其用途：一、序列化在Minecraft中的作用存档数据持久化将玩家建筑、地图、物品栏等数据保存到硬盘（如.minecraft/saves中的区域文件）。网络传输服务器与客户端同步方块更新、实体
我的世界进阶模组开发教程——地形生成(1) lemon_sjdk 我的世界 forge模组开发进阶教程 java
找到mc的屎山代码，找到net.minecraft.world.level.levelgen包，我们来看看mc是如何完成地形生成的SurfaceRules代码结构与核心功能解析该代码是Minecraft世界生成模块中地表规则（SurfaceRules）的核心实现，用于控制地形表面的方块生成逻辑。以下从多角度进行拆解分析：一、顶层结构解析1.静态条件定义（ConditionSource）public
浅谈卷积神经网络(CNN) cyc&阿灿 cnn 人工智能神经网络
卷积神经网络(ConvolutionalNeuralNetworks,CNN)作为深度学习领域最具影响力的架构之一，已在计算机视觉、自然语言处理、医学影像分析等领域取得了革命性突破。本文将系统全面地剖析CNN的核心原理、关键组件、经典模型、数学基础、训练技巧以及最新进展，通过理论解析与代码实践相结合的方式，帮助读者深入掌握这一重要技术。一、CNN基础与核心思想1.1传统神经网络的局限性在处理图像等
大数据智能风控核心：模型 johnny233 读书笔记大数据
概述模型线性判别分析方法，SirRonaldFisher最早提出模型评分的概念。个人FICO模型信用分。巴塞尔委员会发布巴塞尔Ⅱ协议，推出内部评级法（InternalRatingBasedApproach，IRB）。IRB综合考虑客户评级和债项评级，通过违约概率(ProbabilityofDefault,PD)、违约损失率(LossGivenDefault,LGD)、违约风险暴露(Exposure
Go项目限流全攻略：超越中间件的全方位解决方案码农老gou golang 中间件开发语言
引言：限流在分布式系统中的重要性在当今高并发的互联网应用中，流量控制已成为保障系统稳定性的关键手段。一次突发的流量洪峰可能导致整个系统崩溃，造成不可估量的损失。作为Go开发者，我们常常会面临这样的面试问题：Go项目中如何实现限流？仅仅使用中间件就足够了吗？本文将深入探讨Go项目中的限流策略，分析中间件的局限性，并介绍超越中间件的全方位解决方案。一、常见限流算法解析1.令牌桶算法（TokenBuck
深入剖析 Linux 内核网络核心：sock.c 源码解析 109702008 编程 #C语言网络 linux 网络人工智能
作为Linux网络子系统的基石，sock.c承载着协议无关的核心功能。本文将深入分析其关键实现，揭示高性能网络通信背后的设计哲学。一、Socket生命周期管理1.1初始化与分配sock_init_data()是socket的初始化入口，负责设置核心回调函数和默认参数：voidsock_init_data(structsocket*sock,structsock*sk){sk->sk_state=T
随机森林详解：原理、优势与应用实践大千AI助手人工智能 Python #OTHER 随机森林算法机器学习决策树人工智能 DecisionTree 数据挖掘
本文由「大千AI助手」原创发布，专注用真话讲AI，回归技术本质。拒绝神话或妖魔化。搜索「大千AI助手」关注我，一起撕掉过度包装，学习真实的AI技术！随机森林介绍1.定义：随机森林是一种强大的、高度灵活的集成学习（EnsembleLearning）算法，主要用于分类和回归任务。它的核心思想是构建多棵决策树（DecisionTree），并将这些树的预测结果进行组合（例如，分类任务采用投票，回归任务采用
经济学神图：洛伦兹曲线大千AI助手人工智能 Python #OTHER 决策树人工智能 DecisionTree 算法洛伦兹曲线基尼
洛伦兹曲线（LorenzCurve）是衡量社会收入或财富分配不平等程度的经典可视化工具，由美国统计学家马克斯·洛伦兹（MaxOttoLorenz）于1905年提出。它不仅是理解基尼系数的核心基础，也是经济学、社会学中分析资源分配公平性的关键图表。本文由「大千AI助手」原创发布，专注用真话讲AI，回归技术本质。拒绝神话或妖魔化。搜索「大千AI助手」关注我，一起撕掉过度包装，学习真实的AI技术！往期文
为啥枚举天生线程安全？ chi_666 面试安全
枚举天生线程安全的特性，主要源于其在Java语言中的设计机制和类加载机制。以下是具体原因分析：一、枚举的本质：静态final的实例枚举在Java中本质上是一个继承了java.lang.Enum的特殊类，每个枚举常量在编译时会被转换为该类的静态final实例。例如：publicenumThreadSafeEnum{INSTANCE;//其他属性和方法}编译后等价于：publicfinalclassT
Modbus RTU 转 Profinet 网关接台安 N310 变频器与西门子plc通讯兴达易控工业以太网解决方案网络协议
ModbusRTU转Profinet网关接台安N310变频器与西门子plc通讯在工业自动化领域，设备之间的通信至关重要，它如同神经系统一般，连接着各个部分，确保系统的稳定运行。今天，我们就来深入探讨一下ModbusRTU转Profinet网关与台安N310变频器通讯的相关知识。ModbusRTU是一种广泛应用的工业通讯协议，以其简单、可靠等特点在众多工业场景中占据一席之地。它采用主从站架构，通过串
Date与LocalDate互转 chi_666 JAVA java
1、Date转LocalDateDatetoDay=newDate();LocalDatelocalDate=toDay.toInstant().atZone(ZoneId.systemDefault()).toLocalDate();2、LocalDate转DateLocalDatelocalDate=LocalDate.parse("2023-01-01",DateTimeFormatter.
【第二章:机器学习与神经网络概述】03.类算法理论与实践-(3)决策树分类器 IT古董人工智能课程机器学习算法神经网络
第二章:机器学习与神经网络概述第三部分：类算法理论与实践第三节：决策树分类器内容：信息增益、剪枝技术、过拟合与泛化能力。决策树是一种常用于分类和回归的树状结构模型，它通过一系列特征判断进行决策，有良好的可解释性。一、基本概念节点（Node）：表示特征判断条件边（Branch）：表示特征判断的结果路径叶子节点（Leaf）：表示分类结果二、划分准则：信息增益（InformationGain）信息增益衡
Z-library数字图书馆镜像网址入口及客户端/app (持续更新) 黄豆匿zlib 学习
Z-Library（简称z-lib，前身为BookFinder）是一个影子图书馆和开放获取文件分享计划，用户可在此网络下载期刊文章以及各种类型的书籍。截止2022年6月12日，该网站共收录了10,456,034本书和84,837,646篇文章。zlibrary电脑客户端/安卓appzlibrary（windows/mac/安卓/ipad）安装包下载：夸克网盘分享（随时失效，先保存）无需魔法正常使用
Java 期末复习（四）四谎真好看 java eclipse
1.创建一个标识有“关闭”按钮的语句是（）A.TextFieldb=newTextField(“关闭”);B.Lableb=newLable(“关闭”);C.Checkboxb=newCheckbox(“关闭”);D.Buttonb=newButton(“关闭”);解：①根据英语单词的意思来选择就行，Button类是专用于创建可点击的按钮控件。②TextField是输入框的意思，Lable是只读文
函数的进阶小盐巴小严 web前后端开发学习笔记 javascript 前端 es6
JavaScript函数概念构成函数主体的JavaScript代码在定义之时并不会执行，只有在调用函数时，函数才会执行。调用JavaScript函数的方法：作为函数作为方法作为构造函数通过函数的call（）和apply（）间接调用函数属性length属性在函数体例，arguments.length表示传入函数的实参的个数函数本身的length属性是只读的，代表函数声明的实际参数的数量functio
我的世界模组开发进阶教程——机械动力的数据生成（2） lemon_sjdk 我的世界模组开发 java
==这篇文字继续来看看机械动力的数据生成==Create源码AssetLookupAssetLookup是Minecraft模组开发中用于简化数据生成的工具类，专注于自动处理方块（Block）和物品（Item）的模型（Model）文件路径生成与状态映射。其核心功能是根据规则动态构造资源路径，并适配不同状态（如供电状态、指示器数值）的模型。以下从两个维度详细解析：一、String...语法：Java
基于Redis分布式的限流 chi_666 redis 分布式数据库
以下是基于Redis实现分布式限流的Java解决方案，包含多种限流算法和完整实现代码：一、限流算法选择与实现1.固定窗口算法（SimpleRateLimiter）publicclassRedisFixedWindowRateLimiter{privatefinalStringRedisTemplateredisTemplate;privatefinalStringscript="localcurr
【目标检测】YOLOv13：超图增强的实时目标检测新标杆，值得收藏。 Carl_奕然机器视觉与目标检测目标检测 YOLO 人工智能
一文掌握YOLOv13最新特性1、引言2、Yolov13详细讲解2.1发布时间与背景2.2相对于YOLOv12的核心提升2.2.1精度显著提升2.2.2轻量化与效率优化2.2.3高阶语义建模能力2.3架构设计与核心创新2.3.1超图自适应关联增强（HyperACE）2.3.2全流程聚合-分发（FullPAD）2.3.3轻量化模块设计2.4性能对比2.4代码示例2.4.1环境配置2.4.2训练代码2
Nginx快速上手浪裡遊 nginx 运维前端后端
什么是nginxNginx是一款开源的高性能HTTP和反向代理服务器，同时也提供了IMAP/POP3/SMTP代理功能。它由俄罗斯程序员IgorSysoev于2004年首次发布，最初设计目的是为了解决C10k问题，即如何让单台服务器同时处理1万个并发连接的问题。功能和作用Nginx主要的功能和作用包括但不限于以下几点：Web服务器：Nginx可以作为一个轻量级的Web服务器来处理静态文件、索引文件
Python爬虫实战：研究Bleach库相关技术 ylfhpy 爬虫项目实战 python 爬虫 php 开发语言 Bleach
1.引言在大数据时代，网络内容采集已成为信息获取的重要手段。Python凭借其丰富的爬虫库（如Requests、Scrapy）和灵活的数据处理能力，成为网页爬虫开发的首选语言。然而，从互联网获取的内容往往包含恶意脚本、不安全标签等安全隐患，直接使用可能导致XSS(跨站脚本攻击)、数据泄露等风险。Bleach作为专业的HTML净化库，通过白名单机制提供了可靠的内容安全过滤方案。本文将结合实际案例，详
Python爬虫实战：研究untangle库相关技术 ylfhpy 爬虫项目实战 python 爬虫 php 开发语言 untangle
1.引言在大数据时代，网络数据已成为重要的信息资源。XML和HTML作为互联网上最常用的数据表示格式，广泛应用于API接口、网站结构和数据交换等场景。Python凭借其丰富的爬虫库（如Requests、Scrapy）和灵活的数据处理能力，成为网络数据采集的首选语言。然而，从复杂的XML/HTML文档中提取结构化数据仍然面临诸多挑战，如文档结构多样性、动态内容渲染和数据格式转换等问题。Untangl
第 3 章：神经网络如何学习鱼摆摆拜拜神经网络学习人工智能
第3章：神经网络如何学习在第二章中，我们详细了解了神经网络的静态结构：由神经元组成的层，以及连接它们的权重和偏置。现在，我们将进入整个教程最核心的部分：神经网络是如何从数据中"学习"的？这个学习过程是一个动态的、不断调整自身参数以求更佳预测的过程。我们将通过四个关键概念来揭示这个秘密：前向传播(ForwardPropagation)：数据如何通过网络产生一个预测？损失函数(LossFunction
【二】19.关于LCD和LTDC 我滴Yang #STM32MP157驱动入门 fpga开发
前言：。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。。1.LCD简介：（1）什么是LCD:全称LiquidCrystalDisplay,其构造是在两片平行得玻璃基板中放置液晶盒，下基板玻璃上设置TFT（薄膜晶体管），上基板玻璃上设置彩色滤光片，通过TFT上的信号与电压改变来控制液晶分子的转动方向，从而达到控制每个像素点偏振光出射与否而达到显示目的。（2）
android launcher3,Android Launcher3 基本功能分析众卡之友 android launcher3
AndroidLauncher3基本功能分析1,界面的布局,从上往下分别为:DeleteDropTarget(应用卸载区域,它是一个DropTarget)Workspace(页面容器,一个页面是一个CellLayout)PageIndicator(指示器,指示workspace当前位于第几个页面)Hotseat(底部图标区域)2,Launcher桌面图标的加载:LauncherApplicatio
AI算力综述和资料整理木鱼时刻人工智能
目录总体介绍计算精度传输协议GPU池化资源调度CUDA技术GPU硬件参考链接总体介绍AI算力是人工智能系统的核心基础设施，涵盖了从计算精度、传输协议到硬件架构的完整技术栈。计算精度混合精度训练原生满血版DeepSeek671B是FP8精度。FP16在训练计算力占比有80-90%，FP32占比10%-20%。大模型训练中通常会用到FP16（半精度浮点数），但并不是只使用FP16，而是采用**混合精度
遍历dom 并且存储（将每一层的DOM元素存在数组中）换个号韩国红果果 JavaScript html
数组从0开始！！ var a=[],i=0; for(var j=0;j<30;j++){ a[j]=[];//数组里套数组，且第i层存储在第a[i]中 } function walkDOM(n){ do{ if(n.nodeType!==3)//筛选去除#text类型 a[i].push(n); //con
Android+Jquery Mobile学习系列(9)-总结和代码分享白糖_ JQuery Mobile
目录导航经过一个多月的边学习边练手，学会了Android基于Web开发的毛皮，其实开发过程中用Android原生API不是很多，更多的是HTML/Javascript/Css。个人觉得基于WebView的Jquery Mobile开发有以下优点： 1、对于刚从Java Web转型过来的同学非常适合，只要懂得HTML开发就可以上手做事。 2、jquerym
impala参考资料 dayutianfei impala
记录一些有用的Impala资料 1. 入门资料 >>官网翻译： http://my.oschina.net/weiqingbin/blog?catalog=423691 2. 实用进阶 >>代码&架构分析： Impala/Hive现状分析与前景展望：http
JAVA 静态变量与非静态变量初始化顺序之新解周凡杨 java 静态非静态顺序
今天和同事争论一问题，关于静态变量与非静态变量的初始化顺序，谁先谁后，最终想整理出来！测试代码： import java.util.Map; public class T { public static T t = new T(); private Map map = new HashMap(); public T(){ System.out.println(&quo
跳出iframe返回外层页面 g21121 iframe
在web开发过程中难免要用到iframe，但当连接超时或跳转到公共页面时就会出现超时页面显示在iframe中，这时我们就需要跳出这个iframe到达一个公共页面去。首先跳转到一个中间页，这个页面用于判断是否在iframe中，在页面加载的过程中调用如下代码： <script type="text/javascript"> //<!-- function
JAVA多线程监听JMS、MQ队列 510888780 java多线程
背景：消息队列中有非常多的消息需要处理，并且监听器onMessage（）方法中的业务逻辑也相对比较复杂，为了加快队列消息的读取、处理速度。可以通过加快读取速度和加快处理速度来考虑。因此从这两个方面都使用多线程来处理。对于消息处理的业务处理逻辑用线程池来做。对于加快消息监听读取速度可以使用1.使用多个监听器监听一个队列；2.使用一个监听器开启多线程监听。对于上面提到的方法2使用一个监听器开启多线
第一个SpringMvc例子布衣凌宇 spring mvc
第一步：导入需要的包；第二步：配置web.xml文件 <?xml version="1.0" encoding="UTF-8"?> <web-app version="2.5" xmlns="http://java.sun.com/xml/ns/javaee" xmlns:xsi=
我的spring学习笔记15-容器扩展点之PropertyOverrideConfigurer aijuans Spring3
PropertyOverrideConfigurer类似于PropertyPlaceholderConfigurer，但是与后者相比，前者对于bean属性可以有缺省值或者根本没有值。也就是说如果properties文件中没有某个bean属性的内容，那么将使用上下文（配置的xml文件）中相应定义的值。如果properties文件中有bean属性的内容，那么就用properties文件中的值来代替上下
通过XSD验证XML antlove xml schema xsd validation SchemaFactory
1. XmlValidation.java package xml.validation; import java.io.InputStream; import javax.xml.XMLConstants; import javax.xml.transform.stream.StreamSource; import javax.xml.validation.Schem
文本流与字符集百合不是茶 PrintWrite()的使用字符集名字别名获取
文本数据的输入输出; 输入;数据流,缓冲流输出;介绍向文本打印格式化的输出PrintWrite(); package 文本流; import java.io.FileNotFound
ibatis模糊查询sqlmap-mapping-**.xml配置 bijian1013 ibatis
正常我们写ibatis的sqlmap-mapping-*.xml文件时，传入的参数都用##标识，如下所示： <resultMap id="personInfo" class="com.bijian.study.dto.PersonDTO"> <res
java jvm常用命令工具——jdb命令(The Java Debugger) bijian1013 java jvm jdb
用来对core文件和正在运行的Java进程进行实时地调试，里面包含了丰富的命令帮助您进行调试，它的功能和Sun studio里面所带的dbx非常相似，但 jdb是专门用来针对Java应用程序的。现在应该说日常的开发中很少用到JDB了，因为现在的IDE已经帮我们封装好了，如使用ECLI
【Spring框架二】Spring常用注解之Component、Repository、Service和Controller注解 bit1129 controller
在Spring常用注解第一步部分【Spring框架一】Spring常用注解之Autowired和Resource注解（http://bit1129.iteye.com/blog/2114084）中介绍了Autowired和Resource两个注解的功能，它们用于将依赖根据名称或者类型进行自动的注入，这简化了在XML中，依赖注入部分的XML的编写，但是UserDao和UserService两个bea
cxf wsdl2java生成代码super出错,构造函数不匹配 bitray super
由于过去对于soap协议的cxf接触的不是很多,所以遇到了也是迷糊了一会.后来经过查找资料才得以解决. 初始原因一般是由于jaxws2.2规范和jdk6及以上不兼容导致的.所以要强制降为jaxws2.1进行编译生成.我们需要少量的修改: 我们原来的代码 wsdl2java com.test.xxx -client http://..... 修改后的代
动态页面正文部分中文乱码排障一例 ronin47
公司网站一部分动态页面，早先使用apache+resin的架构运行，考虑到高并发访问下的响应性能问题，在前不久逐步开始用nginx替换掉了apache。不过随后发现了一个问题，随意进入某一有分页的网页，第一页是正常的（因为静态化过了）；点“下一页”，出来的页面两边正常，中间部分的标题、关键字等也正常，唯独每个标题下的正文无法正常显示。因为有做过系统调整，所以第一反应就是新上
java-54- 调整数组顺序使奇数位于偶数前面 bylijinnan java
import java.util.Arrays; import java.util.Random; import ljn.help.Helper; public class OddBeforeEven { /** * Q 54 调整数组顺序使奇数位于偶数前面 * 输入一个整数数组，调整数组中数字的顺序，使得所有奇数位于数组的前半部分，所有偶数位于数组的后半
从100PV到1亿级PV网站架构演变 cfyme 网站架构
一个网站就像一个人，存在一个从小到大的过程。养一个网站和养一个人一样，不同时期需要不同的方法，不同的方法下有共同的原则。本文结合我自已14年网站人的经历记录一些架构演变中的体会。 1：积累是必不可少的架构师不是一天练成的。 1999年，我作了一个个人主页，在学校内的虚拟空间，参加了一次主页大赛，几个DREAMWEAVER的页面，几个TABLE作布局，一个DB连接，几行PHP的代码嵌入在HTM
[宇宙时代]宇宙时代的GIS是什么？ comsci Gis
我们都知道一个事实，在行星内部的时候，因为地理信息的坐标都是相对固定的，所以我们获取一组GIS数据之后，就可以存储到硬盘中，长久使用。。。但是，请注意，这种经验在宇宙时代是不能够被继续使用的宇宙是一个高维时空
详解create database命令 czmmiao database
完整命令 CREATE DATABASE mynewdb USER SYS IDENTIFIED BY sys_password USER SYSTEM IDENTIFIED BY system_password LOGFILE GROUP 1 ('/u01/logs/my/redo01a.log','/u02/logs/m
几句不中听却不得不认可的话 datageek
1、人丑就该多读书。 2、你不快乐是因为：你可以像猪一样懒，却无法像只猪一样懒得心安理得。 3、如果你太在意别人的看法，那么你的生活将变成一件裤衩，别人放什么屁，你都得接着。 4、你的问题主要在于：读书不多而买书太多，读书太少又特爱思考，还他妈话痨。 5、与禽兽搏斗的三种结局：(1)、赢了，比禽兽还禽兽。(2)、输了，禽兽不如。(3)、平了，跟禽兽没两样。结论：选择正确的对手很重要。 6
1 14:00 PHP中的“syntax error, unexpected T_PAAMAYIM_NEKUDOTAYIM”错误 dcj3sjt126com PHP
原文地址：http://www.kafka0102.com/2010/08/281.html 因为需要，今天晚些在本机使用PHP做些测试，PHP脚本依赖了一堆我也不清楚做什么用的库。结果一跑起来，就报出类似下面的错误：“Parse error: syntax error, unexpected T_PAAMAYIM_NEKUDOTAYIM in /home/kafka/test/
xcode6 Auto layout and size classes dcj3sjt126com ios
官方GUI https://developer.apple.com/library/ios/documentation/UserExperience/Conceptual/AutolayoutPG/Introduction/Introduction.html iOS中使用自动布局（一） http://www.cocoachina.com/ind
通过PreparedStatement批量执行sql语句【sql语句相同，值不同】梦见x光 sql 事务批量执行
比如说：我有一个List需要添加到数据库中，那么我该如何通过PreparedStatement来操作呢？ public void addCustomerByCommit(Connection conn , List<Customer> customerList) { String sql = "inseret into customer(id
程序员必知必会----linux常用命令之十【系统相关】 hanqunfeng Linux常用命令
一.linux快捷键 Ctrl+C : 终止当前命令 Ctrl+S : 暂停屏幕输出 Ctrl+Q : 恢复屏幕输出 Ctrl+U : 删除当前行光标前的所有字符 Ctrl+Z : 挂起当前正在执行的进程 Ctrl+L : 清除终端屏幕，相当于clear 二.终端命令 clear : 清除终端屏幕 reset : 重置视窗，当屏幕编码混乱时使用 time com
NGINX IXHONG nginx
pcre 编译安装 nginx conf/vhost/test.conf upstream admin { server 127.0.0.1:8080; } server { listen 80; &
设计模式--工厂模式 kerryg 设计模式
工厂方式模式分为三种： 1、普通工厂模式：建立一个工厂类，对实现了同一个接口的一些类进行实例的创建。 2、多个工厂方法的模式：就是对普通工厂方法模式的改进，在普通工厂方法模式中，如果传递的字符串出错，则不能正确创建对象，而多个工厂方法模式就是提供多个工厂方法，分别创建对象。 3、静态工厂方法模式：就是将上面的多个工厂方法模式里的方法置为静态，
Spring InitializingBean/init-method和DisposableBean/destroy-method mx_xiehd java spring bean xml
1.initializingBean/init-method 实现org.springframework.beans.factory.InitializingBean接口允许一个bean在它的所有必须属性被BeanFactory设置后，来执行初始化的工作，InitialzingBean仅仅指定了一个方法。通常InitializingBean接口的使用是能够被避免的，（不鼓励使用，因为没有必要
解决Centos下vim粘贴内容格式混乱问题 qindongliang1922 centos vim
有时候，我们在向vim打开的一个xml，或者任意文件中，拷贝粘贴的代码时，格式莫名其毛的就混乱了，然后自己一个个再重新，把格式排列好，非常耗时，而且很不爽，那么有没有办法避免呢？答案是肯定的，设置下缩进格式就可以了，非常简单：在用户的根目录下直接vi ~/.vimrc文件然后将set pastetoggle=<F9> 写入这个文件中，保存退出，重新登录，
netty大并发请求问题 tianzhihehe netty
多线程并发使用同一个channel java.nio.BufferOverflowException: null at java.nio.HeapByteBuffer.put(HeapByteBuffer.java:183) ~[na:1.7.0_60-ea] at java.nio.ByteBuffer.put(ByteBuffer.java:832) ~[na:1.7.0_60-ea]
Hadoop NameNode单点问题解决方案之一 AvatarNode wyz2009107220 NameNode
我们遇到的情况 Hadoop NameNode存在单点问题。这个问题会影响分布式平台24*7运行。先说说我们的情况吧。我们的团队负责管理一个1200节点的集群(总大小12PB)，目前是运行版本为Hadoop 0.20，transaction logs写入一个共享的NFS filer(注：NetApp NFS Filer)。经常遇到需要中断服务的问题是给hadoop打补丁。 DataNod

NLP Resources

Contents

Instructions

Freely downloadable

Free, but getting them requires hassle

Freely downloadable

Free, but require registration

Usable by email or on the web, but not distributed freely

Not free

No longer available

Downloadable

Downloadable

Downloadable

Downloadable

Downloadable

Downloadable, but requires registration

Not yet classified

Downloadable

Free, but require registration

Unsure

English

Chinese

Multilingual

Bosnian

Czech

French

German

Russian

Slovene

Spanish and Portuguese

Swedish

Entirely or mainly English

你可能感兴趣的:(C++,c,linux,Web,C#)