Everyone is talking about Search technology at the moment - how Google has risen to the top of the heap, how Yahoo is trying to regain its former number one spot, and how Microsoft is playing catch-up. But for the average ASP.NET developer, those sites are really about helping people find you on the Web. Once they've visited your website, how do you provide a cheap, fast, customised search to maximise the usability of your content?
There are a number of options available:
| Search 'Technology' | Advantages | Disadvantages |
|---|---|---|
| Microsoft Index Server | Comes with Windows 2000, XP, 2003 | File-system indexing only; doesn't spider website links or database-driven pages (there are tricks around this) |
| Other server-side software, e.g. DTSearch, mnoGoSearch | Shop around for the features you need, including multiple language support | Cost; may be difficult to set up/customise |
| 'Hosted' services, e.g. Google, PicoSearch | Often free or low cost; easy to set up | Lack of control; often template-driven or host ads which may distract your users |
Most website operators will find at least one of these products can meet their needs, but it will always be a trade-off between cost, features and flexibility.
This article describes a simple, free, easy-to-install Search feature. The goal is to build a search tool that can be installed simply by placing three files on a website, and that could easily be extended to rival the features of the products listed above!
There are two main parts to a Search engine: building the Catalog (a collection of Words, where each Word contains a reference to every File it appears in), and searching that Catalog to return matching Files.
The first step was to think about how to implement the catalog objects. A Binary Search Tree seemed like a good idea (see the great articles on MSDN), but in order to keep things simple Hashtables will do the job. We can always refactor the code to use a more sophisticated Collection class later on.
The simple object model looks like this:
You can see that some assumptions have been made in this model.
Firstly, we store limited information about each File - just enough to produce a familiar search results page: its Url, Title, Description, CrawledDate and Size.
The Word object is even simpler - its properties are the Text of the word itself and a private collection of the Files it appears in.
Lastly, the Catalog itself has a single property - the collection of Words called index. It also has two methods: one to add Words to the catalog, and another to search the catalog and get back a list of files (the search results).
There are two important assumptions which aren't immediately apparent from the model - there should only be ONE File object for each physical file, and ONE Word object for each word (so there will only be one Word object that represents the word "microsoft" for example), although that word will appear in many of the files we search. Why this is so, and how we manage it is covered in the catalog build process.
| File | Contents |
|---|---|
| Searcharoo.cs | Implementation of the object model; compiled into both ASPX pages |
| SearcharooCrawler.aspx | `<%@ Page Language="C#" Src="Searcharoo.cs" %>` Code to build the catalog using the common classes, and place the resulting Catalog object in the ASP.NET Application Cache |
| Searcharoo.aspx | `<%@ Page Language="C#" Src="Searcharoo.cs" %>` Retrieves the Catalog object from the Cache and allows searching via an HTML form |
This file contains the C# code that defines the object model for our catalog, including the methods to add and search Words. These objects are used by both the crawler and the search page.
```csharp
namespace Searcharoo.Net
{
    public class Catalog
    {
        private System.Collections.Hashtable index;

        public Catalog() { }
        public bool Add(string word, File infile, int position) { }
        public Hashtable Search(string searchWord) { }
    }

    public class Word
    {
        public string Text;
        private System.Collections.Hashtable fileCollection;

        public Word(string text, File infile, int position) { }
        public void Add(File infile, int position) { }
        public Hashtable InFiles() { }
    }

    public class File
    {
        public string Url;
        public string Title;
        public string Description;
        public DateTime CrawledDate;
        public long Size;

        public File(string url, string title, string description, DateTime datecrawl, long length) { }
    }
}
```
Listing 1 - Overview of the object model (interfaces only - implementation code has been removed)
Now that we have a model and structure, what next? In the interests of 'getting something working', the first build task is to simulate how our 'build' process is going to find the files we want to search. There are two ways we can look for files: following the links between pages (spidering), or traversing folders in the file system (crawling).
The big search engines - Yahoo, Google, MSN - all spider the internet to build their search catalogs. However following links to find documents requires us to write an HTML parser that can find and interpret the links, and then follow them! That's a little too much for one article, so we're going to start with some simple file crawling code to populate our catalog. The great thing about our object model is that it doesn't really care if it is populated by Spidering or Crawling - it will work for either method, only the code that populates it will change.
Here is a simple method that we can use to locate the files we want to search by traversing the file system:
```csharp
private void CrawlPath(string root, string path)
{
    System.IO.DirectoryInfo m_dir = new System.IO.DirectoryInfo(path);

    // ### Look for matching files to summarise what will be catalogued ###
    foreach (System.IO.FileInfo f in m_dir.GetFiles(m_filter))
    {
        Response.Write(path.Substring(root.Length) + @"/" + f.Name + "<br>");
    } // foreach file

    // ### Recurse into subdirectories ###
    foreach (System.IO.DirectoryInfo d in m_dir.GetDirectories())
    {
        CrawlPath(root, path + @"/" + d.Name);
    } // foreach directory
}
```
Listing 2 - Crawling the filesystem
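To kick the crawl off, the method is called with the starting folder passed as both the root and path arguments - something along these lines (m_path is an assumed field here, standing in for wherever the physical start path is configured):

```csharp
// Start crawling from the configured root folder.
// m_path is assumed to hold the physical folder where the crawl should begin.
CrawlPath(m_path, m_path);
```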
Screenshot 1 - To test the file crawler we downloaded the HTML from the CIA World FactBook
Now that we are confident we can access the files, we need to process each one in order to populate the catalog. There are three coding tasks to do: (a) open each file and read its contents, (b) extract the title, META description and size, and (c) strip out the HTML tags and split the remaining text into an array of words.
Getting (a) working was easy:
```csharp
System.IO.DirectoryInfo m_dir = new System.IO.DirectoryInfo(path);

// Look for matching files
foreach (System.IO.FileInfo f in m_dir.GetFiles(m_filter))
{
    Response.Write(DateTime.Now.ToString("t") + " " + path.Substring(root.Length) + @"/" + f.Name);
    Response.Flush();

    fileurl = m_url + path.Substring(root.Length).Replace(@"\", "/") + "/" + f.Name;

    System.IO.StreamReader reader = System.IO.File.OpenText(path + @"/" + f.Name);
    fileContents = reader.ReadToEnd();
    reader.Close();

    // now use the fileContents to build the catalog...
```
Listing 3 - Opening the files
A quick Google helped find a solution to (b).
```csharp
// ### Grab the TITLE (regex pattern assumed - the original was lost in formatting) ###
Match TitleMatch = Regex.Match(fileContents, @"<title>([^<]*)</title>",
                               RegexOptions.IgnoreCase | RegexOptions.Multiline);
filetitle = TitleMatch.Groups[1].Value;

// ### Parse out META data (pattern assumed, as above) ###
Match DescriptionMatch = Regex.Match(fileContents, @"<meta name=""description"" content=""([^""]*)""",
                                     RegexOptions.IgnoreCase | RegexOptions.Multiline);
filedesc = DescriptionMatch.Groups[1].Value;

// ### Get the file SIZE ###
filesize = fileContents.Length;

// ### Now remove HTML, convert to array, clean up words and index them ###
string wordsOnly = stripHtml(fileContents);

// ### If no META DESC, grab start of file text ###
if (null == filedesc || String.Empty == filedesc)
{
    if (wordsOnly.Length > 350)
        filedesc = wordsOnly.Substring(0, 350);
    else if (wordsOnly.Length > 100)
        filedesc = wordsOnly.Substring(0, 100);
    else
        filedesc = wordsOnly;   // file is only short!
}
```
Listing 4 - Massage the file contents
And finally (c) involved a very simple Regular Expression or two, and suddenly we have the document as an Array of words, ready for processing!
```csharp
protected string stripHtml(string strHtml)
{
    // Strips the HTML tags from strHtml
    System.Text.RegularExpressions.Regex objRegExp =
        new System.Text.RegularExpressions.Regex("<(.|\n)+?>");

    // Replace all tags with a space, otherwise words either side
    // of a tag might be concatenated
    string strOutput = objRegExp.Replace(strHtml, " ");

    // Replace all &lt; and &gt; entities with < and >
    strOutput = strOutput.Replace("&lt;", "<");
    strOutput = strOutput.Replace("&gt;", ">");

    return strOutput;
}
```
Listing 5 - Remove HTML
```csharp
Regex r = new Regex(@"\s+");                  // matches any run of whitespace
wordsOnly = r.Replace(wordsOnly, " ");        // compress all whitespace to one space
string[] wordsOnlyA = wordsOnly.Split(' ');   // results in an array of words
```
Listing 6 - Remove unnecessary whitespace
To recap - we have the code that, given a starting directory, will crawl through it (and its subdirectories), opening each HTML file, removing the HTML tags and putting the words into an array of strings.
Now that we can parse each document into words, we can populate our Catalog!
All the hard work has been done in parsing the file - building the catalog is as simple as adding objects to the collections in our object model:
```csharp
// ### Loop through the words in the file ###
int i = 0;          // position of the word in the file (starts at zero)
string key = "";    // the 'word' itself

// Now loop through the words and add them to the catalog
foreach (string word in wordsOnlyA)
{
    key = word.Trim(' ', '?', '\"', ',', '\'', ';', ':', '.', '(', ')').ToLower();
    m_catalog.Add(key, infile, i);
    i++;
} // foreach word in the file
```
Listing 7 - Add words to the catalog
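Recall the assumption from the object model that there is only ONE Word object for each distinct word. A sketch of how Catalog.Add might enforce that rule - an illustration of the idea rather than the article's exact implementation - looks like this:

```csharp
public bool Add(string word, File infile, int position)
{
    if (index.ContainsKey(word))
    {
        // this word is already in the catalog - record another occurrence against it
        Word theWord = (Word)index[word];
        theWord.Add(infile, position);
    }
    else
    {
        // first time we've seen this word - create the single Word object for it
        index.Add(word, new Word(word, infile, position));
    }
    return true;
}
```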
As each file is processed a line is written to the browser to indicate the catalog build progress, showing the File.Url and the number of words parsed.
Screenshot 2 - Processing the CIA World FactBook - it contains 40,056 words according to our code.
After the last file is processed, the Catalog object is added to the Application Cache object, and is ready for searching!
The finished Catalog now contains a collection of Words, and each Word object has a collection of the Files it was found in. The Search method of the Catalog takes a single word as the search parameter, and returns the Hashtable of File objects where that Word was found. The returned Hashtable keys are File objects and the values are the rank (i.e. the number of times the word appears in that file).
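How does each Word keep track of those counts? One way - shown here as an illustrative sketch rather than the article's exact code - is for Word to keep a Hashtable keyed by File and increment the count each time the word is added:

```csharp
public void Add(File infile, int position)
{
    if (fileCollection.ContainsKey(infile))
    {
        // the word has been seen in this file before - bump the occurrence count
        fileCollection[infile] = (int)fileCollection[infile] + 1;
    }
    else
    {
        // first occurrence of the word in this file
        fileCollection.Add(infile, 1);
    }
}

public Hashtable InFiles()
{
    // keys are File objects, values are the occurrence counts (the 'rank')
    return fileCollection;
}
```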
With the catalog built, searching it is straightforward:
```csharp
public Hashtable Search(string searchWord)
{
    // apply the same 'trim' as when we're building the catalog
    searchWord = searchWord.Trim('?', '\"', ',', '\'', ';', ':', '.', '(', ')').ToLower();

    Hashtable retval = null;
    if (index.ContainsKey(searchWord))   // ContainsKey does all the work!!!
    {
        Word thematch = (Word)index[searchWord];
        retval = thematch.InFiles();     // return the collection of File objects
    }
    return retval;
}
```
Listing 8 - the Search method
The key point is how simple the Search method can be, because of the amount of work performed during the cataloging.
Obviously there are a number of enhancements we could make here, starting with multiple word searches (finding the intersection of the File Hashtables for each Word), implementing Boolean searches, fuzzy matches (or matching word stems/roots)... the list is (almost) endless, but beyond the scope of this article.
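To give a flavour of the first of those enhancements, a two-word search could intersect the result Hashtables returned for each Word, keeping only the Files that contain both and summing their ranks. A rough sketch (not part of the article's code) might be:

```csharp
// Keep only the files found in both result sets, adding their ranks together.
// 'first' and 'second' are the Hashtables returned by Catalog.Search for each word.
private Hashtable Intersect(Hashtable first, Hashtable second)
{
    Hashtable combined = new Hashtable();
    foreach (DictionaryEntry entry in first)
    {
        if (second.ContainsKey(entry.Key))
        {
            combined.Add(entry.Key, (int)entry.Value + (int)second[entry.Key]);
        }
    }
    return combined;
}
```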
Searcharoo.aspx initially displays an HTML form to allow the user to enter the search term.
Screenshot 3 - Enter the search term
When this form is submitted, we look for the Word in the index Hashtable using the ContainsKey() method, relying on the efficiency of the .NET Framework's hash-based lookup of an object via its HashCode. The Hashtable.ContainsKey() method is actually doing the search for us.
The Catalog.Search() method returns a Hashtable containing the matching File objects, so all we have to do is display them in HTML format!
The display process has been broken into a few steps below:
Firstly, we call the Search method to get the result Hashtable. If the result is null skip to Listing 13 because there were no matches, otherwise we have a little more work to do...
```csharp
// Do the search
Hashtable searchResultsArray = m_catalog.Search(searchterm);

// Format the results
if (null != searchResultsArray)
{
```
Listing 9 - The actual search is the easy bit
The Hashtable returned from the Search() method has File objects as the keys and the page rank as the values. The problem is they are not in any particular order!
To access these objects in the foreach loop, we need to cast the key object to a File and the value object to int.
```csharp
// intermediate data-structure for the 'ranked' result HTML
SortedList output = new SortedList(searchResultsArray.Count);  // empty sorted list
DictionaryEntry fo;
File infile;
string result = "";

// build each result row
foreach (object foundInFile in searchResultsArray)
{
    // build the HTML output in the sorted list, so the 'unsorted'
    // searchResults are 'sorted' as they're added to the SortedList
    fo = (DictionaryEntry)foundInFile;
    infile = (File)fo.Key;
    int rank = (int)fo.Value;
```
Listing 10 - Processing the results
Next we build the HTML for each result - the link, title, rank, description, URL, size and crawled date:
```csharp
// Create the formatted output HTML (tags reconstructed - treat the exact markup as illustrative)
result  = ("<a href=\"" + infile.Url + "\">");
result += (infile.Title + "</a>");
result += (" <a href=\"" + infile.Url + "\" target=\"_TOP\" ");
result += ("title=\"open in new window\" style=\"font-size:xx-small\">↑</a>");
result += (" (" + rank + ")");
result += ("<br>" + infile.Description + "...");
result += ("<br>" + infile.Url + " - " + infile.Size);
result += (" bytes - " + infile.CrawledDate + "<p>");
```
Listing 11 - Pure formatting
Before we can output the results, we need to get them in some order. We'll use a SortedList and add the HTML result string to it using the page rank as the key. If there is already a result with the same rank, we'll concatenate the two strings (they'll appear one after the other).
```csharp
int sortrank = (rank * -1);   // multiply by -1 so the larger score goes to the top
if (output.Contains(sortrank))
{
    // rank already exists: concatenate the same-rank output strings
    output[sortrank] = ((string)output[sortrank]) + result;
}
else
{
    output.Add(sortrank, result);
}
result = "";   // clear the string for the next loop
```
Listing 12 - Sorting the results by rank
To make sure the highest rank appears at the top of the list, the rank is multiplied by -1!
Now all we have to do is Response.Write the SortedList, string by string, followed by the number of matches.
```csharp
    // Now output to the HTML Response
    foreach (object rows in output)   // already sorted!
    {
        Response.Write((string)((DictionaryEntry)rows).Value);
    }
    Response.Write(" Matches: " + searchResultsArray.Count);
}
else
{
    Response.Write(" Matches: 0");
}
Response.Write("</body></html>");   // closing markup (reconstructed)
Response.End();                     // Stop here
```
Listing 13 - Output the results
The output should look familiar to any web search engine user. We've implemented a simple ranking mechanism (a word count, shown in parentheses after the Title/Url), although it doesn't support paging.
Screenshot 4 - Search results contain a familiar amount of information, and the word-count-rank value. Clicking a link opens the local copy of the HTML file (the ↑ opens in a new window).
The goal of this article was to build a simple search engine that you can install just by placing some files on your website; so you can copy Searcharoo.cs, SearcharooCrawler.aspx and Searcharoo.aspx to your web root and away you go!
However, that means you accept all the default settings, such as only searching .html files and starting the search from the location of the Searcharoo files.
To change those defaults you need to add some settings to web.config:
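As a rough illustration, such settings would typically live in the appSettings section - the key names below are assumptions for the sake of example, not necessarily the ones Searcharoo actually uses:

```xml
<configuration>
  <appSettings>
    <!-- Assumed key names, for illustration only -->
    <add key="Searcharoo_VirtualRoot" value="http://localhost/MySite/" /> <!-- where the search starts -->
    <add key="Searcharoo_FileFilter"  value="*.html" />                   <!-- which files to catalog -->
  </appSettings>
</configuration>
```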
Listing 14 - example web.config settings
If your application re-starts for any reason (e.g. you compile code into the /bin/ folder, or change web.config settings) the catalog will need to be rebuilt - the next user who performs a search will trigger the catalog build. This is accomplished by checking whether the Cache contains a valid Catalog and, if not, using Server.Transfer to start the crawler.
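A minimal sketch of that check at the top of the search page - the cache key name here is an assumption for illustration:

```csharp
// Try to get the previously-built catalog from the ASP.NET Application Cache.
Catalog m_catalog = Cache["Searcharoo_Catalog"] as Catalog;

if (m_catalog == null)
{
    // No valid catalog (e.g. the application restarted) - hand off to the
    // crawler page, which rebuilds the catalog and puts it back in the Cache.
    Server.Transfer("SearcharooCrawler.aspx");
}
```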
In the real world, most ASP.NET websites probably have more than just HTML pages, including links to DOC, PDF or other external files and ASPX dynamic/database-generated pages.
The other issue you might have is storing a large blob of data in your Application Cache. For most websites the size of this object will be manageable - but if you've got a lot of content you might not want that in memory all the time.
The good news is the code above can be easily extended to cope with these additional scenarios (including spidering web links, and using a database to store the catalog)... check back for future articles.
You'll notice the two ASPX pages use the Src="Searcharoo.cs" @Page attribute to share the common object model without compiling it to an assembly, with the page-specific code written 'inline' using a script block that runs on the server.
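Put together, the top of each page looks something like this (a sketch - the Import directive and the placeholder inline code are assumptions based on the namespace in Listing 1):

```aspx
<%@ Page Language="C#" Src="Searcharoo.cs" %>
<%@ Import Namespace="Searcharoo.Net" %>
<script runat="server">
    // page-specific code goes here 'inline', using the Catalog, Word and File
    // classes compiled from Searcharoo.cs
</script>
```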