#R_packages#10 R packages I wish I knew about earlier

10 R packages I wish I knew about earlier

February 10, 2013 by yhat

I started using R about 3 years ago. It was slow going at first. R had tricky and less intuitive syntax than languages I was used to, and it took a while to get accustomed to the nuances. It wasn't immediately clear to me that the power of the language was bound up with the community and the diverse packages available.

R can be more prickly and obscure than other languages like Python or Java. The good news is that there are tons of packages which provide simple and familiar interfaces on top of Base R. This post is about ten packages I love and use everyday and ones I wish I knew about earlier.

sqldf

install.packages("sqldf")

One of the steepest parts of the R learning curve is the syntax. It took me a while to get over using <- instead of =. I hear people say a lot of times How do I just do a VLOOKUP?!? R is great for general data munging tasks, but it takes a while to master. I think it's safe to say that sqldf was my R "training wheels".

sqldf let's you perform SQL queries on your R data frames. People coming over from SAS will find it very familiar and anyone with basic SQL skills will have no trouble using it--sqldf uses SQLite syntax.

 
             library
             
             (sqldf
             
             )
            
             sqldf
             
             (
             
             "SELECT
            
              day
            
              , avg(temp) as avg_temp
            
              FROM beaver2
            
              GROUP BY
            
              day;"
             
             )
            
             # day avg_temp
            
             #1 307 37.57931
            
             #2 308 37.71308
            
             #beavers1 and beavers2 come with base R
            
             beavers 
             
             <- sqldf
             
             (
             
             "select * from beaver1
            
              union all
            
              select * from beaver2;"
             
             )
            
             #head(beavers)
            
             # day time temp activ
            
             #1 346 840 36.33 0
            
             #2 346 850 36.34 0
            
             #3 346 900 36.35 0
            
             #4 346 910 36.42 0
            
             #5 346 920 36.55 0
            
             #6 346 930 36.69 0
            
             movies 
             
             <- data.frame
             
             (
            
               title
             
             =c
             
             (
             
             "The Great Outdoors"
             
             , 
             
             "Caddyshack"
             
             , 
             
             "Fletch"
             
             , 
             
             "Days of Thunder"
             
             , 
             
             "Crazy Heart"
             
             ),
            
               year
             
             =c
             
             (
             
             1988
             
             , 
             
             1980
             
             , 
             
             1985
             
             , 
             
             1990
             
             , 
             
             2009
             
             )
            
             )
            
             boxoffice 
             
             <- data.frame
             
             (
            
               title
             
             =c
             
             (
             
             "The Great Outdoors"
             
             , 
             
             "Caddyshack"
             
             , 
             
             "Fletch"
             
             , 
             
             "Days of Thunder"
             
             ,
             
             "Top Gun"
             
             ),
            
               revenue
             
             =c
             
             (
             
             43455230
             
             , 
             
             39846344
             
             , 
             
             59600000
             
             , 
             
             157920733
             
             , 
             
             353816701
             
             )
            
             )
            
             sqldf
             
             (
             
             "SELECT
            
              m.*
            
              , b.revenue
            
              FROM
            
              movies m
            
              INNER JOIN
            
              boxoffice b
            
              ON m.title = b.title;"
             
             )
            
             # title year revenue
            
             #1 The Great Outdoors 1988 43455230
            
             #2 Caddyshack 1980 39846344
            
             #3 Fletch 1985 59600000
            
             #4 Days of Thunder 1990 157920733
            
       view raw sqldf_examples.R This Gist brought to you by GitHub.

forecast

install.packages("forecast")

I don't do time series analysis very often, but when I do forecast is my library of choice. forecast makes it incredibly easy to fit time series models like ARIMA, ARMA, AR, Exponential Smoothing, etc.

 
             library
             
             (forecast
             
             )
            
             # mdeaths: Monthly Deaths from Lung Diseases in the UK
            
             fit 
             
             <- auto.arima
             
             (mdeaths
             
             )
            
             #customize your confidence intervals
            
             forecast
             
             (fit
             
             , level
             
             =c
             
             (
             
             80
             
             , 
             
             95
             
             , 
             
             99
             
             ), h
             
             =
             
             3
             
             )
            
             # Point Forecast Lo 80 Hi 80 Lo 95 Hi 95 Lo 99 Hi 99
            
             #Jan 1980 1822.863 1564.192 2081.534 1427.259 2218.467 1302.952 2342.774
            
             #Feb 1980 1923.190 1635.530 2210.851 1483.251 2363.130 1345.012 2501.368
            
             #Mar 1980 1789.153 1495.048 2083.258 1339.359 2238.947 1198.023 2380.283
            
             plot
             
             (forecast
             
             (fit
             
             ), shadecols
             
             =
             
             "oldstyle"
             
             )
            
       view raw forecast_example.R This Gist brought to you by GitHub.

My favorite feature is the resulting forecast plot.

plyr

install.packages("plyr")

When I first started using R, I was using basic control operations for manipulating data (for, if, while, etc.). I quickly learned that this was an amateur move, and that there was a better way to do it.

In R, the apply family of functions is the preferred way to call a function on each element of a list or vector. While Base R has this out of the box, its usage can be tricky to master. I've found the plyr package to be an easy to use substitute for split, apply, combine functionality in Base R.

plyr gives you several functions (ddply, daply, dlply, adply, ldply) following a common blueprint: Split a data structure into groups, apply a function on each group, return the results in a data structure.

ddply splits a data frame and returns a data frame (hence the dd). daply splits a data frame and results an array (hence the da). Hopefully you're getting the idea here.

 
             library
             
             (plyr
             
             )
            
             # split a data frame by Species, summarize it, then convert the results
            
             # into a data frame
            
             ddply
             
             (iris
             
             , .
             
             (Species
             
             ), summarise
             
             ,
            
                   mean_petal_length
             
             =mean
             
             (Petal.Length
             
             )
            
             )
            
             # Species mean_petal_length
            
             #1 setosa 1.462
            
             #2 versicolor 4.260
            
             #3 virginica 5.552
            
             # split a data frame by Species, summarize it, then convert the results
            
             # into an array
            
             unlist
             
             (daply
             
             (iris
             
             [,
             
             4
             
             :
             
             5
             
             ], .
             
             (Species
             
             ), colwise
             
             (mean
             
             )))
            
             # setosa.Petal.Width versicolor.Petal.Width virginica.Petal.Width 
            
             # 0.246 1.326 2.026 
            
       view raw plyr_examples.R This Gist brought to you by GitHub.

stringr

install.packages("stringr")

I find base R's string functionality to be extremely difficult and cumbersome to use. Another package written by Hadley Wickham, stringr, provides some much needed string operators in R. Many of the functions use data strcutures that aren't commonly used when doing basic analysis.

#R_packages#10 R packages I wish I knew about earlier_第1张图片

stringr is remarkably easy to use. Nearly all of the functions (and all of the important ones) are prefixed with "str" so they're very easy to remember.

 
             library
             
             (stringr
             
             )
            
             names
             
             (iris
             
             )
            
             #[1] "Sepal.Length" "Sepal.Width" "Petal.Length" "Petal.Width" "Species"
            
             names
             
             (iris
             
             ) 
             
             <- str_replace_all
             
             (names
             
             (iris
             
             ), 
             
             "[.]"
             
             , 
             
             "_"
             
             )
            
             names
             
             (iris
             
             )
            
             #[1] "Sepal_Length" "Sepal_Width" "Petal_Length" "Petal_Width" "Species"
            
             s 
             
             <- c
             
             (
             
             "Go to Heaven for the climate, Hell for the company."
             
             )
            
             str_extract_all
             
             (s
             
             , 
             
             "[H][a-z]+ "
             
             )
            
             #[[1]]
            
             #[1] "Heaven " "Hell " 
            
       view raw stringr_example.R This Gist brought to you by GitHub.

The database driver package of your choice

install.packages("RPostgreSQL")
install.packages("RMySQL")
install.packages("RMongo")
install.packages("RODBC")
install.packages("RSQLite")

Everyone does it when they first start (myself included). You've just written an awesome query in your preferred SQL editor. Everything is perfect - the column names are all snake case, the dates have the right datatype, you finally debugged the "must appear in the GROUP BY clause or be used in an aggregate function" issue. You're ready to do some analysis in R, so you run the query in your SQL editor, copy the results to a csv (or...God forbid... .xlsx) and read into R. You don't have to do this!

R has great drivers for nearly every conceivable database. On the off chance you're using a database which doesn't have a standalone driver (SQL Server), you can always use RODBC.

 
             library
             
             (RPostgreSQL
             
             )
            
             drv 
             
             <- dbDriver
             
             (
             
             "PostgreSQL"
             
             )
            
             db 
             
             <- dbConnect
             
             (drv
             
             , dbname
             
             =
             
             "ncaa"
             
             ,
            
                              user
             
             =
             
             "YOUR USER NAME"
             
             , password
             
             =
             
             "YOUR PASSWORD"
             
             )
            
             q 
             
             <- 
             
             "SELECT
            
              *
            
              FROM
            
              game_scores;"
            
             data 
             
             <- dbGetQuery
             
             (db
             
             , q
             
             )
            
             head
             
             (data
             
             )
            
             #id school game_date spread school_score opponent opp_score was_home
            
             #1 45111 Boston College 1985-11-16 6.0 21 Syracuse 41 False
            
             #2 45112 Boston College 1985-11-02 13.5 12 Penn State 16 False
            
             #3 45113 Boston College 1985-10-26 -11.0 17 Cincinnati 24 False
            
             #4 45114 Boston College 1985-10-12 -2.0 14 Army 45 False
            
             #5 45115 Boston College 1985-09-28 5.0 10 Miami 45 True
            
             #6 45116 Boston College 1985-09-21 6.5 29 Pittsburgh 22 False
            
             nrow
             
             (data
             
             )
            
             #[1] 30932
            
             ncol
             
             (data
             
             )
            
             #[1] 8
            
       view raw dbdriver_example.R This Gist brought to you by GitHub.

Next time you've got that perfect query written, just paste it into R and execute it using RPostgreSQL, RMySQL, RMongo, RMongo, or RODBC. In addition to preventing you from having tens of hundreds of CSV files sitting arround, running the query in R saves you time both in I/O but also in converting datatypes. Dates, times, and datetimes will be automatically set to their R equivalent. It also makes your R script reproducible, so you or someone else on your team can easily produce the same results.

lubridate

install.packages("lubridate")

I've never had great luck with dates in R. I've never fully grasped the idiosyncracies of working with POSIXs vs. R Dates. Enter lubridate.

lubridate is one of those magical libraries that just seems to do exactly what you expect it to. The functions all have obvious names like year, month, ymd, and ymd_hms. It's similar to Moment.js for those familiar with javascript.

 
             library
             
             (lubridate
             
             )
            
             year
             
             (
             
             "2012-12-12"
             
             )
            
             #[1] 2012
            
             day
             
             (
             
             "2012-12-12"
             
             )
            
             #[1] 12
            
             ymd
             
             (
             
             "2012-12-12"
             
             )
            
             #1 parsed with %Y-%m-%d
            
             #[1] "2012-12-12 UTC"
            
       view raw lubridate_example.R This Gist brought to you by GitHub.

Here's a really handy reference card that I found in a paper. It covers just about everything you might conveivably want to do to a date. I've also found this Date Cheat Sheet to be a handy reference.

ggplot2
```
install.packages("ggplot2")
```
Another Hadley Wickham pacakge and probably his most widely known one. ggplot2 ranks high on everyone's list of favorite R pacakges. It's easy to use and it produces some great looking plots. It's a great way to present your work, and there are many resources available to help you get started.
- Elegant Graphics for Data Analysis by Hadley Wickham (Amazon)
- A Rosetta Stone for Excel to ggplot (Yaksis Blog)
- Hadley Wickham ggplot2 Presentation at Google (youtube)
- R Graphics Cookbook by Winston Chang (Amazon)

qcc

install.packages("qcc")

qcc is a library for statistical quality control. Back in the 1950s, the now defunct Western Electric Company was looking for a better way to detect problems with telephone and eletrical lines. They came up with a set of rules to help them identify problematic lines. The rules look at the historical mean of a series of datapoints and based on the standard deviation, the rules help judge whether a new set of points is experiencing a mean shift.

The classic example is monitoring a machine that produces lug nuts. Let's say the machine is supposed to produce 2.5 inch long lug nuts. We measure a series of lug nuts: 2.48, 2.47, 2.51, 2.52, 2.54, 2.42, 2.52, 2.58, 2.51. Is the machine broken? Well it's hard to tell, but the Western Electric Rules can help.

 
             library
             
             (qcc
             
             )
            
             # series of value w/ mean of 10 with a little random noise added in
            
             x 
             
             <- rep
             
             (
             
             10
             
             , 
             
             100
             
             ) 
             
             + rnorm
             
             (
             
             100
             
             )
            
             # a test series w/ a mean of 11
            
             new.x 
             
             <- rep
             
             (
             
             11
             
             , 
             
             15
             
             ) 
             
             + rnorm
             
             (
             
             15
             
             )
            
             # qcc will flag the new points
            
             qcc
             
             (x
             
             , newdata
             
             =new.x
             
             , type
             
             =
             
             "xbar.one"
             
             )
            
       view raw qcc_example.R This Gist brought to you by GitHub.

While you might not be monitoring telephone lines, qcc can help you monitor transaction volumes, visitors or logins on your website, database operations, and lots of other processes.

#R_packages#10 R packages I wish I knew about earlier_第2张图片

reshape2

install.packages("reshape2")

I always find that the hardest part of any sort of analysis is getting the data into the right format. reshape2 is yet another package by Hadley Wickham that specializes in converting data from wide to long format and vice versa. I use it all the time in conjunction with ggplot2 and plyr.

 
             library
             
             (reshape2
             
             )
            
             # generate a unique id for each row; this let's us go back to wide format later
            
             iris
             
             $id 
             
             <- 
             
             1
             
             :nrow
             
             (iris
             
             )
            
             iris.lng 
             
             <- melt
             
             (iris
             
             , id
             
             =c
             
             (
             
             "id"
             
             , 
             
             "Species"
             
             ))
            
             head
             
             (iris.lng
             
             )
            
             # id Species variable value
            
             #1 1 setosa Sepal.Length 5.1
            
             #2 2 setosa Sepal.Length 4.9
            
             #3 3 setosa Sepal.Length 4.7
            
             #4 4 setosa Sepal.Length 4.6
            
             #5 5 setosa Sepal.Length 5.0
            
             #6 6 setosa Sepal.Length 5.4
            
             iris.wide 
             
             <- dcast
             
             (iris.lng
             
             , id 
             
             + Species 
             
             ~ variable
             
             )
            
             head
             
             (iris.wide
             
             )
            
             # id Species Sepal.Length Sepal.Width Petal.Length Petal.Width
            
             #1 1 setosa 5.1 3.5 1.4 0.2
            
             #2 2 setosa 4.9 3.0 1.4 0.2
            
             #3 3 setosa 4.7 3.2 1.3 0.2
            
             #4 4 setosa 4.6 3.1 1.5 0.2
            
             #5 5 setosa 5.0 3.6 1.4 0.2
            
             #6 6 setosa 5.4 3.9 1.7 0.4
            
             library
             
             (ggplot2
             
             )
            
             # plots a histogram for each numeric column in the dataset
            
             p 
             
             <- ggplot
             
             (aes
             
             (x
             
             =value
             
             , fill
             
             =Species
             
             ), data
             
             =iris.lng
             
             )
            
             p 
             
             + geom_histogram
             
             () 
             
             +
            
               facet_wrap
             
             (
             
             ~variable
             
             , scales
             
             =
             
             "free"
             
             )
            
       view raw reshape_example.R This Gist brought to you by GitHub.

It's a great way to quickly take a look at a dataset and get your bearings. You can use the melt function to convert wide data to long data, and dcast to go from long to wide.

randomForest

install.packages("randomForest")

This list wouldn't be complete without including at least one machine learning package you can impress your friends with. Random Forest is a great algorithm to start with. It's easy to use, can do supervised or unsupervised learning, it can be used with many differnet types of datasets, but most importantly it's effective! Here's how it works in R.

 
             library
             
             (randomForest
             
             )
            
             # download Titanic Survivors data
            
             data 
             
             <- read.table
             
             (
             
             "http://math.ucdenver.edu/RTutorial/titanic.txt"
             
             , h
             
             =
             
             T
             
             , sep
             
             =
             
             "\t"
             
             )
            
             # make survived into a yes/no
            
             data
             
             $Survived 
             
             <- as.factor
             
             (ifelse
             
             (data
             
             $Survived
             
             ==
             
             1
             
             , 
             
             "yes"
             
             , 
             
             "no"
             
             ))                 
            
             # split into a training and test set
            
             idx 
             
             <- runif
             
             (nrow
             
             (data
             
             )) 
             
             <= 
             
             .75
            
             data.train 
             
             <- data
             
             [idx
             
             ,]
            
             data.test 
             
             <- data
             
             [
             
             -idx
             
             ,]
            
             # train a random forest
            
             rf 
             
             <- randomForest
             
             (Survived 
             
             ~ PClass 
             
             + Age 
             
             + Sex
             
             , 
            
                          data
             
             =data.train
             
             , importance
             
             =
             
             TRUE
             
             , na.action
             
             =na.omit
             
             )
            
             # how important is each variable in the model
            
             imp 
             
             <- importance
             
             (rf
             
             )
            
             o 
             
             <- order
             
             (imp
             
             [,
             
             3
             
             ], decreasing
             
             =
             
             T
             
             )
            
             imp
             
             [o
             
             ,]
            
             # no yes MeanDecreaseAccuracy MeanDecreaseGini
            
             #Sex 51.49855 53.30255 55.13458 63.46861
            
             #PClass 25.48715 24.12522 28.43298 22.31789
            
             #Age 20.08571 14.07954 24.64607 19.57423
            
             # confusion matrix [[True Neg, False Pos], [False Neg, True Pos]]
            
             table
             
             (data.test
             
             $Survived
             
             , predict
             
             (rf
             
             , data.test
             
             ), dnn
             
             =list
             
             (
             
             "actual"
             
             , 
             
             "predicted"
             
             ))
            
             # predicted
            
             #actual no yes
            
             # no 427 16
            
             # yes 117 195
            
       view raw randomforest_example.R This Gist brought to you by GitHub.

linux清空文件夹的命令 getapi linux github git
在Linux系统中，清空文件夹（即删除文件夹中的所有内容，但保留文件夹本身）可以通过多种方法实现。以下是几种常见的命令和操作方式：方法1:使用rm命令rm是一个强大的命令，用于删除文件和目录。要清空文件夹的内容，可以使用以下命令：rm-rf/path/to/folder/*解释：rm：删除命令。-r：递归删除，用于处理目录及其子目录。-f：强制删除，无需确认。/path/to/folder/*：指
强化学习-Chapter2-贝尔曼方程 Rsbs 算法机器学习概率论
强化学习-Chapter2-贝尔曼方程贝尔曼方程推导继续展开贝尔曼方程的矩阵形式状态值的求解动作价值函数与状态价值函数的关系贝尔曼方程推导Vπ(s)=E[Gt∣St=s]=E[rt+1+(γrt+2+…)∣St=s]=E[rt+1+γGt+1∣St=s]=∑a∈Aπ(s,a)∑s′∈SPs→s′a⋅(Rs→s′a+γE[Gt+1∣St+1=s′])=∑a∈Aπ(s,a)∑s′∈SPs→s′a⋅(R
字典遍历时不能修改字典元素 nihuhui666 python 列表
a={'a':1,'b':2,'c':0}foriina:ifa[i]==0:dela[i]print(a)报错RuntimeError:dictionarychangedsizeduringiteration字典在迭代时改变了因为要删除的是值为0的元素所以迭代键的列表就行了foriinlist(a.keys()):ifa[i]==0:dela[i]print(a)这样就行了结果发现foriina
算法训练-拓扑排序2 往往歌咏理想算法深度优先
洛谷P1807最长路https://www.luogu.com.cn/problem/P1807本题数据范围过大盲目使用dfs容易超时爆栈题目要求中提到i#defineintlonglong#defineendl'\n'/*===\\================//\\===================//\\============//\\==========//=========\\=
高安全可靠MCU芯片AS32X601应用解析国科安芯产品单片机嵌入式硬件 risc-v 架构 fpga开发
1.AS32X601简介AS32X601系列是国科安芯基于32位RISC-V指令集研发的高性能MCU产品，具备高安全、低失效、多接口、低成本等核心优势。该系列包含工业级（AS32I601ZIT6）、车规级（AS32A601ZIT3）、企业宇航级（AS32S601ZIT2）及企军级（AS32M601ZIT2）四个型号，覆盖工业控制、汽车电子、航天及军工等严苛场景。其关键特性包括：高安全设计：支持AS
Win32 SDK Gui编程系列之--Win32 API通用控件「已注销」 Win32 SDK Gui C/C++工具 C windows c++c语言
要使用普通控件的话，包含语句#include通过追加和初始化InitCommonControls();是必要的。也可以通过InitCommonControlsEx函数单独注册使用的类。另外，需要导入comctl32.dll。1.标签、控件只显示标签、控件的程序tabctrl.c和根据按下的标签进行显示的程序tabcontr0l.c和各自的执行结果如下所示。Tabctrl01.c#include#i
HuggingFace下载模型并导入Ollama指南 Repetion_Maxumim embedding 语言模型人工智能自然语言处理 ai
此处以moka-ai/m3e-base模型下载为例。众所周知，HuggingFace仓库托管了诸多训练模型。DeepSeek官方也将完整满血版DeepSeek-R1:671B模型镜像托管在此仓库，但是目前国内无法直接从HugingFace下载。并且，一般为了快速部署，会选择Ollama这类管模型管理工具，类似Docker引擎一样，但是Ollama支持的模型镜像格式（如GGUF）HuggingFac
用Python打造智能家居安防系统，让科技守护你的家 Echo_Wish Python 笔记 Python 算法 python 智能家居科技
友友们好！我是Echo_Wish，我的的新专栏《Python进阶》以及《Python！实战！》正式启动啦！这是专为那些渴望提升Python技能的朋友们量身打造的专栏，无论你是已经有一定基础的开发者，还是希望深入挖掘Python潜力的爱好者，这里都将是你不可错过的宝藏。在这个专栏中，你将会找到：●深入解析：每一篇文章都将深入剖析Python的高级概念和应用，包括但不限于数据分析、机器学习、Web开发
RISC-V汇编学习（二）—— 汇编语法禾仔仔 RISC-V risc-v 汇编
在具体汇编指令和汇编实战之前，还是有必要对RISC-V汇编进行下介绍，我一般称之为RISC-V汇编的“语法”，可能“语法”较少，也相对比较简单的原因，大部分的博主都是一笔带过，但本着循序渐进的原则，还是简单概述下，以便加深认识。RISCV汇编学习系列：RISC-V汇编学习（一）——基础认识RISC-V汇编学习（二）——汇编语法RISC-V汇编学习（三）——RV指令集RISC-V汇编学习（四）——R
输入10个数字，然后逆序输出。 |CXHAO| Python 数字反转数组循环字符输出
输入1234567890输出0987654321#includeusingnamespacestd;intmain(){inta[10];for(inti=0;i>a[i];}for(intj=9;j>=0;j--){cout<<a[j]<<'';}cout<<endl;return0;}
基于NXP+FPGA轨道交通3U机箱结构远程输入/输出模块（RIOM）深圳信迈主板定制专家轨道交通 NXP+FPGA fpga开发人工智能大数据边缘计算运维
基于NXP+FPGA轨道交通6U机箱结构远程输入/输出模块（RIOM）RIOM使得数据通过就近的I/O源输入和输出。也可以直接将I/O源连接到列车计算机（如VCU），可以减少电缆用量从而节约成本。关键特性支持模拟和数字输入/输出。可配置的模块包括DI、DIO、MDO、RDO、AIO、PTI等。接口选项MVBRIOM设备支持MVB/CAN/串行链路三种接口；TRDPRIOM设备知此恨TRDP/CAN
第十三届蓝桥杯研究生组C++省赛格格巫ZYX 算法 c语言 c++蓝桥杯
有一根围绕原点O顺时针旋转的棒OA，初始时指向正上方（Y轴正向）。在平面中有若干物件，第i个物件的坐标为(xi,yi)，价值为zi。当棒扫到某个物件时，棒的长度会瞬间增长zi，且物件瞬间消失（棒的顶端恰好碰到物件也视为扫到），如果此时增长完的棒又额外碰到了其他物件，也按上述方式消去（它和上述那个点视为同时消失）。如果将物件按照消失的时间排序，则每个物件有一个排名，同时消失的物件排名相同，请输出每个
C/C++ R-Tree原理及源代码猿来如此yyy C/C++算法详解及源码 r-tree c语言 c++开发语言算法数据结构
R树是一种用于高维空间数据的索引结构，它是由AntoninGuttman于1984年提出的。R树旨在提高对多维数据进行范围查询的性能。它被广泛应用于空间数据库中。R树的核心思想是将数据划分为不相交的矩形区域，并逐层构建一个树结构。每个非叶子节点都是一个矩形，它覆盖了它的所有子节点。每个叶子节点都是一个数据对象与其坐标范围的组合。通过这种方式，R树能够将相邻的数据对象聚集在一起，从而减少对数据的搜索
【开源代码解读】AI检索系统R1-Searcher通过强化学习RL激励大模型LLM的搜索能力 accurater 人工智能深度学习 R1-Searcher
关于R1-Searcher的报告：第一章：引言-AI检索系统的技术演进与R1-Searcher的创新定位1.1信息检索技术的范式转移在数字化时代爆发式增长的数据洪流中，信息检索系统正经历从传统关键词匹配到语义理解驱动的根本性变革。根据IDC的统计，2023年全球数据总量已突破120ZB，其中非结构化数据占比超过80%。这种数据形态的转变对检索系统提出了三个核心的挑战：语义歧义消除：如何准确理解"A
【python error】cannot import name ‘TorchDispatchMode‘ from ‘torch.utils._python_dispatch‘ Eternal-Student Jetson Orin NX Python python 开发语言
报错：cannotimportname‘TorchDispatchMode’from‘torch.utils._python_dispatch’(/home/nvidia/.conda/envs/pytorch/lib/python3.8/site-packages/torch/utils/_python_dispatch.py)File“/media/nvidia/Ubuntu/xxxxx/ev
不与最大数相同的数字之和（信息学奥赛一本通-1113） Doopny@ 信息学奥赛一本通算法数据结构
【题目描述】输出一个整数数列中不与最大数相同的数字之和。【输入】输入分为两行：第一行为N(N为接下来数的个数，Nusingnamespacestd;constintN=1e4+10;intnums[N];intmain(){intn;cin>>n;cin>>nums[1];intmax_v=nums[1],sum=0;for(inti=2;i>nums[i];if(nums[i]>max_v)ma
ssh -i key 执行时 Permissions 0644 for '你的.pem' are too open 问题 Java菜鸟在北京 Linux常见问题
ssh-ikey地址；使用密钥登录时的Permissions0644for'你的.pem'aretooopen.ItisrequiredthatyourprivatekeyfilesareNOTaccessiblebyothers.Thisprivatekeywillbeignored.Loadkey"你的.pem":[email protected]
【贪心算法5】 m0_46150269 贪心算法算法
力扣738.单调递增的数字链接:link思路遇到c[i]>c[i+1]则c[i]–,然后就是给c[i+1]赋值‘9’；需要注意的是star初值问题，可见注释部分。classSolution{publicintmonotoneIncreasingDigits(intn){Strings=String.valueOf(n);char[]c=s.toCharArray();intstar=c.lengt
2025-3-14 leetcode刷题情况（贪心算法）肖筱小瀟蓝桥杯 leetcode 贪心算法算法
一、53.最大子序和1.题目描述2.代码3.思路先特殊处理数组只有一个数的情况，再定义两个变量，sum用于记录最大子数组和，count用于记录当前连续子数组的和。使用for循环遍历数组nums中的每个元素。对于每个元素nums[i]，将其累加到count中。每次累加后，使用Math.max函数比较sum和count的大小，将较大值更新到sum中，确保sum始终记录最大子数组和。如果count小于等
STM32F407 SPI通信 Klein、凉城 STM32F407标准库 stm32 嵌入式硬件单片机
1、SPI介绍SPI（串行外设接口）是一种由摩托罗拉公司开发的同步串行通信协议，主要用于短距离、高速通信的场景（如芯片间通信）。其核心特点是主从架构、全双工通信和硬件简单，广泛应用于嵌入式系统中连接微控制器（MCU）与传感器、存储器（如EEPROMFlash）、显示屏、实时时钟和网络控制器等外设。SPI接口提供两个主要功能，支持SPI协议或I2S音频协议。默认情况下，选择的是SPI功能。可通过软件
实时时钟（RTC）/日历芯片PCF8563的I2C读写驱动（4）：基于HAL库实现硬件I2C读写接口 NW嵌入式开发驱动开发单片机开发 Linux开发 RTC PCF8563 实时时钟 I2C 驱动
0参考资料PCF8563数据手册（第11版——2015年10月26日）.pdf1基于HAL库实现硬件I2C读写接口1.1初始化硬件I2C引脚/***@brief硬件I2C1初始化*配置为350KHz*@returnint0：成功-1：失败*/intbsp_hw_i2c1_init(void){/*速度配置为350KHzI2C1总线挂载器件：1.PCF8563（RTC器件，最高支持400KHz，实测
C语言占位符详细介绍南玖yy C语言基础 c语言开发语言
1.printf()的占位符有许多种类，与C语⾔的数据类型相对应。下⾯按照字⺟顺序，颜色标出常⽤的占位符，⽅便大家记忆。•%a：⼗六进制浮点数，字⺟输出为⼩写。•%A：⼗六进制浮点数，字⺟输出为⼤写。•%c：字符。•%d：⼗进制整数。•%e：使⽤科学计数法的浮点数，指数部分的e为⼩写。•%E：使⽤科学计数法的浮点数，指数部分的E为⼤写。•%i：整数，基本等同于%d，除了scanf有一点区别。•%f
二叉树的所有路径（leetcode 257 JohnFF leetcode linux 算法
leetcode系列文章目录一、核心操作二、外层配合操作三、核心模式代码总结使用递归法一、核心操作1.判断是不是叶子节点（该节点的左右子节点都为空2.收获该路径（将储存的节点一个一个拿出来，用->连接if(cur->left==nullptr&&cur->right==nullptr){stringspath;for(inti=0;i";}spath+=to_string(path[path.si
leetcode1005:K次取反后最大化的数组和 0cfjg0 leetcode 算法 java 数据结构
K次取反后最大化的数组和给你一个整数数组nums和一个整数k，按以下方法修改该数组：选择某个下标i并将nums[i]替换为-nums[i]。重复这个过程恰好k次。可以多次选择同一个下标i。以这种方式修改数组后，返回数组可能的最大和。publicintlargestSumAfterKNegations(int[]nums,intk){intmin;intindex;while(true){min=I
【2025年饿了么春招-3月14日-第二题（200分）- 小红的排列构造】（题目+思路+Java&C++&Python解析+在线测试) 塔子哥学算法 java c++python 算法数据结构饿了么
题目内容小红希望你构造一个长度为nnn的排列，满足∑i=1n∗i\sum_{i
【动态规划1】 m0_46150269 动态规划算法
力扣509.斐波那契数链接:link思路这是一道经典的动态规划DP题，做动态有5步：1.确定dp[i]含义，表示第i个数的斐波那契数值是dp[i]2.dp数组初始化3.确定递推公式4.确定遍历顺序，从递推公式可以知道dp[i]是依赖dp[i-1]和dp[i-2]，那么遍历的顺序一定是从前到后遍历的5.举例推导，草稿完成classSolution{publicintfib(intn){if(n<=1
Leetcode1005:k次取反后最大化的数组和(贪心算法) immortalize leetcode算法题解答 java 算法贪心算法 leetcode
Leetcode1005:k次取反后最大化的数组和题目：给你一个整数数组nums和一个整数k，按以下方法修改该数组：选择某个下标i并将nums[i]替换为-nums[i]。重复这个过程恰好k次。可以多次选择同一个下标i。以这种方式修改数组后，返回数组可能的最大和。思路：贪心算法代码如下：classSolution{publicintlargestSumAfterKNegations(int[]nu
阿里巴巴发布 R1-Omni：首个基于 RLVR 的全模态大语言模型，用于情感识别新加坡内哥谈技术语言模型人工智能自然语言处理
每周跟踪AI热点新闻动向和震撼发展想要探索生成式人工智能的前沿进展吗？订阅我们的简报，深入解析最新的技术突破、实际应用案例和未来的趋势。与全球数同行一同，从行业内部的深度分析和实用指南中受益。不要错过这个机会，成为AI领域的领跑者。点击订阅，与未来同行！订阅：https://rengongzhineng.io/情感识别一直是AI领域的难题，尤其是视觉与音频信号的融合。单独依赖视觉或音频的模型，往往
用Python玩转Hyperledger：构建企业级区块链解决方案 Echo_Wish Python！实战！perl python opencv 人工智能
用Python玩转Hyperledger：构建企业级区块链解决方案大家好，我是Echo_Wish。在区块链技术的炙手可热中，“企业级区块链”俨然成为了下一个重磅关键词。相比于公有区块链，企业级区块链更注重隐私性、灵活性和高效性。而在这片“蓝海”中，Hyperledger项目无疑是企业级区块链解决方案的标杆。如果再搭配上Python这种“高效工具”，简直让人事半功倍！那么，如何将Python与Hyp
“杀疯了”，头部玩家纷纷下场，冲榜高阶智驾第一梯队！高工智能汽车自动驾驶人工智能
2025年的中国乘用车市场份额战与销量突破口，高阶智驾无疑是关键赛点。日前，吉利也高调入场，再次掀起高阶智驾市场普及战的新高潮。类似于比亚迪的天神之眼分成ABC三挡，对应不同级别车型，吉利的千里浩瀚智驾方案更加多元化，分为H1、H3、H5、H7和H9共5个不同层级的智驾方案。其中H1采用10V5R传感器方案，功能方面可实现高速NOA和记忆泊车HPA，主打极致性价比。根据资料来看，H1采用的是双黑芝
VMware Workstation 11 或者 VMware Player 7安装MAC OS X 10.10 Yosemite iwindyforest vmware mac os 10.10 workstation player
最近尝试了下VMware下安装MacOS 系统，安装过程中发现网上可供参考的文章都是VMware Workstation 10以下， MacOS X 10.9以下的文章，只能提供大概的思路，但是实际安装起来由于版本问题，走了不少弯路，所以我尝试写以下总结，希望能给有兴趣安装OSX的人提供一点帮助。写在前面的话：其实安装好后发现，由于我的th
关于《基于模型驱动的B/S在线开发平台》源代码开源的疑虑？ deathwknight JavaScript java 框架
本人从学习Java开发到现在已有10年整，从一个要自学 java买成javascript的小菜鸟，成长为只会java和javascript语言的老菜鸟（个人邮箱：[email protected]）一路走来，跌跌撞撞。用自己的三年多业余时间，瞎搞一个小东西（基于模型驱动的B/S在线开发平台，非MVC框架、非代码生成）。希望与大家一起分享，同时有许些疑虑，希望有人可以交流下平台
如何把maven项目转成web项目 Kai_Ge maven MyEclipse
创建Web工程，使用eclipse ee创建maven web工程 1.右键项目,选择Project Facets,点击Convert to faceted from 2.更改Dynamic Web Module的Version为2.5.(3.0为Java7的,Tomcat6不支持). 如果提示错误,可能需要在Java Compiler设置Compiler compl
主管？？？ Array_06 工作
转载：http://www.blogjava.net/fastzch/archive/2010/11/25/339054.html 很久以前跟同事参加的培训，同事整理得很详细，必须得转！前段时间，公司有组织中高阶主管及其培养干部进行了为期三天的管理训练培训。三天的课程下来，虽然内容较多，因对老师三天来的课程内容深有感触，故借着整理学习心得的机会，将三天来的培训课程做了一个
python内置函数大全 2002wmj python
最近一直在看python的document，打算在基础方面重点看一下python的keyword、Build-in Function、Build-in Constants、Build-in Types、Build-in Exception这四个方面，其实在看的时候发现整个《The Python Standard Library》章节都是很不错的，其中描述了很多不错的主题。先把Build-in Fu
JSP页面通过JQUERY合并行 357029540 JavaScript jquery
在写程序的过程中我们难免会遇到在页面上合并单元行的情况，如图所示如果对于会的同学可能很简单，但是对没有思路的同学来说还是比较麻烦的，提供一下用JQUERY实现的参考代码 function mergeCell(){ var trs = $("#table tr"); &nb
Java基础冰天百华 java基础
学习函数式编程 package base; import java.text.DecimalFormat; public class Main { public static void main(String[] args) { // Integer a = 4; // Double aa = (double)a / 100000; // Decimal
unix时间戳相互转换 adminjun 转换 unix 时间戳
如何在不同编程语言中获取现在的Unix时间戳(Unix timestamp)？ Java time JavaScript Math.round(new Date().getTime()/1000) getTime()返回数值的单位是毫秒 Microsoft .NET / C# epoch = (DateTime.Now.ToUniversalTime().Ticks - 62135
作为一个合格程序员该做的事 aijuans 程序员
作为一个合格程序员每天该做的事 1、总结自己一天任务的完成情况最好的方式是写工作日志，把自己今天完成了什么事情，遇见了什么问题都记录下来，日后翻看好处多多 2、考虑自己明天应该做的主要工作把明天要做的事情列出来，并按照优先级排列，第二天应该把自己效率最高的时间分配给最重要的工作 3、考虑自己一天工作中失误的地方，并想出避免下一次再犯的方法出错不要紧，最重
由html5视频播放引发的总结 ayaoxinchao html5 视频 video
前言项目中存在视频播放的功能，前期设计是以flash播放器播放视频的。但是现在由于需要兼容苹果的设备，必须采用html5的方式来播放视频。我就出于兴趣对html5播放视频做了简单的了解，不了解不知道，水真是很深。本文所记录的知识一些浅尝辄止的知识，说起来很惭愧。视频结构本该直接介绍html5的<video>的，但鉴于本人对视频
解决httpclient访问自签名https报javax.net.ssl.SSLHandshakeException: sun.security.validat bewithme httpclient
如果你构建了一个https协议的站点，而此站点的安全证书并不是合法的第三方证书颁发机构所签发，那么你用httpclient去访问此站点会报如下错误 javax.net.ssl.SSLHandshakeException: sun.security.validator.ValidatorException: PKIX path bu
Jedis连接池的入门级使用 bijian1013 redis redis数据库 jedis
Jedis连接池操作步骤如下： a.获取Jedis实例需要从JedisPool中获取； b.用完Jedis实例需要返还给JedisPool； c.如果Jedis在使用过程中出错，则也需要还给JedisPool； packag
变与不变 bingyingao 不变变亲情永恒
变与不变周末骑车转到了五年前租住的小区，曾经最爱吃的西北面馆、江西水饺、手工拉面早已不在，各种店铺都换了好几茬，这些是变的。三年前还很流行的一款手机在今天看起来已经落后的不像样子。三年前还运行的好好的一家公司，今天也已经不复存在。一座座高楼拔地而起，
【Scala十】Scala核心四：集合框架之List bit1129 scala
Spark的RDD作为一个分布式不可变的数据集合，它提供的转换操作，很多是借鉴于Scala的集合框架提供的一些函数，因此，有必要对Scala的集合进行详细的了解 1. 泛型集合都是协变的，对于List而言，如果B是A的子类，那么List[B]也是List[A]的子类，即可以把List[B]的实例赋值给List[A]变量 2. 给变量赋值(注意val关键字，a，b
Nested Functions in C bookjovi c closure
Nested Functions 又称closure，属于functional language中的概念，一直以为C中是不支持closure的，现在看来我错了，不过C标准中是不支持的，而GCC支持。既然GCC支持了closure，那么 lexical scoping自然也支持了，同时在C中label也是可以在nested functions中自由跳转的
Java-Collections Framework学习与总结-WeakHashMap BrokenDreams Collections
总结这个类之前，首先看一下Java引用的相关知识。Java的引用分为四种：强引用、软引用、弱引用和虚引用。强引用：就是常见的代码中的引用，如Object o = new Object();存在强引用的对象不会被垃圾收集
读《研磨设计模式》-代码笔记-解释器模式-Interpret bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ package design.pattern; /* * 解释器（Interpreter）模式的意图是可以按照自己定义的组合规则集合来组合可执行对象 * * 代码示例实现XML里面1.读取单个元素的值 2.读取单个属性的值 * 多
After Effects操作&快捷键 cherishLC After Effects
1、快捷键官方文档中文版：https://helpx.adobe.com/cn/after-effects/using/keyboard-shortcuts-reference.html 英文版：https://helpx.adobe.com/after-effects/using/keyboard-shortcuts-reference.html 2、常用快捷键
Maven 常用命令 crabdave maven
Maven 常用命令 mvn archetype:generate mvn install mvn clean mvn clean complie mvn clean test mvn clean install mvn clean package mvn test mvn package mvn site mvn dependency:res
shell bad substitution daizj shell 脚本
#!/bin/sh /data/script/common/run_cmd.exp 192.168.13.168 "impala-shell -islave4 -q 'insert OVERWRITE table imeis.${tableName} select ${selectFields}, ds, fnv_hash(concat(cast(ds as string), im
Java SE 第二讲（原生数据类型 Primitive Data Type） dcj3sjt126com java
Java SE 第二讲： 1. Windows: notepad, editplus, ultraedit, gvim Linux: vi, vim, gedit 2. Java 中的数据类型分为两大类： 1）原生数据类型（Primitive Data Type） 2）引用类型（对象类型）（R
CGridView中实现批量删除 dcj3sjt126com PHP yii
1，CGridView中的columns添加 array( 'selectableRows' => 2, 'footer' => '<button type="button" onclick="GetCheckbox();" style=&
Java中泛型的各种使用 dyy_gusi java 泛型
Java中的泛型的使用：1.普通的泛型使用在使用类的时候后面的<>中的类型就是我们确定的类型。 public class MyClass1<T> {//此处定义的泛型是T private T var; public T getVar() { return var; } public void setVa
Web开发技术十年发展历程 gcq511120594 Web 浏览器数据挖掘
回顾web开发技术这十年发展历程： Ajax 03年的时候我上六年级，那时候网吧刚在小县城的角落萌生。传奇，大话西游第一代网游一时风靡。我抱着试一试的心态给了网吧老板两块钱想申请个号玩玩，然后接下来的一个小时我一直在，注，册，账，号。彼时网吧用的512k的带宽，注册的时候，填了一堆信息，提交，页面跳转，嘣，”您填写的信息有误，请重填”。然后跳转回注册页面，以此循环。我现在时常想，如果当时a
openSession()与getCurrentSession()区别： hetongfei java DAO Hibernate
来自 http://blog.csdn.net/dy511/article/details/6166134 1.getCurrentSession创建的session会和绑定到当前线程,而openSession不会。 2. getCurrentSession创建的线程会在事务回滚或事物提交后自动关闭,而openSession必须手动关闭。这里getCurrentSession本地事务(本地
第一章安装Nginx+Lua开发环境 jinnianshilongnian nginx lua openresty
首先我们选择使用OpenResty，其是由Nginx核心加很多第三方模块组成，其最大的亮点是默认集成了Lua开发环境，使得Nginx可以作为一个Web Server使用。借助于Nginx的事件驱动模型和非阻塞IO，可以实现高性能的Web应用程序。而且OpenResty提供了大量组件如Mysql、Redis、Memcached等等，使在Nginx上开发Web应用更方便更简单。目前在京东如实时价格、秒
HSQLDB In-Process方式访问内存数据库 liyonghui160com
HSQLDB一大特色就是能够在内存中建立数据库，当然它也能将这些内存数据库保存到文件中以便实现真正的持久化。先睹为快！下面是一个In-Process方式访问内存数据库的代码示例：下面代码需要引入hsqldb.jar包（hsqldb-2.2.8） import java.s
Java线程的5个使用技巧 pda158 java 数据结构
Java线程有哪些不太为人所知的技巧与用法？　　萝卜白菜各有所爱。像我就喜欢Java。学无止境，这也是我喜欢它的一个原因。日常工作中你所用到的工具，通常都有些你从来没有了解过的东西，比方说某个方法或者是一些有趣的用法。比如说线程。没错，就是线程。或者确切说是Thread这个类。当我们在构建高可扩展性系统的时候，通常会面临各种各样的并发编程的问题，不过我们现在所要讲的可能会略有不同。
开发资源大整合：编程语言篇——JavaScript（1） shoothao JavaScript
概述：本系列的资源整合来自于github中各个领域的大牛，来收藏你感兴趣的东西吧。程序包管理器管理javascript库并提供对这些库的快速使用与打包的服务。 Bower - 用于web的程序包管理。 component - 用于客户端的程序包管理，构建更好的web应用程序。 spm - 全新的静态的文件包管
避免使用终结函数 vahoa.ma java jvm C++
终结函数（finalizer）通常是不可预测的，常常也是很危险的，一般情况下不是必要的。使用终结函数会导致不稳定的行为、更差的性能，以及带来移植性问题。不要把终结函数当做C++中的析构函数（destructors）的对应物。我自己总结了一下这一条的综合性结论是这样的： 1）在涉及使用资源，使用完毕后要释放资源的情形下，首先要用一个显示的方