msw521sg

Gene expression analysis based on RNA-seq

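I have recently been analysing the expression of some wheat genes and decided to run a bioinformatic analysis on RNA-seq data, which also covers more tissues than I use in my own experiments.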

Sequence preprocessing

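After downloading the data, the first step is to remove low-quality sequences and contaminants such as adapter sequences. I combine two programs for this, AdapterRemoval and bbduk2; bbduk2 is a subprogram of bbmap.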

AdapterRemoval --file1 input1.fastq.gz --file2 input2.fastq.gz --qualitybase 33 --trimns --minlength 40 --threads 10 --adapter-list ~/adapterremoval-2.1.7/benchmark/adapters/adapters.fasta --output1 output1.fastq.gz --output2 output2.fastq.gz
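When many libraries need trimming, the command above can be generated per sample instead of being typed by hand. The following is a minimal sketch, assuming paired files named <sample>_1.fastq.gz and <sample>_2.fastq.gz; that naming scheme, the helper names and the ".trimmed" output suffix are hypothetical, and only flags that appear in the command above are used.

```python
import subprocess


def adapterremoval_cmd(sample, adapter_list, threads=10, minlength=40):
    """Build the AdapterRemoval argument list for one paired-end sample.

    The <sample>_1/_2.fastq.gz naming is an assumption made for illustration.
    """
    return [
        "AdapterRemoval",
        "--file1", f"{sample}_1.fastq.gz",
        "--file2", f"{sample}_2.fastq.gz",
        "--qualitybase", "33",
        "--trimns",
        "--minlength", str(minlength),
        "--threads", str(threads),
        "--adapter-list", adapter_list,
        "--output1", f"{sample}_1.trimmed.fastq.gz",
        "--output2", f"{sample}_2.trimmed.fastq.gz",
    ]


def trim_all(samples, adapter_list, dry_run=True):
    """Print (dry run) or execute the trimming command for every sample."""
    cmds = [adapterremoval_cmd(s, adapter_list) for s in samples]
    for cmd in cmds:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)  # stop at the first failure
    return cmds
```

Calling trim_all(["sampleA", "sampleB"], "adapters.fasta") prints the two commands without running them. AdapterRemoval's help lists --gzip as off by default, so it may be worth adding that flag if the outputs are really meant to be gzip-compressed.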

Typing AdapterRemoval in a terminal prints the full list of parameters, as follows:

AdapterRemoval ver. 2.1.7

This program searches for and removes remnant adapter sequences from
your read data.  The program can analyze both single end and paired end
data.  For detailed explanation of the parameters, please refer to the
man page.  For comments, suggestions  and feedback please contact Stinus
Lindgreen ([email protected]) and Mikkel Schubert ([email protected]).

If you use the program, please cite the paper:
    Schubert, Lindgreen, and Orlando (2016). AdapterRemoval v2: rapid
    adapter trimming, identification, and read merging.
    BMC Research Notes, 12;9(1):88.

    http://bmcresnotes.biomedcentral.com/articles/10.1186/s13104-016-1900-2


Arguments:                           Description:
  --help                             Display this message.
  --version                          Print the version string.

  --file1 FILE                       Input file containing mate 1 reads or single-ended reads [REQUIRED].
  --file2 FILE                       Input file containing mate 2 reads [OPTIONAL].

FASTQ OPTIONS:
  --qualitybase BASE                 Quality base used to encode Phred scores in input; either 33, 64, or solexa [current: 33].
  --qualitybase-output BASE          Quality base used to encode Phred scores in output; either 33, 64, or solexa. By default, reads will be written in the same format as the that specified using --qualitybase.
  --qualitymax BASE                  Specifies the maximum Phred score expected in input files, and used when writing output. ASCII encoded values are limited to the characters '!' (ASCII = 33) to'~' (ASCII = 126), meaning that possible scores are 0 - 93 with offset 33, and 0 - 62 for offset 64 and Solexa scores [default: 41].
  --mate-separator CHAR              Character separating the mate number (1 or 2) from the read name in FASTQ records [default: '/'].
  --interleaved                      This option enables both the --interleaved-input option and the
                                       --interleaved-output option [current: off].
  --interleaved-input                The (single) input file provided contains both the mate 1 and mate 2 reads, one pair after the other, with one mate 1 reads followed by one mate 2 read. This option is implied by the --interleaved option [current: off].
  --interleaved-output               If set, trimmed paired-end reads are written to a single file containing mate 1 and mate 2 reads, one pair after the other. This option is implied by the --interleaved option [current: off].

OUTPUT FILES:
  --basename BASENAME                Default prefix for all output files for which no filename was explicitly set [current: your_output].
  --settings FILE                    Output file containing information on the parameters used in the run as well as overall statistics on the reads after trimming [default: BASENAME.settings]
  --output1 FILE                     Output file containing trimmed mate1 reads [default: BASENAME.pair1.truncated (PE), BASENAME.truncated (SE), or BASENAME.paired.truncated (interleaved PE)]
  --output2 FILE                     Output file containing trimmed mate 2 reads [default: BASENAME.pair2.truncated (only used in PE mode, but not if --interleaved-output is enabled)]
  --singleton FILE                   Output file to which containing paired reads for which the mate has been discarded [default: BASENAME.singleton.truncated]
  --outputcollapsed FILE             If --collapsed is set, contains overlapping mate-pairs which have been merged into a single read (PE mode) or reads for which the adapter was identified by a minimum overlap, indicating that the entire template molecule is present. This does not include which have subsequently been trimmed due to low-quality or ambiguous nucleotides [default: BASENAME.collapsed]
  --outputcollapsedtruncated FILE    Collapsed reads (see --outputcollapsed) which were trimmed due the presence of low-quality or ambiguous nucleotides [default: BASENAME.collapsed.truncated]
  --discarded FILE                   Contains reads discarded due to the --minlength, --maxlength or --maxns options [default: BASENAME.discarded]

OUTPUT COMPRESSION:
  --gzip                             Enable gzip compression [current: off]
  --gzip-level LEVEL                 Compression level, 0 - 9 [current: 6]
  --bzip2                            Enable bzip2 compression [current: off]
  --bzip2-level LEVEL                Compression level, 0 - 9 [current: 9]

TRIMMING SETTINGS:
  --adapter1 SEQUENCE                Adapter sequence expected to be found in mate 1 reads [current: AGATCGGAAGAGCACACGTCTGAACTCCAGTCACNNNNNNATCTCGTATGCCGTCTTCTGCTTG].
  --adapter2 SEQUENCE                Adapter sequence expected to be found in mate 2 reads [current: AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT].
  --adapter-list FILENAME            Read table of white-space separated adapters pairs, used as if the first column was supplied to --adapter1, and the second column was supplied to --adapter2; only the first adapter in each pair is required SE trimming mode [current:<not set>].

  --mm MISMATCH_RATE                 Max error-rate when aligning reads and/or adapters. If > 1, the max error-rate is set to 1 / MISMATCH_RATE; if < 0, the defaults are used, otherwise the user-supplied value is used directly. [defaults: 1/3 for trimming; 1/10 when identifing adapters].
  --maxns MAX                        Reads containing more ambiguous bases (N) than this number after trimming are discarded [current: 1000].
  --shift N                          Consider alignments where up to N nucleotides are missing from the 5' termini [current: 2].

  --trimns                           If set, trim ambiguous bases (N) at 5'/3' termini [current: off]
  --trimqualities                    If set, trim bases at 5'/3' termini with quality scores <= to --minquality value [current: off]
  --minquality PHRED                 Inclusive minimum; see --trimqualities for details [current: 2]
  --minlength LENGTH                 Reads shorter than this length are discarded following trimming [current: 15].
  --maxlength LENGTH                 Reads longer than this length are discarded following trimming [current:4294967295].
  --collapse                         When set, paired ended read alignments of --minalignmentlength or more bases are combined into a single consensus sequence, representing the complete insert,and written to either basename.collapsed or basename.collapsed.truncated (if trimmed due to low-quality bases following collapse); for single-ended reads,putative complete inserts are identified as having at least --minalignmentlength bases overlap with the adapter sequence, and are written to the the same files [current: off].
  --minalignmentlength LENGTH        If --collapse is set, paired reads must overlap at least this number of bases to be collapsed, and single-ended reads must overlap at least this number of bases with the adapter to be considered complete template molecules [current:11].
  --minadapteroverlap LENGTH         In single-end mode, reads are only trimmed if the overlap between read and the adapter is at least X bases long, not counting ambiguous nucleotides (N); this is independant of the --minalignmentlength when using --collapse, allowing a conservative selection of putative complete inserts while ensuring that all possible adapter contamination is trimmed [current: 0].

DEMULTIPLEXING:
  --barcode-list FILENAME            List of barcodes or barcode pairs for single or double-indexed demultiplexing. Note that both indexes should be specified for both single-end and paired-end trimming, if double-indexed multiplexing was used, in order to ensure that the demultiplexed reads can be trimmed correctly [current: <not set>].
  --barcode-mm N                     Maximum number of mismatches allowed when counting mismatches in both the mate 1 and the mate 2 barcode for paired reads.
  --barcode-mm-r1 N                  Maximum number of mismatches allowed for the mate 1 barcode; if not set, this value is equal to the '--barcode-mm' value; cannot be higher than the '--barcode-mm value'.
  --barcode-mm-r2 N                  Maximum number of mismatches allowed for the mate 2 barcode; if not set, this value is equal to the '--barcode-mm' value; cannot be higher than the '--barcode-mm value'.

MISC:
  --identify-adapters                Attempt to identify the adapter pair of PE reads, by searching for overlapping reads [current: off].
  --seed SEED                        Sets the RNG seed used when choosing between bases with equal Phred scores when collapsing. Note that runs are not deterministic if more than one thread is used. If not specified, a seed is generated using the current time.
  --threads THREADS                  Maximum number of threads [current: 1]

Of these, the --identify-adapters option attempts to identify the adapter pair from PE reads.

The bbduk2 command is as follows:

/data1/masw/bbmap/bbduk2.sh -da in=ATW_AKOSW_2_1_D0KD1ACXX.IND12.fastq_1.gz IN2=ATW_AKOSW_2_2_D0KD1ACXX.IND12.fastq_1.gz out=ATW_AKOSW_2_1_D0KD1ACXX.IND12.fastq_2.gz out2=ATW_AKOSW_2_2_D0KD1ACXX.IND12.fastq_2.gz stats=1.2.txt k=20 minlength=40 mink=8 hdist=2 ref=/data1/masw/bbmap/resources/sequencing_artifacts.fa.gz tbo entropy=0.5 entropywindow=50 entropyk=5
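The entropy=0.5 entropywindow=50 entropyk=5 settings in this command discard low-complexity reads. To give a feel for what such a filter measures, here is a rough sketch of a k-mer Shannon entropy score over sliding windows. It is an illustration only: the normalisation and the averaging over windows are assumptions, not BBDuk2's actual implementation, and the function names are made up.

```python
import math
from collections import Counter


def window_entropy(window, k):
    """Normalised Shannon entropy (0..1) of the k-mers in one window."""
    kmers = [window[i:i + k] for i in range(len(window) - k + 1)]
    if len(kmers) < 2:
        return 0.0
    total = len(kmers)
    counts = Counter(kmers)
    h = sum(-(c / total) * math.log2(c / total) for c in counts.values())
    # Entropy is largest when every k-mer in the window is distinct.
    h_max = math.log2(min(total, 4 ** k))
    return h / h_max


def read_entropy(seq, window=50, k=5):
    """Mean window entropy over a read; short reads use a single window."""
    if len(seq) <= window:
        return window_entropy(seq, k)
    scores = [window_entropy(seq[i:i + window], k)
              for i in range(len(seq) - window + 1)]
    return sum(scores) / len(scores)
```

Under this sketch a homopolymer such as "A" * 60 scores 0.0 and a short tandem repeat such as "ACGT" * 20 scores well below 0.5, so both would fall under a threshold of 0.5.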

Likewise, typing /data1/masw/bbmap/bbduk2.sh in a terminal shows the detailed parameters:

Written by Brian Bushnell
Last modified June 27, 2016

BBDuk2 is like BBDuk but can kfilter, kmask, and ktrim in a single pass.
It does not replace BBDuk, and is only provided to allow maximally efficient
pipeline integration when multiple steps will be performed.  The syntax is 
slightly different.

Description:  Compares reads to the kmers in a reference dataset, optionally 
allowing an edit distance. Splits the reads into two outputs - those that 
match the reference, and those that don't. Can also trim (remove) the matching 
parts of the reads rather than binning the reads.

Usage:  bbduk2.sh in=<file> out=<file> fref=

Input may be stdin or a fasta or fastq file, compressed or uncompressed.
If you pipe via stdin/stdout, please include the file type; e.g. for gzipped 
fasta input, set in=stdin.fa.gz


Input parameters:
in=<file>           Main input. in=stdin.fq will pipe from stdin.
in2=<file>          Input for 2nd read of pairs in a different file.
fref=<file,file>    Comma-delimited list of fasta reference files for filtering.
rref=<file,file>    Comma-delimited list of fasta reference files for right-trimming.
lref=<file,file>    Comma-delimited list of fasta reference files for left-trimming.
mref=<file,file>    Comma-delimited list of fasta reference files for masking.
fliteral=<seq,seq>  Comma-delimited list of literal sequences for filtering.
rliteral=<seq,seq>  Comma-delimited list of literal sequences for right-trimming.
lliteral=<seq,seq>  Comma-delimited list of literal sequences for left-trimming.
mliteral=<seq,seq>  Comma-delimited list of literal sequences for masking.
touppercase=f       (tuc) Change all bases upper-case.
interleaved=auto    (int) t/f overrides interleaved autodetection.
qin=auto            Input quality offset: 33 (Sanger), 64, or auto.
reads=-1            If positive, quit after processing X reads or pairs.
copyundefined=f     (cu) Process non-AGCT IUPAC reference bases by making all
                    possible unambiguous copies.  Intended for short motifs
                    or adapter barcodes, as time/memory use is exponential.

Output parameters:
out=<file>          (outnonmatch) Write reads here that do not contain 
                    kmers matching the database.  'out=stdout.fq' will pipe 
                    to standard out.
out2=<file>         (outnonmatch2) Use this to write 2nd read of pairs to a 
                    different file.
outm=<file>         (outmatch) Write reads here that contain kmers matching
                    the database.
outm2=<file>        (outmatch2) Use this to write 2nd read of pairs to a 
                    different file.
outs=<file>         (outsingle) Use this to write singleton reads whose mate 
                    was trimmed shorter than minlen.
stats=<file>        Write statistics about which contamininants were detected.
refstats=<file>     Write statistics on a per-reference-file basis.
rpkm=<file>         Write RPKM for each reference sequence (for RNA-seq).
dump=<file>         Dump kmer tables to a file, in fasta format.
nzo=t               Only write statistics about ref sequences with nonzero hits.
overwrite=t         (ow) Grant permission to overwrite files.
showspeed=t         (ss) 'f' suppresses display of processing speed.
ziplevel=2          (zl) Compression level; 1 (min) through 9 (max).
fastawrap=80        Length of lines in fasta output.
qout=auto           Output quality offset: 33 (Sanger), 64, or auto.
statscolumns=3      (cols) Number of columns for stats output, 3 or 5.
                    5 includes base counts.
rename=f            Rename reads to indicate which sequences they matched.
refnames=f          Use names of reference files rather than scaffold IDs.
trd=f               Truncate read and ref names at the first whitespace.
ordered=f           Set to true to output reads in same order as input.

Histogram output parameters:
bhist=<file>        Base composition histogram by position.
qhist=<file>        Quality histogram by position.
qchist=<file>       Count of bases with each quality value.
aqhist=<file>       Histogram of average read quality.
bqhist=<file>       Quality histogram designed for box plots.
lhist=<file>        Read length histogram.
gchist=<file>       Read GC content histogram.
gcbins=100          Number gchist bins.  Set to 'auto' to use read length.

Histograms for sam files only (requires sam format 1.4 or higher):

ehist=<file>        Errors-per-read histogram.
qahist=<file>       Quality accuracy histogram of error rates versus quality 
                    score.
indelhist=<file>    Indel length histogram.
mhist=<file>        Histogram of match, sub, del, and ins rates by read location.
idhist=<file>       Histogram of read count versus percent identity.
idbins=100          Number idhist bins.  Set to 'auto' to use read length.

Processing parameters:
k=27                Kmer length used for finding contaminants.  Contaminants 
                    shorter than k will not be found.  k must be at least 1.
rcomp=t             Look for reverse-complements of kmers in addition to 
                    forward kmers.
maskmiddle=t        (mm) Treat the middle base of a kmer as a wildcard, to 
                    increase sensitivity in the presence of errors.
minkmerhits=1       (mkh) Reads need at least this many matching kmers 
                    to be considered as matching the reference.
hammingdistance=0   (hdist) Maximum Hamming distance for ref kmers (subs only).
                    Memory use is proportional to (3*K)^hdist.
qhdist=0            Hamming distance for query kmers; impacts speed, not memory.
editdistance=0      (edist) Maximum edit distance from ref kmers (subs 
                    and indels).  Memory use is proportional to (8*K)^edist.
hammingdistance2=0  (hdist2) Sets hdist for short kmers, when using mink.
qhdist2=0           Sets qhdist for short kmers, when using mink.
editdistance2=0     (edist2) Sets edist for short kmers, when using mink.
forbidn=f           (fn) Forbids matching of read kmers containing N.
                    By default, these will match a reference 'A' if 
                    hdist>0 or edist>0, to increase sensitivity.
removeifeitherbad=t (rieb) Paired reads get sent to 'outmatch' if either is 
                    match (or either is trimmed shorter than minlen).  
                    Set to false to require both.
findbestmatch=f     (fbm) If multiple matches, associate read with sequence 
                    sharing most kmers.  Reduces speed.
skipr1=f            Don't do kmer-based operations on read 1.
skipr2=f            Don't do kmer-based operations on read 2.
ecco=f              For overlapping paired reads only.  Performs error-
                    correction with BBMerge prior to kmer operations.
recalibrate=f       (recal) Recalibrate quality scores.  Requires calibration
                    matrices generated by CalcTrueQuality.
sam=<file,file>     If recalibration is desired, and matrices have not already
                    been generated, BBDuk will create them from the sam file.

Speed and Memory parameters:
threads=auto        (t) Set number of threads to use; default is number of 
                    logical processors.
prealloc=f          Preallocate memory in table.  Allows faster table loading 
                    and more efficient memory usage, for a large reference.
monitor=f           Kill this process if it crashes.  monitor=600,0.01 would 
                    kill after 600 seconds under 1% usage.
minrskip=1          (mns) Force minimal skip interval when indexing reference 
                    kmers.  1 means use all, 2 means use every other kmer, etc.
maxrskip=1          (mxs) Restrict maximal skip interval when indexing 
                    reference kmers. Normally all are used for scaffolds<100kb, 
                    but with longer scaffolds, up to maxrskip-1 are skipped.
rskip=              Set both minrskip and maxrskip to the same value.
                    If not set, rskip will vary based on sequence length.
qskip=1             Skip query kmers to increase speed.  1 means use all.
speed=0             Ignore this fraction of kmer space (0-15 out of 16) in both
                    reads and reference.  Increases speed and reduces memory.
Note: Do not use more than one of 'speed', 'qskip', and 'rskip'.

Trimming/Filtering/Masking parameters:
Note - for BBDuk2, kmer filtering, trimming, and masking are independent,
and all can be performed at the same time.

ktrim=f             Trim reads to remove bases matching reference kmers.
                    Values: 
                            f (don't trim), 
                            r (trim to the right), 
                            l (trim to the left)
kmask=f             Replace bases matching ref kmers with another symbol.
                    Allows any non-whitespace character other than t or f,
                    and processes short kmers on both ends.  'kmask=lc' will
                    convert masked bases to lowercase.
mink=0              Look for shorter kmers at read tips down to this length, 
                    when k-trimming or masking.  0 means disabled.  Enabling
                    this will disable maskmiddle.
qtrim=f             Trim read ends to remove bases with quality below trimq.
                    Performed AFTER looking for kmers.
                    Values: 
                            rl (trim both ends), 
                            f (neither end), 
                            r (right end only), 
                            l (left end only),
                            w (sliding window)
trimq=6             Regions with average quality BELOW this will be trimmed.
minlength=10        (ml) Reads shorter than this after trimming will be 
                    discarded.  Pairs will be discarded if both are shorter.
mlf=0               (minlengthfraction) Reads shorter than this fraction of 
                    original length after trimming will be discarded.
maxlength=          Reads longer than this after trimming will be discarded.
                    Pairs will be discarded only if both are longer.
minavgquality=0     (maq) Reads with average quality (after trimming) below 
                    this will be discarded.
maqb=0              If positive, calculate maq from this many initial bases.
chastityfilter=f    (cf) Discard reads with id containing ' 1:Y:' or ' 2:Y:'.
barcodefilter=f     Remove reads with unexpected barcodes if barcodes is set,
                    or barcodes containing 'N' otherwise.  A barcode must be
                    the last part of the read header.
barcodes=           Comma-delimited list of barcodes or files of barcodes.
maxns=-1            If non-negative, reads with more Ns than this 
                    (after trimming) will be discarded.
mcb=0               (minconsecutivebases) Discard reads without at least 
                    this many consecutive called bases.
ottm=f              (outputtrimmedtomatch) Output reads trimmed to shorter 
                    than minlength to outm rather than discarding.
tp=0                (trimpad) Trim this much extra around matching kmers.
tbo=f               (trimbyoverlap) Trim adapters based on where paired 
                    reads overlap.
strictoverlap=t     Adjust sensitivity for trimbyoverlap mode.
minoverlap=14       Require this many bases of overlap for detection.
mininsert=50        Require insert size of at least this for overlap. 
                    Should be reduced to 16 for small RNA sequencing.
tpe=f               (trimpairsevenly) When kmer right-trimming, trim both 
                    reads to the minimum length of either.
forcetrimleft=0     (ftl) If positive, trim bases to the left of this position
                    (exclusive, 0-based).
forcetrimright=0    (ftr) If positive, trim bases to the right of this position
                    (exclusive, 0-based).
forcetrimright2=0   (ftr2) If positive, trim this many bases on the right end.
forcetrimmod=0      (ftm) If positive, right-trim length to be equal to zero,
                    modulo this number.
restrictleft=0      If positive, only look for kmer matches in the 
                    leftmost X bases.
restrictright=0     If positive, only look for kmer matches in the 
                    rightmost X bases.
mingc=0             Discard reads with GC content below this.
maxgc=1             Discard reads with GC content above this.
gcpairs=t           Use average GC of paired reads.
                    Also affects gchist.

Entropy/Complexity parameters:
entropy=-1          Set between 0 and 1 to filter reads with entropy below
                    that value.  Higher is more stringent.
entropywindow=50    Calculate entropy using a sliding window of this length.
entropyk=5          Calculate entropy using kmers of this length.
minbasefrequency=0  Discard reads with a minimum base frequency below this.

Cardinality estimation:
cardinality=f           (loglog) Count unique kmers using the LogLog algorithm.
loglogk=31              Use this kmer length for counting.
loglogbuckets=1999      Use this many buckets for counting.

Java Parameters:

-Xmx                This will be passed to Java to set memory usage, overriding 
                    the program's automatic memory detection. -Xmx20g will 
                    specify 20 gigs of RAM, and -Xmx200m will specify 200 megs.  
                    The max is typically 85% of physical memory.

There is a changelog at /bbmap/docs/changelog_bbduk.txt
Please contact Brian Bushnell at [email protected] if you encounter any problems.

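After adapter removal, check whether the mapping rate has improved; under normal circumstances it should be above 80%. If the mapping rate is still very low, consider whether the sample itself has a quality problem, since that may affect the accuracy of the results.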
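The 80% check can be automated once the aligner's summary has been captured. The sketch below assumes the summary printed by hisat2 on stderr was saved as text and contains a line of the form "NN.NN% overall alignment rate"; the function names and the sample names in the usage note are hypothetical.

```python
import re

RATE_RE = re.compile(r"([\d.]+)% overall alignment rate")


def overall_alignment_rate(summary_text):
    """Return the overall alignment rate in percent, or None if it is absent."""
    match = RATE_RE.search(summary_text)
    return float(match.group(1)) if match else None


def flag_low_mapping(rates, threshold=80.0):
    """Return the names of samples whose mapping rate is below the threshold."""
    return sorted(name for name, rate in rates.items()
                  if rate is not None and rate < threshold)
```

For example, flag_low_mapping({"leaf": 96.7, "root": 42.0}) returns ["root"], marking that sample for a closer look.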

Indexing the genome

hisat2-build-l -p 20 ./IWGSC_v1.0/Wheat_IWGSC_WGA_v1.0_pseudomolecules/161010_Chinese_Spring_v1.0_pseudomolecules.fasta IWGSCv1.0_hiast2

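Aligning reads to the genome

This step uses hisat2, which aligns very quickly and needs relatively few resources, but the reference genome must be indexed first. The mapping is run with the following script: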

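#!/usr/bin/env python3
# -*- coding: utf-8 -*-
# Run hisat2 on every sample listed in hisat2_list.txt.
# Each line holds four whitespace-separated fields:
#   mate-1 FASTQ, mate-2 FASTQ, novel splice-site output file, SAM output file.

import subprocess


with open('hisat2_list.txt', 'r') as f:
    for line in f:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        input1, input2, output1, output2 = fields
        print(input1, input2)
        # subprocess.call starts hisat2 and waits for it to finish.
        subprocess.call(['hisat2', '-p', '20', '--dta', '-x', '../NRGenome_hisat2/NRGenome',
                         '--known-splicesite-infile', '../annotation/1.ss',
                         '--novel-splicesite-infile', 'all.ss',
                         '--novel-splicesite-outfile', output1,
                         '-t', '-1', input1, '-2', input2, '-S', output2])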

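The next step is to filter the SAM output, for example keeping only reads with a single hit or reads that match perfectly. Anyone familiar with the SAM format can do this kind of filtering easily, so I will not go into detail here. The full hisat2 parameter list is given below: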

No index, query, or output file specified!
HISAT2 version 2.0.4 by Daehwan Kim ([email protected], www.ccb.jhu.edu/people/infphilo)
Usage: 
  hisat2 [options]* -x <ht2-idx> {-1 <m1> -2 <m2> | -U <r> | --sra-acc <SRA accession number>} [-S <sam>]

  <ht2-idx>  Index filename prefix (minus trailing .X.ht2).
  <m1>       Files with #1 mates, paired with files in <m2>.
             Could be gzip'ed (extension: .gz) or bzip2'ed (extension: .bz2).
  <m2>       Files with #2 mates, paired with files in <m1>.
             Could be gzip'ed (extension: .gz) or bzip2'ed (extension: .bz2).
  <r>        Files with unpaired reads.
             Could be gzip'ed (extension: .gz) or bzip2'ed (extension: .bz2).
  <SRA accession number>        Comma-separated list of SRA accession numbers, e.g. --sra-acc SRR353653,SRR353654.
  <sam>      File for SAM output (default: stdout)

  <m1>, <m2>, <r> can be comma-separated lists (no whitespace) and can be
  specified many times.  E.g. '-U file1.fq,file2.fq -U file3.fq'.

Options (defaults in parentheses):

 Input:
  -q                 query input files are FASTQ .fq/.fastq (default)
  --qseq             query input files are in Illumina's qseq format
  -f                 query input files are (multi-)FASTA .fa/.mfa
  -r                 query input files are raw one-sequence-per-line
  -c                 <m1>, <m2>, <r> are sequences themselves, not files
  -s/--skip <int>    skip the first <int> reads/pairs in the input (none)
  -u/--upto <int>    stop after first <int> reads/pairs (no limit)
  -5/--trim5 <int>   trim <int> bases from 5'/left end of reads (0)
  -3/--trim3 <int>   trim <int> bases from 3'/right end of reads (0)
  --phred33          qualities are Phred+33 (default)
  --phred64          qualities are Phred+64
  --int-quals        qualities encoded as space-delimited integers
  --sra-acc          SRA accession ID

 Alignment:
  --n-ceil     func for max # non-A/C/G/Ts permitted in aln (L,0,0.15)
  --ignore-quals     treat all quality values as 30 on Phred scale (off)
  --nofw             do not align forward (original) version of read (off)
  --norc             do not align reverse-complement version of read (off)

 Spliced Alignment:
  --pen-cansplice               penalty for a canonical splice site (0)
  --pen-noncansplice            penalty for a non-canonical splice site (12)
  --pen-canintronlen           penalty for long introns (G,-8,1) with canonical splice sites
  --pen-noncanintronlen        penalty for long introns (G,-8,1) with noncanonical splice sites
  --min-intronlen               minimum intron length (20)
  --max-intronlen               maximum intron length (500000)
  --known-splicesite-infile    provide a list of known splice sites
  --novel-splicesite-outfile   report a list of splice sites
  --novel-splicesite-infile    provide a list of novel splice sites
  --no-temp-splicesite               disable the use of splice sites found
  --no-spliced-alignment             disable spliced alignment
  --rna-strandness           Specify strand-specific information (unstranded)
  --tmo                              Reports only those alignments within known transcriptome
  --dta                              Reports alignments tailored for transcript assemblers
  --dta-cufflinks                    Reports alignments tailored specifically for cufflinks

 Scoring:
  --ma          match bonus (0 for --end-to-end, 2 for --local) 
  --mp ,   max and min penalties for mismatch; lower qual = lower penalty <2,6>
  --sp ,   max and min penalties for soft-clipping; lower qual = lower penalty <1,2>
  --np          penalty for non-A/C/G/Ts in read/ref (1)
  --rdg ,  read gap open, extend penalties (5,3)
  --rfg ,  reference gap open, extend penalties (5,3)
  --score-min  min acceptable alignment score w/r/t read length
                     (L,0.0,-0.2)

 Reporting:
  (default)          look for multiple alignments, report best, with MAPQ
   OR
  -k            report up to  alns per read; MAPQ not meaningful
   OR
  -a/--all           report all alignments; very slow, MAPQ not meaningful

 Paired-end:
  --fr/--rf/--ff     -1, -2 mates align fw/rev, rev/fw, fw/fw (--fr)
  --no-mixed         suppress unpaired alignments for paired reads
  --no-discordant    suppress discordant alignments for paired reads

 Output:
  -t/--time          print wall-clock time taken by search phases
  --un            write unpaired reads that didn't align to 
  --al            write unpaired reads that aligned at least once to 
  --un-conc       write pairs that didn't align concordantly to 
  --al-conc       write pairs that aligned concordantly at least once to 
  (Note: for --un, --al, --un-conc, or --al-conc, add '-gz' to the option name, e.g.
  --un-gz , to gzip compress output, or add '-bz2' to bzip2 compress output.)
  --quiet            print nothing to stderr except serious errors
  --met-file   send metrics to file at  (off)
  --met-stderr       send metrics to stderr (off)
  --met         report internal counters & metrics every  secs (1)
  --no-head          supppress header lines, i.e. lines starting with @
  --no-sq            supppress @SQ header lines
  --rg-id      set read group id, reflected in @RG line and RG:Z: opt field
  --rg         add  ("lab:value") to @RG line of SAM header.
                     Note: @RG line only printed when --rg-id is set.
  --omit-sec-seq     put '*' in SEQ and QUAL fields for secondary alignments.

 Performance:
  -o/--offrate  override offrate of index; must be >= index's offrate
  -p/--threads  number of alignment threads to launch (1)
  --reorder          force SAM output order to match order of input reads
  --mm               use memory-mapped I/O for index; many 'bowtie's can share

 Other:
  --qc-filter        filter out reads that are bad according to QSEQ filter
  --seed        seed for random number generator (0)
  --non-deterministic seed rand. gen. arbitrarily instead of using read attributes
  --remove-chrname   remove 'chr' from reference names in alignment
  --add-chrname      add 'chr' to reference names in alignment 
  --version          print version information and quit
  -h/--help          print this usage message
(ERR): hisat2-align exited with value 1

Here I need to keep only the perfectly matching reads, filtered as follows:

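#!/usr/bin/env python3
# -*- coding: utf-8 -*-
# For every SAM file listed in sam_file.txt, keep the header lines and the
# alignments tagged NM:i:0 (edit distance 0).

import subprocess


with open('sam_file.txt', 'r') as f:
    for line in f:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        print(line)
        # '^@' matches header lines only; a bare '@' would also match any read
        # whose quality string contains '@'.
        subprocess.call('grep -E "^@|NM:i:0" ' + line + ' > ' + line[:-3] + 'perfectmatch.sam', shell=True)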

Note that this filtering step overlooks the case of insertions and deletions.
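A field-aware filter is one way to be stricter than a plain text match. The sketch below keeps header lines plus alignments that carry the NM:i:0 tag and have no I or D operation in the CIGAR string, and can optionally require NH:i:1 for single-hit reads. It assumes tab-separated SAM records and that the aligner writes the NM and NH tags; the function names are hypothetical.

```python
import re

INDEL_RE = re.compile(r"\d+[ID]")


def is_perfect_match(sam_line, require_unique=False):
    """True if an alignment has NM:i:0 and no I/D operation in its CIGAR."""
    fields = sam_line.rstrip("\n").split("\t")
    if len(fields) < 11:  # a SAM record has 11 mandatory fields
        return False
    cigar, tags = fields[5], set(fields[11:])
    if "NM:i:0" not in tags or INDEL_RE.search(cigar):
        return False
    # NH:i:1 means the read was reported with a single alignment.
    return "NH:i:1" in tags if require_unique else True


def filter_sam(lines, require_unique=False):
    """Yield header lines unchanged plus the alignments that pass the filter."""
    for line in lines:
        if line.startswith("@") or is_perfect_match(line, require_unique):
            yield line
```

With two open file handles src and dst, dst.writelines(filter_sam(src)) writes the filtered file. Because the tags are compared as whole fields, an '@' inside a quality string cannot let a read through by accident.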

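With the SAM files we could assemble transcripts, but the aim of this study is to measure the expression of transcripts of given genes, so that step is not required. For how to assemble transcripts, see the paper "Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown".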

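Counting reads per transcript

Expression is quantified from the position of each gene on the reference genome. Gene positions are usually stored in gff3 or gtf format and can be obtained with tools such as blastn or gmap (note that the exon-intron structure must be accurate). The gff3 format looks like this: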

chr1A   NRGenome    mRNA    5946352 5946999 .   -   .   ID=UN044011.mrna1;Name=UN044011;Parent=UN044011.path1;coverage=100.0;identity=100.0;matches=648;mismatches=0;indels=0;unknowns=0    
chr1A   NRGenome    exon    5946352 5946999 100 -   .   ID=UN044011.mrna1.exon1;Name=UN044011;Parent=UN044011.mrna1;Target=UN044011 1   648 +   
chr1A   NRGenome    mRNA    9968301 9968632 .   +   .   ID=UN080299.mrna1;Name=UN080299;Parent=UN080299.path1;coverage=100.0;identity=100.0;matches=213;mismatches=0;indels=0;unknowns=0    
chr1A   NRGenome    exon    9968301 9968396 100 +   .   ID=UN080299.mrna1.exon1;Name=UN080299;Parent=UN080299.mrna1;Target=UN080299 1   96  +   
chr1A   NRGenome    exon    9968516 9968632 100 +   .   ID=UN080299.mrna1.exon2;Name=UN080299;Parent=UN080299.mrna1;Target=UN080299 97  213 +   
chr1A   NRGenome    mRNA    12807377    12808514    .   -   .   ID=UN129475.mrna1;Name=UN129475;Parent=UN129475.path1;coverage=100.0;identity=100.0;matches=156;mismatches=0;indels=0;unknowns=0    
chr1A   NRGenome    exon    12808501    12808514    100 -   .   ID=UN129475.mrna1.exon1;Name=UN129475;Parent=UN129475.mrna1;Target=UN129475 1   14  +   
chr1A   NRGenome    exon    12807377    12807518    100 -   .   ID=UN129475.mrna1.exon2;Name=UN129475;Parent=UN129475.mrna1;Target=UN129475 15  156 +
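As a quick sanity check on an annotation like the one above, the exon lengths per gene can be summed and compared with the `matches` value on the mRNA line. The following is a minimal sketch, assuming tab-separated GFF3 columns and non-overlapping exons; the helper names `parse_attributes` and `exon_lengths_by_name` are illustrative and not part of any tool used here.

```python
from collections import defaultdict


def parse_attributes(column9):
    """Split a GFF3 attribute column ('key=value;key=value') into a dict."""
    attrs = {}
    for item in column9.strip().split(';'):
        if '=' in item:
            key, value = item.split('=', 1)
            attrs[key] = value
    return attrs


def exon_lengths_by_name(gff3_lines, feature_type='exon', key='Name'):
    """Sum the lengths of all features of one type, grouped by an attribute."""
    lengths = defaultdict(int)
    for line in gff3_lines:
        if not line.strip() or line.startswith('#'):
            continue  # skip blank lines and comments
        cols = line.rstrip('\n').split('\t')
        if len(cols) < 9 or cols[2] != feature_type:
            continue
        start, end = int(cols[3]), int(cols[4])
        name = parse_attributes(cols[8]).get(key)
        if name is not None:
            lengths[name] += end - start + 1  # GFF3 coordinates are 1-based, inclusive
    return dict(lengths)


example = [
    "chr1A\tNRGenome\tmRNA\t9968301\t9968632\t.\t+\t.\tID=UN080299.mrna1;Name=UN080299",
    "chr1A\tNRGenome\texon\t9968301\t9968396\t100\t+\t.\tID=UN080299.mrna1.exon1;Name=UN080299",
    "chr1A\tNRGenome\texon\t9968516\t9968632\t100\t+\t.\tID=UN080299.mrna1.exon2;Name=UN080299",
]
print(exon_lengths_by_name(example))  # {'UN080299': 213}
```

Grouping by `Name` mirrors the `-t exon -g Name` choice in the featureCounts command, so the summed length should correspond to the `Length` column of its output when exons do not overlap.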

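With the position information available, use featureCounts to compute the expression counts. Only unique reads are counted here. The command is as follows (for each run, only the input gene position file and the output file need to change):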

featureCounts -T 20 -t exon -g Name --readExtension5 70  --readExtension3 70 -p --donotsort -C -a ../Triticum_aestivum.TGACv1.cds.1.gff3 -o TGAC_unique_in_expression.txt ATW_AOSW_1.perfectmatch.sam ATW_AAOSW_6.perfectmatch.sam ATW_ANOSW_1.perfectmatch.sam ATW_LOSW_5.perfectmatch.sam ATW_ADOSW_1.perfectmatch.sam ATW_AEOSW_1.perfectmatch.sam ATW_DOSW_2.perfectmatch.sam ATW_POSW_6.perfectmatch.sam ATW_IOSW_4.perfectmatch.sam ATW_KOSW_4.perfectmatch.sam ATW_ROSW_7.perfectmatch.sam ATW_ALOSW_3.perfectmatch.sam ATW_TOSW_8.perfectmatch.sam ATW_VOSW_6.perfectmatch.sam ATW_MOSW_5.perfectmatch.sam ATW_NOSW_6.perfectmatch.sam ATW_COSW_1.perfectmatch.sam ATW_AGOSW_2.perfectmatch.sam ATW_GOSW_3.perfectmatch.sam ATW_HOSW_3.perfectmatch.sam ATW_ABOSW_7.perfectmatch.sam ATW_ACOSW_1.perfectmatch.sam ATW_QOSW_7.perfectmatch.sam ATW_AHOSW_3.perfectmatch.sam SRR1175868.perfectmatch.sam SRR1177760.perfectmatch.sam SRR1177761.perfectmatch.sam NG-5789_1A_lib7482.perfectmatch.sam NG-5789_1B_lib7486.perfectmatch.sam NG-5789_2A_lib7483.perfectmatch.sam NG-5789_2B_lib7487.perfectmatch.sam NG-5789_3A_lib7484.perfectmatch.sam NG-5789_3B_lib7488.perfectmatch.sam NG-5789_4A_lib7485.perfectmatch.sam NG-5789_4B_lib7489.perfectmatch.sam ATW_SOSW_8.perfectmatch.sam ATW_AFOSW_2.perfectmatch.sam ATW_AIOSW_2.perfectmatch.sam ATW_AKOSW_2.perfectmatch.sam ATW_FOSW_2.perfectmatch.sam ATW_AMOSW_4.perfectmatch.sam
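With dozens of input files, typing the command by hand is error-prone. One possible alternative is to assemble the argument list in Python. This is only a sketch: it reuses exactly the flags from the command above, and the helper names `build_featurecounts_cmd` and `run_featurecounts` are hypothetical.

```python
import subprocess


def build_featurecounts_cmd(annotation, output, sam_files, threads=20, extension=70):
    """Return the featureCounts argument list used in this post for the given inputs."""
    cmd = ['featureCounts', '-T', str(threads), '-t', 'exon', '-g', 'Name',
           '--readExtension5', str(extension), '--readExtension3', str(extension),
           '-p', '--donotsort', '-C',
           '-a', annotation, '-o', output]
    return cmd + list(sam_files)


def run_featurecounts(annotation, output, sam_files):
    """Run featureCounts and raise an error if it exits with a non-zero status."""
    subprocess.check_call(build_featurecounts_cmd(annotation, output, sam_files))
```

Passing the arguments as a list (rather than one shell string) means file names need no quoting, and only `annotation` and `output` change between runs.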

As before, the full featureCounts parameters are listed here. Please look up the meaning of each parameter yourself.

Version 1.5.1

Usage: featureCounts [options] -a <annotation_file> -o <output_file> input_file1 [input_file2] ... 

## Required arguments:

  -a <string>         Name of an annotation file. GTF/GFF format by default.
                      See -F option for more formats.

  -o <string>         Name of the output file including read counts. A separate
                      file including summary statistics of counting results is
                      also included in the output (`<string>.summary')

  input_file1 [input_file2] ...   A list of SAM or BAM format files.

## Options:
# Annotation

  -F <string>         Specify format of provided annotation file. Acceptable
                      formats include `GTF/GFF' and `SAF'. `GTF/GFF' by default.
                      See Users Guide for description of SAF format.

  -t <string>         Specify feature type in GTF annotation. `exon' by 
                      default. Features used for read counting will be 
                      extracted from annotation using the provided value.

  -g <string>         Specify attribute type in GTF annotation. `gene_id' by 
                      default. Meta-features used for read counting will be 
                      extracted from annotation using the provided value.

  -A <string>         Provide a chromosome name alias file to match chr names in
                      annotation with those in the reads. This should be a two-
                      column comma-delimited text file. Its first column should
                      include chr names in the annotation and its second column
                      should include chr names in the reads. Chr names are case
                      sensitive. No column header should be included in the
                      file.

# Level of summarization

  -f                  Perform read counting at feature level (eg. counting 
                      reads for exons rather than genes).

# Overlap between reads and features

  -O                  Assign reads to all their overlapping meta-features (or 
                      features if -f is specified).

  --minOverlap <int>  Minimum number of overlapping bases in a read that is
                      required for read assignment. 1 by default. Number of
                      overlapping bases is counted from both reads if paired
                      end. If a negative value is provided, then a gap of up
                      to specified size will be allowed between read and the
                      feature that the read is assigned to.

  --fracOverlap <float> Minimum fraction of overlapping bases in a read that is
                      required for read assignment. Value should be within range
                      [0,1]. 0 by default. Number of overlapping bases is
                      counted from both reads if paired end. Both this option
                      and '--minOverlap' option need to be satisfied for read
                      assignment.

  --largestOverlap    Assign reads to a meta-feature/feature that has the 
                      largest number of overlapping bases.

  --readExtension5 <int> Reads are extended upstream by <int> bases from their
                      5' end.

  --readExtension3 <int> Reads are extended upstream by <int> bases from their
                      3' end.

  --read2pos <5:3>    Reduce reads to their 5' most base or 3' most base. Read
                      counting is then performed based on the single base the 
                      read is reduced to.

# Multi-mapping reads

  -M                  Multi-mapping reads will also be counted. For a multi-
                      mapping read, all its reported alignments will be 
                      counted. The `NH' tag in BAM/SAM input is used to detect 
                      multi-mapping reads.

# Fractional counting

  --fraction          Assign fractional counts to features. This option must
                      be used together with '-M' or '-O' or both. When '-M' is
                      specified, each reported alignment from a multi-mapping
                      read (identified via 'NH' tag) will carry a fractional
                      count of 1/x, instead of 1 (one), where x is the total
                      number of alignments reported for the same read. When '-O'
                      is specified, each overlapping feature will receive a
                      fractional count of 1/y, where y is the total number of
                      features overlapping with the read. When both '-M' and
                      '-O' are specified, each alignment will carry a fraction
                      count of 1/(x*y).

# Read filtering

  -Q <int>            The minimum mapping quality score a read must satisfy in
                      order to be counted. For paired-end reads, at least one
                      end should satisfy this criteria. 0 by default.

  --splitOnly         Count split alignments only (ie. alignments with CIGAR
                      string containing 'N'). An example of split alignments is
                      exon-spanning reads in RNA-seq data.

  --nonSplitOnly      If specified, only non-split alignments (CIGAR strings do
                      not contain letter 'N') will be counted. All the other
                      alignments will be ignored.

  --primary           Count primary alignments only. Primary alignments are 
                      identified using bit 0x100 in SAM/BAM FLAG field.

  --ignoreDup         Ignore duplicate reads in read counting. Duplicate reads 
                      are identified using bit Ox400 in BAM/SAM FLAG field. The 
                      whole read pair is ignored if one of the reads is a 
                      duplicate read for paired end data.

# Strandness

  -s <int>            Perform strand-specific read counting. Acceptable values:
                      0 (unstranded), 1 (stranded) and 2 (reversely stranded).
                      0 by default.

# Exon-exon junctions

  -J                  Count number of reads supporting each exon-exon junction.
                      Junctions were identified from those exon-spanning reads
                      in the input (containing 'N' in CIGAR string). Counting
                      results are saved to a file named '<output_file>.jcounts'

  -G <string>         Provide the name of a FASTA-format file that contains the
                      reference sequences used in read mapping that produced the
                      provided SAM/BAM files. This optional argument can be used
                      with '-J' option to improve read counting for junctions.

# Parameters specific to paired end reads

  -p                  If specified, fragments (or templates) will be counted
                      instead of reads. This option is only applicable for
                      paired-end reads.

  -B                  Count read pairs that have both ends successfully aligned 
                      only.

  -P                  Check validity of paired-end distance when counting read 
                      pairs. Use -d and -D to set thresholds.

  -d <int>            Minimum fragment/template length, 50 by default.

  -D <int>            Maximum fragment/template length, 600 by default.

  -C                  Do not count read pairs that have their two ends mapping 
                      to different chromosomes or mapping to same chromosome 
                      but on different strands.

  --donotsort         Do not sort reads in BAM/SAM input. Note that reads from 
                      the same pair are required to be located next to each 
                      other in the input.

# Number of CPU threads

  -T <int>            Number of the threads. 1 by default.

# Miscellaneous

  -R                  Output detailed assignment result for each read. A text 
                      file will be generated for each input file, including 
                      names of reads and meta-features/features reads were 
                      assigned to. See Users Guide for more details.

  --tmpDir <string>   Directory under which intermediate files are saved (later
                      removed). By default, intermediate files will be saved to
                      the directory specified in '-o' argument.

  --maxMOp <int>      Maximum number of 'M' operations allowed in a CIGAR
                      string. 10 by default. Both 'X' and '=' are treated as 'M'
                      and adjacent 'M' operations are merged in the CIGAR
                      string.

  -v                  Output version of the program.

Normalizing expression levels

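FPKM is used here because my data are paired-end (PE); for single-end data, RPKM can be used instead. I wrote a Python script to do the calculation; anyone else using it will need to modify it.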

#!/usr/bin/env python
# -*- coding: utf-8 -*- 
__author__ = 'shengwei ma'
__author_email__ = '[email protected]'

import numpy as np

raw_total = [('root_Z10_rep1', 49168553), ('root_Z10_rep2', 44047402), ('root_Z13_rep1', 78098556),
             ('root_Z13_rep2', 38474362), ('root_Z39_rep1', 79981030), ('root_Z39_rep2', 41041508),
             ('stem_Z30_rep1', 46935246), ('stem_Z30_rep2', 38803969), ('stem_Z32_rep1', 51627704),
             ('stem_Z32_rep2', 37219517), ('stem_Z65_rep1', 39849949), ('stem_Z65_rep2', 40299574),
             ('leaf_Z10_rep1', 38168988), ('leaf_Z10_rep2', 43073693), ('leaf_Z23_rep1', 44071613),
             ('leaf_Z23_rep2', 40380776), ('leaf_Z71_rep1', 32810256), ('leaf_Z71_rep2', 35749803),
             ('spike_Z32_rep1', 46203474), ('spike_Z32_rep2', 43612313), ('spike_Z39_rep1', 40406588),
             ('spike_Z39_rep2', 47596209), ('spike_Z65_rep1', 43071042), ('spike_Z65_rep2', 48443902),
             ('carpel', 57881099), ('carpel-like structure', 63914055), ('stamen', 72275259),
             ('latent_lepto_rep1', 31693600), ('latent_lepto_rep2', 40260140), ('diplo_dia_rep1', 56486977),
             ('diplo_dia_rep2', 43990501), ('zygo_pachy_rep1', 37037924), ('zygo_pachy_rep2', 37678253),
             ('metaphaseI_rep1', 26954435), ('metaphaseI_rep2', 32180104), ('grain_Z71_rep1', 44263291),
             ('grain_Z71_rep2', 36875603), ('grain_Z75_rep1', 47740143), ('grain_Z75_rep2', 51819168),
             ('grain_Z85_rep1', 36879170), ('grain_Z85_rep2', 31412470), ('Wheat_Room1_10DPA', 16712256),
             ('Wheat_Room1_10DPA_Rep', 22819483), ('Wheat_Room2_10DPA', 27121510), ('Wheat_Room2_10DPA_Rep', 29453109),
             ('Wheat_Room1_AL_20DPA', 30598515), ('Wheat_Room1_AL_20DPA_Rep', 28518937), ('Wheat_Room2_AL_20DPA', 24838220),
             ('Wheat_Room2_AL_20DPA_Rep', 27715580), ('Wheat_Room1_AL_20DPA_Extra1', 29978007), ('Wheat_Room1_AL_20DPA_Extra2', 30079461),
             ('Wheat_Room1_SE_20DPA', 25140145), ('Wheat_Room1_SE_20DPA_Rep', 24446796), ('Wheat_Room2_SE_20DPA', 21339690),
             ('Wheat_Room2_SE_20DPA_Rep', 22815780),
             ('Wheat_Room1_TC_20DPA', 16629117), ('Wheat_Room1_TC_20DPA_Rep', 27612315), ('Wheat_Room2_TC_20DPA', 25304622),
             ('Wheat_Room2_TC_20DPA_Rep', 25352139), ('Wheat_Room1_REF_20DPA', 29929219), ('Wheat_Room1_REF_20DPA_Rep', 26636425),
             ('Wheat_Room2_REF_20DPA', 24316737), ('Wheat_Room2_REF_20DPA_Rep', 29330096), ('Wheat_Room1_SE_30DPA', 22777481),
             ('Wheat_Room1_SE_30DPA_Rep', 22777481), ('Wheat_Room2_SE_30DPA', 30513836), ('Wheat_Room2_SE_30DPA_Rep', 21486098),
             ('Wheat_Room1_AL_SE_30DPA', 28821672), ('Wheat_Room1_AL_SE_30DPA_Rep', 20134665), ('Wheat_Room2_AL_SE_30DPA', 23721856),
             ('Wheat_Room2_AL_SE_30DPA_Rep', 24896811), ('wheat_23_1', 28444918), ('wheat_23_2', 67968193),
             ('wheat_23_3', 24321425), ('wheat_4_1', 35430306), ('wheat_4_2', 22527710), ('wheat_4_3', 16848204)]


with open('MLJ_unique_expression.txt', 'r') as f:
    # Header row: six annotation columns, then the mean FPKM of each sample group,
    # then the standard deviation of each group in the same order.
    print('\t'.join(
                  ('Geneid', 'Chr', 'Start', 'End', 'Strand', 'Length', 'root_Z10', 'root_Z13','root_Z39',
                   'stem_Z30', 'stem_Z32', 'stem_Z65', 'leaf_Z10', 'leaf_Z23', 'leaf_Z71',
                   'spike_Z32', 'spike_Z39', 'spike_Z65', 'carpel', 'carpel_like_structure',
                   'stamen', 'latent_lepto', 'diplo_dia', 'zygo_pachy', 'metaphaseI',
                   'grain_Z71', 'grain_Z75', 'grain_Z85', 'Wheat_10DPA', 'Wheat_AL_20DPA',
                   'Wheat_SE_20DPA', 'Wheat_TC_20DPA', 'Wheat_REF_20DPA', 'Wheat_SE_30DPA',
                   'Wheat_AL.SE_30DPA', 'wheat_23', 'wheat_4', 'root_Z10_std', 'root_Z13_std', 'root_Z39_std',
                   'stem_Z30_std', 'stem_Z32_std', 'stem_Z65_std', 'leaf_Z10_std', 'leaf_Z23_std', 'leaf_Z71_std',
                   'spike_Z32_std', 'spike_Z39_std', 'spike_Z65_std', 'carpel_std', 'carpel-like_std', 'stamen_std',
                   'latent_lepto_std', 'diplo_dia_std', 'zygo_pachy_std', 'metaphaseI_std', 'grain_Z71_std',
                   'grain_Z75_std', 'grain_Z85_std','Wheat_10DPA_std', 'Wheat_AL_20DPA_std','Wheat_SE_20DPA_std',
                   'Wheat_TC_20DPA_std', 'Wheat_REF_20DPA_std', 'Wheat_SE_30DPA_std',
                   'Wheat_AL.SE_30DPA_std', 'wheat_23_std', 'wheat_4_std')))
    for line in f:
        if line.startswith('#') or line.startswith('Geneid'):
            pass
        else:
            new = line.strip().split('\t')
            (Geneid, Chr, Start, End, Strand, Length, root_Z10_rep1, root_Z10_rep2, root_Z13_rep1, root_Z13_rep2,
             root_Z39_rep1, root_Z39_rep2, stem_Z30_rep1, stem_Z30_rep2, stem_Z32_rep1, stem_Z32_rep2, stem_Z65_rep1,
             stem_Z65_rep2, leaf_Z10_rep1, leaf_Z10_rep2, leaf_Z23_rep1, leaf_Z23_rep2, leaf_Z71_rep1, leaf_Z71_rep2,
             spike_Z32_rep1, spike_Z32_rep2, spike_Z39_rep1, spike_Z39_rep2, spike_Z65_rep1, spike_Z65_rep2, carpel,
             carpel_like_structure, stamen, latet_lepto_rep1, latent_lepto_rep2, diplo_dia_rep1, diplo_dia_rep2,
             zygo_pachy_rep1, zygo_pachy_rep2, metaphaseI_rep1, metaphaseI_rep2, grain_Z71_rep1, grain_Z71_rep2,
             grain_Z75_rep1, grain_Z75_rep2, grain_Z85_rep1, grain_Z85_rep2, Wheat_Room1_10DPA, Wheat_Room1_10DPA_Rep,
             Wheat_Room2_10DPA, Wheat_Room2_10DPA_Rep, Wheat_Room1_AL_20DPA, Wheat_Room1_AL_20DPA_Rep,
             Wheat_Room2_AL_20DPA, Wheat_Room2_AL_20DPA_Rep, Wheat_Room1_AL_20DPA_Extra1, Wheat_Room1_AL_20DPA_Extra2,
             Wheat_Room1_SE_20DPA, Wheat_Room1_SE_20DPA_Rep, Wheat_Room2_SE_20DPA, Wheat_Room2_SE_20DPA_Rep,
             Wheat_Room1_TC_20DPA, Wheat_Room1_TC_20DPA_Rep, Wheat_Room2_TC_20DPA, Wheat_Room2_TC_20DPA_Rep,
             Wheat_Room1_REF_20DPA, Wheat_Room1_REF_20DPA_Rep, Wheat_Room2_REF_20DPA, Wheat_Room2_REF_20DPA_Rep,
             Wheat_Room1_SE_30DPA, Wheat_Room1_SE_30DPA_Rep, Wheat_Room2_SE_30DPA, Wheat_Room2_SE_30DPA_Rep,
             Wheat_Room1_AL_SE_30DPA, Wheat_Room1_AL_SE_30DPA_Rep, Wheat_Room2_AL_SE_30DPA, Wheat_Room2_AL_SE_30DPA_Rep,
             wheat_23_1, wheat_23_2, wheat_23_3, wheat_4_1, wheat_4_2, wheat_4_3) = new
            new_root_Z10_rep1 = int(root_Z10_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[0][-1]))
            new_root_Z10_rep2 = int(root_Z10_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[1][-1]))
            new_root_Z13_rep1 = int(root_Z13_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[2][-1]))
            new_root_Z13_rep2 = int(root_Z13_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[3][-1]))
            new_root_Z39_rep1 = int(root_Z39_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[4][-1]))
            new_root_Z39_rep2 = int(root_Z39_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[5][-1]))
            new_stem_Z30_rep1 = int(stem_Z30_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[6][-1]))
            new_stem_Z30_rep2 = int(stem_Z30_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[7][-1]))
            new_stem_Z32_rep1 = int(stem_Z32_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[8][-1]))
            new_stem_Z32_rep2 = int(stem_Z32_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[9][-1]))
            new_stem_Z65_rep1 = int(stem_Z65_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[10][-1]))
            new_stem_Z65_rep2 = int(stem_Z65_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[11][-1]))
            new_leaf_Z10_rep1 = int(leaf_Z10_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[12][-1]))
            new_leaf_Z10_rep2 = int(leaf_Z10_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[13][-1]))
            new_leaf_Z23_rep1 = int(leaf_Z23_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[14][-1]))
            new_leaf_Z23_rep2 = int(leaf_Z23_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[15][-1]))
            new_leaf_Z71_rep1 = int(leaf_Z71_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[16][-1]))
            new_leaf_Z71_rep2 = int(leaf_Z71_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[17][-1]))
            new_spike_Z32_rep1 = int(spike_Z32_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[18][-1]))
            new_spike_Z32_rep2 = int(spike_Z32_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[19][-1]))
            new_spike_Z39_rep1 = int(spike_Z39_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[20][-1]))
            new_spike_Z39_rep2 = int(spike_Z39_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[21][-1]))
            new_spike_Z65_rep1 = int(spike_Z65_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[22][-1]))
            new_spike_Z65_rep2 = int(spike_Z65_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[23][-1]))
            new_carpel = int(carpel) * pow(10.0, 9) / (int(Length) * int(raw_total[24][-1]))
            new_carpel_like_structure = int(carpel_like_structure) * pow(10.0, 9) / (int(Length) * int(raw_total[25][-1]))
            new_stamen = int(stamen) * pow(10.0, 9) / (int(Length) * int(raw_total[26][-1]))
            new_latet_lepto_rep1 = int(latet_lepto_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[27][-1]))
            new_latet_lepto_rep2 = int(latent_lepto_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[28][-1]))
            new_diplo_dia_rep1 = int(diplo_dia_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[29][-1]))
            new_diplo_dia_rep2 = int(diplo_dia_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[30][-1]))
            new_zygo_pachy_rep1 = int(zygo_pachy_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[31][-1]))
            new_zygo_pachy_rep2 = int(zygo_pachy_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[32][-1]))
            new_metaphaseI_rep1 = int(metaphaseI_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[33][-1]))
            new_metaphaseI_rep2 = int(metaphaseI_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[34][-1]))
            new_grain_Z71_rep1 = int(grain_Z71_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[35][-1]))
            new_grain_Z71_rep2 = int(grain_Z71_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[36][-1]))
            new_grain_Z75_rep1 = int(grain_Z75_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[37][-1]))
            new_grain_Z75_rep2 = int(grain_Z75_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[38][-1]))
            new_grain_Z85_rep1 = int(grain_Z85_rep1) * pow(10.0, 9) / (int(Length) * int(raw_total[39][-1]))
            new_grain_Z85_rep2 = int(grain_Z85_rep2) * pow(10.0, 9) / (int(Length) * int(raw_total[40][-1]))
            Wheat_Room1_10DPA = int(Wheat_Room1_10DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[41][-1]))
            Wheat_Room1_10DPA_Rep = int(Wheat_Room1_10DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[42][-1]))
            Wheat_Room2_10DPA = int(Wheat_Room2_10DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[43][-1]))
            Wheat_Room2_10DPA_Rep = int(Wheat_Room2_10DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[44][-1]))
            Wheat_Room1_AL_20DPA = int(Wheat_Room1_AL_20DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[45][-1]))
            Wheat_Room1_AL_20DPA_Rep = int(Wheat_Room1_AL_20DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[46][-1]))
            Wheat_Room2_AL_20DPA = int(Wheat_Room2_AL_20DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[47][-1]))
            Wheat_Room2_AL_20DPA_Rep = int(Wheat_Room2_AL_20DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[48][-1]))
            Wheat_Room1_AL_20DPA_Extra1 = int(Wheat_Room1_AL_20DPA_Extra1) * pow(10.0, 9) / (int(Length) * int(raw_total[49][-1]))
            Wheat_Room1_AL_20DPA_Extra2 = int(Wheat_Room1_AL_20DPA_Extra2) * pow(10.0, 9) / (int(Length) * int(raw_total[50][-1]))
            Wheat_Room1_SE_20DPA = int(Wheat_Room1_SE_20DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[51][-1]))
            Wheat_Room1_SE_20DPA_Rep = int(Wheat_Room1_SE_20DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[52][-1]))
            Wheat_Room2_SE_20DPA = int(Wheat_Room2_SE_20DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[53][-1]))
            Wheat_Room2_SE_20DPA_Rep = int(Wheat_Room2_SE_20DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[54][-1]))
            Wheat_Room1_TC_20DPA = int(Wheat_Room1_TC_20DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[55][-1]))
            Wheat_Room1_TC_20DPA_Rep = int(Wheat_Room1_TC_20DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[56][-1]))
            Wheat_Room2_TC_20DPA = int(Wheat_Room2_TC_20DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[57][-1]))
            Wheat_Room2_TC_20DPA_Rep = int(Wheat_Room2_TC_20DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[58][-1]))
            Wheat_Room1_REF_20DPA = int(Wheat_Room1_REF_20DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[59][-1]))
            Wheat_Room1_REF_20DPA_Rep = int(Wheat_Room1_REF_20DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[60][-1]))
            Wheat_Room2_REF_20DPA = int(Wheat_Room2_REF_20DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[61][-1]))
            Wheat_Room2_REF_20DPA_Rep = int(Wheat_Room2_REF_20DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[62][-1]))
            Wheat_Room1_SE_30DPA = int( Wheat_Room1_SE_30DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[63][-1]))
            Wheat_Room1_SE_30DPA_Rep = int(Wheat_Room1_SE_30DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[64][-1]))
            Wheat_Room2_SE_30DPA = int(Wheat_Room2_SE_30DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[65][-1]))
            Wheat_Room2_SE_30DPA_Rep = int(Wheat_Room2_SE_30DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[66][-1]))
            Wheat_Room1_AL_SE_30DPA = int(Wheat_Room1_AL_SE_30DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[67][-1]))
            Wheat_Room1_AL_SE_30DPA_Rep = int(Wheat_Room1_AL_SE_30DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[68][-1]))
            Wheat_Room2_AL_SE_30DPA = int(Wheat_Room2_AL_SE_30DPA) * pow(10.0, 9) / (int(Length) * int(raw_total[69][-1]))
            Wheat_Room2_AL_SE_30DPA_Rep = int(Wheat_Room2_AL_SE_30DPA_Rep) * pow(10.0, 9) / (int(Length) * int(raw_total[70][-1]))
            wheat_23_1 = int(wheat_23_1) * pow(10.0, 9) / (int(Length) * int(raw_total[71][-1]))
            wheat_23_2 = int(wheat_23_2) * pow(10.0, 9) / (int(Length) * int(raw_total[72][-1]))
            wheat_23_3 = int(wheat_23_3) * pow(10.0, 9) / (int(Length) * int(raw_total[73][-1]))
            wheat_4_1 = int(wheat_4_1) * pow(10.0, 9) / (int(Length) * int(raw_total[74][-1]))
            wheat_4_2 = int(wheat_4_2) * pow(10.0, 9) / (int(Length) * int(raw_total[75][-1]))
            wheat_4_3 = int(wheat_4_3) * pow(10.0, 9) / (int(Length) * int(raw_total[76][-1]))

            root_Z10_mean = np.mean(np.array([new_root_Z10_rep1, new_root_Z10_rep2]))
            root_Z10_std = np.std(np.array([new_root_Z10_rep1, new_root_Z10_rep2]))
            root_Z13_mean = np.mean(np.array([new_root_Z13_rep1, new_root_Z13_rep2]))
            root_Z13_std = np.std(np.array([new_root_Z13_rep1, new_root_Z13_rep2]))
            root_Z39_mean = np.mean(np.array([new_root_Z39_rep1, new_root_Z39_rep2]))
            root_Z39_std = np.std(np.array([new_root_Z39_rep1, new_root_Z39_rep2]))
            stem_Z30_mean = np.mean(np.array([new_stem_Z30_rep1, new_stem_Z30_rep2]))
            stem_Z30_std = np.std(np.array([new_stem_Z30_rep1, new_stem_Z30_rep2]))
            stem_Z32_mean = np.mean(np.array([new_stem_Z32_rep1, new_stem_Z32_rep2]))
            stem_Z32_std = np.std(np.array([new_stem_Z32_rep1, new_stem_Z32_rep2]))
            stem_Z65_mean = np.mean(np.array([new_stem_Z65_rep1, new_stem_Z65_rep2]))
            stem_Z65_std = np.std(np.array([new_stem_Z65_rep1, new_stem_Z65_rep2]))
            leaf_Z10_mean = np.mean(np.array([new_leaf_Z10_rep1, new_leaf_Z10_rep2]))
            leaf_Z10_std = np.std(np.array([new_leaf_Z10_rep1, new_leaf_Z10_rep2]))
            leaf_Z23_mean = np.mean(np.array([new_leaf_Z23_rep1, new_leaf_Z23_rep2]))
            leaf_Z23_std = np.std(np.array([new_leaf_Z23_rep1, new_leaf_Z23_rep2]))
            leaf_Z71_mean = np.mean(np.array([new_leaf_Z71_rep1, new_leaf_Z71_rep2]))
            leaf_Z71_std = np.std(np.array([new_leaf_Z71_rep1, new_leaf_Z71_rep2]))
            spike_Z32_mean = np.mean(np.array([new_spike_Z32_rep1, new_spike_Z32_rep2]))
            spike_Z32_std = np.std(np.array([new_spike_Z32_rep1, new_spike_Z32_rep2]))
            spike_Z39_mean = np.mean(np.array([new_spike_Z39_rep1, new_spike_Z39_rep2]))
            spike_Z39_std = np.std(np.array([new_spike_Z39_rep1, new_spike_Z39_rep2]))
            spike_Z65_mean = np.mean(np.array([new_spike_Z65_rep1, new_spike_Z65_rep2]))
            spike_Z65_std = np.std(np.array([new_spike_Z65_rep1, new_spike_Z65_rep2]))
            latet_lepto_mean = np.mean(np.array([new_latet_lepto_rep1, new_latet_lepto_rep2]))
            latet_lepto_std = np.std(np.array([new_latet_lepto_rep1, new_latet_lepto_rep2]))
            diplo_dia_mean = np.mean(np.array([new_diplo_dia_rep1, new_diplo_dia_rep2]))
            diplo_dia_std = np.std(np.array([new_diplo_dia_rep1, new_diplo_dia_rep2]))
            zygo_pachy_mean = np.mean(np.array([new_zygo_pachy_rep1, new_zygo_pachy_rep2]))
            zygo_pachy_std = np.std(np.array([new_zygo_pachy_rep1, new_zygo_pachy_rep2]))
            metaphaseI_mean = np.mean(np.array([new_metaphaseI_rep1, new_metaphaseI_rep2]))
            metaphaseI_std = np.std(np.array([new_metaphaseI_rep1, new_metaphaseI_rep2]))
            grain_Z71_mean = np.mean(np.array([new_grain_Z71_rep1, new_grain_Z71_rep2]))
            grain_Z71_std = np.std(np.array([new_grain_Z71_rep1, new_grain_Z71_rep2]))
            grain_Z75_mean = np.mean(np.array([new_grain_Z75_rep1, new_grain_Z75_rep2]))
            grain_Z75_std = np.std(np.array([new_grain_Z75_rep1, new_grain_Z75_rep2]))
            grain_Z85_mean = np.mean(np.array([new_grain_Z85_rep1, new_grain_Z85_rep2]))
            grain_Z85_std = np.std(np.array([new_grain_Z85_rep1, new_grain_Z85_rep2]))

            Wheat_10DPA_mean = np.mean(np.array([Wheat_Room1_10DPA, Wheat_Room1_10DPA_Rep,Wheat_Room2_10DPA, Wheat_Room2_10DPA_Rep]))
            Wheat_10DPA_std = np.std(np.array([Wheat_Room1_10DPA, Wheat_Room1_10DPA_Rep,Wheat_Room2_10DPA, Wheat_Room2_10DPA_Rep]))
            Wheat_AL_20DPA_mean = np.mean(np.array([Wheat_Room1_AL_20DPA, Wheat_Room1_AL_20DPA_Rep,Wheat_Room2_AL_20DPA, Wheat_Room2_AL_20DPA_Rep, Wheat_Room1_AL_20DPA_Extra1, Wheat_Room1_AL_20DPA_Extra2]))
            Wheat_AL_20DPA_std = np.std(np.array([Wheat_Room1_AL_20DPA, Wheat_Room1_AL_20DPA_Rep,Wheat_Room2_AL_20DPA, Wheat_Room2_AL_20DPA_Rep, Wheat_Room1_AL_20DPA_Extra1, Wheat_Room1_AL_20DPA_Extra2]))
            Wheat_SE_20DPA_mean = np.mean(np.array([Wheat_Room1_SE_20DPA, Wheat_Room1_SE_20DPA_Rep, Wheat_Room2_SE_20DPA, Wheat_Room2_SE_20DPA_Rep]))
            Wheat_SE_20DPA_std = np.std(np.array([Wheat_Room1_SE_20DPA, Wheat_Room1_SE_20DPA_Rep, Wheat_Room2_SE_20DPA, Wheat_Room2_SE_20DPA_Rep]))
            Wheat_TC_20DPA_mean = np.mean(np.array([Wheat_Room1_TC_20DPA, Wheat_Room1_TC_20DPA_Rep, Wheat_Room2_TC_20DPA, Wheat_Room2_TC_20DPA_Rep]))
            Wheat_TC_20DPA_std = np.std(np.array([Wheat_Room1_TC_20DPA, Wheat_Room1_TC_20DPA_Rep, Wheat_Room2_TC_20DPA, Wheat_Room2_TC_20DPA_Rep]))
            Wheat_REF_20DPA_mean = np.mean(np.array([Wheat_Room1_REF_20DPA, Wheat_Room1_REF_20DPA_Rep, Wheat_Room2_REF_20DPA, Wheat_Room2_REF_20DPA_Rep]))
            Wheat_REF_20DPA_std = np.std(np.array([Wheat_Room1_REF_20DPA, Wheat_Room1_REF_20DPA_Rep, Wheat_Room2_REF_20DPA, Wheat_Room2_REF_20DPA_Rep]))
            Wheat_SE_30DPA_mean = np.mean(np.array([Wheat_Room1_SE_30DPA, Wheat_Room1_SE_30DPA_Rep, Wheat_Room2_SE_30DPA, Wheat_Room2_SE_30DPA_Rep]))
            Wheat_SE_30DPA_std = np.std(np.array([Wheat_Room1_SE_30DPA, Wheat_Room1_SE_30DPA_Rep, Wheat_Room2_SE_30DPA, Wheat_Room2_SE_30DPA_Rep]))
            Wheat_AL_SE_30DPA_mean = np.mean(np.array([Wheat_Room1_AL_SE_30DPA, Wheat_Room1_AL_SE_30DPA_Rep, Wheat_Room2_AL_SE_30DPA, Wheat_Room2_AL_SE_30DPA_Rep]))
            Wheat_AL_SE_30DPA_std = np.std(np.array([Wheat_Room1_AL_SE_30DPA, Wheat_Room1_AL_SE_30DPA_Rep, Wheat_Room2_AL_SE_30DPA, Wheat_Room2_AL_SE_30DPA_Rep]))
            wheat_23_mean = np.mean(np.array([wheat_23_1, wheat_23_2, wheat_23_3]))
            wheat_23_std = np.std(np.array([wheat_23_1, wheat_23_2, wheat_23_3]))
            wheat_4_mean = np.mean(np.array([wheat_4_1, wheat_4_2, wheat_4_3]))
            wheat_4_std = np.std(np.array([wheat_4_1, wheat_4_2, wheat_4_3]))

            # Write one tab-separated row: gene annotation, then per-group means, then per-group standard deviations.
            # new_carpel, new_carpel_like_structure and new_stamen are single values, so their std columns are 'null'.
            fields = [Geneid, Chr, Start, End, Strand, Length,
                      root_Z10_mean, root_Z13_mean, root_Z39_mean, stem_Z30_mean, stem_Z32_mean, stem_Z65_mean,
                      leaf_Z10_mean, leaf_Z23_mean, leaf_Z71_mean, spike_Z32_mean, spike_Z39_mean, spike_Z65_mean,
                      new_carpel, new_carpel_like_structure, new_stamen,
                      latet_lepto_mean, diplo_dia_mean, zygo_pachy_mean, metaphaseI_mean,
                      grain_Z71_mean, grain_Z75_mean, grain_Z85_mean,
                      Wheat_10DPA_mean, Wheat_AL_20DPA_mean, Wheat_SE_20DPA_mean, Wheat_TC_20DPA_mean,
                      Wheat_REF_20DPA_mean, Wheat_SE_30DPA_mean, Wheat_AL_SE_30DPA_mean, wheat_23_mean, wheat_4_mean,
                      root_Z10_std, root_Z13_std, root_Z39_std, stem_Z30_std, stem_Z32_std, stem_Z65_std,
                      leaf_Z10_std, leaf_Z23_std, leaf_Z71_std, spike_Z32_std, spike_Z39_std, spike_Z65_std,
                      'null', 'null', 'null',
                      latet_lepto_std, diplo_dia_std, zygo_pachy_std, metaphaseI_std,
                      grain_Z71_std, grain_Z75_std, grain_Z85_std,
                      Wheat_10DPA_std, Wheat_AL_20DPA_std, Wheat_SE_20DPA_std, Wheat_TC_20DPA_std,
                      Wheat_REF_20DPA_std, Wheat_SE_30DPA_std, Wheat_AL_SE_30DPA_std, wheat_23_std, wheat_4_std]
            print("\t".join(str(field) for field in fields))
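
The mean and standard deviation lines in the script all follow the same pattern, so they could be collapsed into one helper. The following is only a minimal sketch for Python 3, not the script's own code: the function name `summarize_replicates`, the dictionary layout and the example values are assumptions made for illustration. It uses `statistics.pstdev`, the population standard deviation, which is the same quantity `np.std` returns with its default `ddof=0`.

```python
import statistics


def summarize_replicates(groups):
    """Return {group_name: (mean, population_std)} for replicate values.

    `groups` maps a group name to a list of replicate expression values
    (hypothetical layout, chosen for this sketch).
    """
    summary = {}
    for name, values in groups.items():
        mean_value = statistics.mean(values)
        # pstdev divides by N, like np.std with its default ddof=0
        std_value = statistics.pstdev(values)
        summary[name] = (mean_value, std_value)
    return summary


# Hypothetical FPKM values for one gene in two sample groups
example_groups = {
    "wheat_23": [10.0, 12.0, 14.0],
    "wheat_4": [3.0, 3.0, 3.0],
}
stats = summarize_replicates(example_groups)
```

With a helper like this, each group contributes one dictionary entry instead of two near-identical lines, which makes it harder to mix up replicate names between the mean and std calls.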

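Only FPKM can be used here, not TPM: we do not have information on all transcripts, so TPM cannot be computed. Alternative splicing is widespread, and second-generation (short-read) sequencing cannot effectively distinguish the expression levels of alternatively spliced transcripts. In a sense, this approach can only measure expression at the transcriptional level, not at the post-transcriptional level.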
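
To make the difference between the two units concrete, here is a small sketch of the standard definitions, not code from the original pipeline. The function names and the numbers are hypothetical. FPKM for one gene needs only that gene's fragment count, its length and the total number of mapped fragments, whereas TPM is normalized by a sum over every quantified feature, so it depends on having the complete set.

```python
def fpkm(count, length_bp, total_mapped_fragments):
    """Fragments per kilobase of feature per million mapped fragments."""
    return count * 1e9 / (length_bp * total_mapped_fragments)


def tpm(counts, lengths_bp):
    """Transcripts per million for every feature.

    Requires the counts and lengths of all quantified features, because
    each length-normalized rate is divided by the sum of all rates.
    """
    rates = [c / float(l) for c, l in zip(counts, lengths_bp)]
    total_rate = sum(rates)
    return [rate / total_rate * 1e6 for rate in rates]


# Hypothetical gene: 500 fragments, 2000 bp, 20 million mapped fragments
gene_fpkm = fpkm(500, 2000, 20000000)

# Hypothetical complete feature set of three genes
all_tpm = tpm([100, 200, 300], [1000, 2000, 1500])
```

By construction the TPM values of a sample always sum to one million, which is why leaving features out of the sum changes every gene's TPM.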
