It must be noted that there is no single way to measure a thing: there are many methods of measurement, each with its own characteristics, and which method is most reasonable in which situation is itself a question worth studying.
The following is the introduction to measurement theory provided by SAS.
URL: ftp://ftp.sas.com/pub/neural/measurement.html
Version 1 originally published in the Disseminations of the International Statistical Applications Institute, volume 1, edition 4, 1995, Wichita: ACG Press, pp. 61-66.
Copyright 1995, 1996, 1997 by Warren S. Sarle, Cary, NC, USA.
Permission is granted to reproduce this article for non-profit educational purposes only, retaining the author's name and copyright notice.
The mathematical theory of measurement is elaborated in:
Krantz, D. H., Luce, R. D., Suppes, P., and Tversky, A. (1971), Foundations of measurement, Vol. I: Additive and polynomial representations, New York: Academic Press.
Suppes, P., Krantz, D. H., Luce, R. D., and Tversky, A. (1989), Foundations of measurement, Vol. II: Geometrical, threshold, and probabilistic representations, New York: Academic Press.
Luce, R. D., Krantz, D. H., Suppes, P., and Tversky, A. (1990), Foundations of measurement, Vol. III: Representation, axiomatization, and invariance, New York: Academic Press.
Measurement theory was popularized in psychology by S. S. Stevens, who originated the idea of levels of measurement. His relevant articles include Stevens (1946, 1951, 1959, 1968).
For a recent discussion of measurement theory and statistics, see Hand (1996).
Suppose we have a collection of straight sticks of various sizes and we assign a number to each stick by measuring its length using a ruler. If the number assigned to one stick is greater than the number assigned to another stick, we can conclude that the first stick is longer than the second. Thus a relationship among the numbers (greater than) corresponds to a relationship among the sticks (longer than). If we lay two sticks end-to-end in a straight line and measure their combined length, then the number we assign to the concatenated sticks will equal the sum of the numbers assigned to the individual sticks (within measurement error). Thus another relationship among the numbers (addition) corresponds to a relationship among the sticks (concatenation). These relationships among the sticks must be empirically verified for the measurements to be valid.
When we measure something, the resulting numbers are usually, to some degree, arbitrary. We choose to use a 1 to 5 rating scale instead of a -2 to 2 scale. We choose to use Fahrenheit instead of Celsius. We choose to use miles per gallon instead of gallons per mile. The conclusions of a statistical analysis should not depend on these arbitrary decisions, because we could have made the decisions differently. We want the statistical analysis to say something about reality, not simply about our whims regarding meters or feet. If a given statement may be either true or false depending on arbitrary, unspecified choices, then that statement is logically meaningless.
Suppose we have a rating scale where several judges rate the goodness of flavor of several foods on a 1 to 5 scale. If we want to draw conclusions about the measurements, i.e. the 1-to-5 ratings, then we need not be concerned about measurement theory. For example, if we want to test the hypothesis that the foods have equal mean ratings, we might do a two-way ANOVA on the ratings.
But if we want to draw conclusions about flavor, then we must consider how flavor relates to the ratings, and that is where measurement theory comes in. Ideally, we would want the ratings to be linear functions of the flavors with the same slope for each judge; if so, the ANOVA can be used to make inferences about mean goodness-of-flavors, providing we can justify all the appropriate statistical assumptions. But if the judges have different slopes relating ratings to flavor, or if these functions are not linear, then this ANOVA will not allow us to make inferences about mean goodness-of-flavor. Note that this issue is not about statistical interaction; even if there is no evidence of interaction in the ratings, the judges may have different functions relating ratings to flavor.
We need to consider what information we have about the functions relating ratings to flavor for each judge. Perhaps the only thing we are sure of is that the ratings are monotone increasing functions of flavor. In this case, we would want to use a statistical analysis that is valid no matter what the particular monotone increasing functions are. One way to do this is to choose an analysis that yields invariant results no matter what monotone increasing functions the judges happen to use, such as a Friedman test. The study of such invariances is a major concern of measurement theory.
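To illustrate (with invented ratings; this sketch is not from the original article), the Friedman test depends only on the within-judge ranks, so any strictly increasing transformation of the rating scale leaves its result unchanged:

```python
# Sketch with invented data: the Friedman test uses only within-judge ranks,
# so a strictly increasing transformation of the ratings changes nothing.
import numpy as np
from scipy.stats import friedmanchisquare

rng = np.random.default_rng(0)
# rows = judges, columns = foods, values = 1-to-5 ratings
ratings = rng.integers(1, 6, size=(8, 3))

# An arbitrary strictly increasing transformation of the rating scale
transformed = np.exp(ratings) + 10 * ratings

print(friedmanchisquare(*ratings.T))      # test across the three foods
print(friedmanchisquare(*transformed.T))  # identical statistic and p-value
```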
However, no measurement theorist would claim that measurement theory provides a complete solution to such problems. In particular, measurement theory generally does not take random measurement error into account, and if such errors are an important aspect of the measurement process, then additional methods, such as latent variable models, are called for. There is no clear boundary between measurement theory and statistical theory; for example, a Rasch model is both a measurement model and a statistical model.
In the example of measuring sticks, changing the unit of measurement (say, from centimeters to inches) multiplies the measurements by a constant factor. This multiplication does not alter the correspondence of the relationships 'greater than' and 'longer than', nor the correspondence of addition and concatenation. Hence, change of units is a permissible transformation with respect to these relationships.
There are different levels of measurement that involve different properties (relations and operations) of the numbers or symbols that constitute the measurements. Associated with each level of measurement is a set of permissible transformations. The most commonly discussed levels of measurement are nominal (permitting any one-to-one transformation of the symbols), ordinal (any strictly increasing monotone transformation), interval (affine transformations: multiplication by a positive constant plus addition of any constant), log-interval (power transformations of the form ax^b with a and b positive), ratio (multiplication by a positive constant), and absolute (only the identity transformation).
These measurement levels form a partial order based on the sets of permissible transformations:
Weaker <-----------------------------------> Stronger
                         Interval
                        /        \
   Nominal -- Ordinal <            > Ratio -- Absolute
                        \        /
                         Log-interval
In real life, a scale of measurement may not correspond precisely to any of these levels of measurement. For example, there can be a mixture of nominal and ordinal information in a single scale, such as in questionnaires that have several non-response categories. It is common to have scales that lie somewhere between the ordinal and interval levels in that the measurements can be assumed to be a smooth monotone function of the attribute. For many subjective rating scales (such as the 'strongly agree,' 'agree,' ... 'strongly disagree' variety) it cannot be shown that the intervals between successive ratings are exactly equal, but with reasonable care and diagnostics it may be safe to say that no interval represents a difference more than two or three times greater than another interval.
The above list of measurement levels is not exhaustive. It is not unusual to encounter other scales of measurement that do not have such widely recognized names. For example, directional or circular data may be measured on what might be called a periodic-interval scale, which has an arbitrary origin and unit as well as a period related to the unit. Time of day, for example, conventionally has an origin of midnight, a unit of hours, and a period of 24 hours.
Unfortunately, there are also many situations where the measurement process is too ill-defined for measurement theory to apply. In such cases, it may still be fruitful to consider what arbitrary choices were made in the course of measurement, what effect these choices may have had on the measurements, and whether some plausible class of permissible transformations can be determined.
Nominal variables are often analyzed in linear models by coding binary dummy variables. This procedure is justified since binary variables are at the interval level or higher.
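A minimal sketch of such dummy coding (assuming pandas is available; the categories are invented for illustration):

```python
# Sketch: a nominal variable enters a linear model through binary dummy
# variables, each of which is itself a 0/1 (interval-or-higher) measurement.
import pandas as pd

religion = pd.Series(["A", "B", "C", "A", "B"], name="religion")
dummies = pd.get_dummies(religion, prefix="religion", drop_first=True)
print(dummies)
# Relabelling the categories (a permissible one-to-one transformation of a
# nominal variable) only renames or permutes the dummy columns; the fitted
# model is unchanged.
```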
Consider a rat in a Skinner box who pushes a lever to get food pellets. The number of pellets dispensed in the course of an experiment is obviously an absolute-level measurement of the number of pellets dispensed. If number of pellets is considered as a measure of some other attribute, the measurement level may differ. As a measure of amount of food dispensed, the number of pellets is at the ratio level under the assumption that the pellets are of equal size; if the pellets are not of equal size, a more elaborate measurement model is required, perhaps one involving random measurement error if the pellets are dispensed in random order. As a measure of duration during the experiment, the number of pellets is at an ordinal level. As a measure of response effort, the number of pellets might be approximately ratio level, but we would need to consider whether the rat's responses were executed in a consistent way, whether the rat may miss the lever, and so forth. As a measure of amount of reward, the number of pellets could only be justified by some very strong assumptions about the nature of rewards; the measurement level would depend on the precise nature of those assumptions. The main virtue of measurement theory is that it encourages people to consider such issues.
Velleman and Wilkinson (1993) have pointed out that decisions about the scale of measurement should not be made in haste. Not only may information about the measurement process be inadequate, but during the course of a statistical analysis, we may revise our theories about what attributes we are trying to measure. Especially in predictive modeling, we may discover that some of the predictor variables contain previously unsuspected information. However, measurement, like experimental design, is something that should be considered carefully before collecting data.
Once a set of measurements has been made on a particular scale, it may be possible to transform the measurements to yield a new set of measurements at a different level. It is always possible to transform from a stronger level to a weaker level. For example, a temperature measurement in degrees Kelvin is at the ratio level. If we convert the measurements to degrees Celsius, the level is interval. If we rank the measurements, the level becomes ordinal. In some cases it is possible to convert from a weaker scale to a stronger scale. For example, correspondence analysis can convert nominal measurements to an interval scale under appropriate assumptions, and multidimensional scaling or conjoint analysis can convert ordinal measurements to an interval scale if the model is correct.
Measurement level has nothing to do with discrete vs. continuous variables.
The distinction between discrete and continuous random variables is commonly used in statistical theory, but that distinction is rarely of importance in practice. A continuous random variable has a continuous cumulative distribution function. A discrete random variable has a stepwise-constant cumulative distribution function. A discrete random variable can take only a finite number of distinct values in any finite interval. There exist random variables that are neither continuous nor discrete; for example, if Z is a standard normal random variable and Y=max(0,Z), then Y is neither continuous nor discrete, but has characteristics of both.
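A small simulation (invented data, not from the original article) makes the mixed character of Y visible: about half of its probability sits in a point mass at zero, while the rest is spread continuously over the positive axis:

```python
# Sketch: Y = max(0, Z) with Z standard normal has a point mass at 0
# (discrete part) and a continuous density on the positive axis.
import numpy as np

rng = np.random.default_rng(0)
z = rng.standard_normal(1_000_000)
y = np.maximum(0.0, z)

print((y == 0).mean())        # ~0.5: the point mass P(Y = 0) = P(Z <= 0)
print(np.quantile(y, 0.75))   # ~0.674: Y behaves continuously above zero
```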
While measurements are always discrete due to finite precision, attributes can be conceptually either discrete or continuous regardless of measurement level. Temperature is usually regarded as a continuous attribute, so temperature measurement to the nearest degree Kelvin is a ratio-level measurement of a continuous attribute. However, quantum mechanics holds that the universe is fundamentally discrete, so temperature may actually be a discrete attribute. In ordinal scales for continuous attributes, ties are impossible (or have probability zero). In ordinal scales for discrete attributes, ties are possible. Nominal scales usually apply to discrete attributes. Nominal scales for continuous attributes can be modeled but are rarely used.
Mathematical statistics is concerned with the connection between inference and data. Measurement theory is concerned with the connection between data and reality. Both statistical theory and measurement theory are necessary to make inferences about reality.
Measurement theory cannot single out one statistical method or model as the appropriate one for data at a specific level of measurement. But measurement theory does show that some statistical methods are inappropriate for certain levels of measurement if we want to make meaningful inferences about the attribute being measured.
If we want to make statistical inferences regarding an attribute based on a scale of measurement, the statistical method must yield invariant or equivariant results under the permissible transformations for that scale of measurement. If this invariance or equivariance does not hold, then the statistical inferences apply only to the measurements, not to the attribute that was measured.
If we record the temperature in degrees Fahrenheit in Cary, NC, at various times, we can compute statistics such as the mean, standard deviation, and coefficient of variation. Since Fahrenheit is an interval scale, only statistics that are invariant or equivariant under change of origin or unit of measurement are meaningful. The mean is meaningful because it is equivariant under change of origin or unit. The standard deviation is meaningful because it is invariant under change of origin and equivariant under change of unit. But the coefficient of variation is meaningless because it lacks such invariance or equivariance. The mean and standard deviation can easily be converted back and forth from Fahrenheit to Celsius, but we cannot compute the coefficient of variation in degrees Celsius if we know only the coefficient of variation in degrees Fahrenheit.
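A short numerical sketch (the temperatures are invented for illustration) of this equivariance, and of its failure for the coefficient of variation:

```python
# Sketch: under the affine change of units C = (F - 32) * 5/9, the mean and
# standard deviation transform predictably, but the coefficient of variation
# does not.
import numpy as np

f = np.array([68.0, 72.0, 75.0, 81.0, 90.0])   # degrees Fahrenheit
c = (f - 32.0) * 5.0 / 9.0                      # degrees Celsius

print(np.mean(c), (np.mean(f) - 32.0) * 5.0 / 9.0)        # equal: mean is equivariant
print(np.std(c, ddof=1), np.std(f, ddof=1) * 5.0 / 9.0)   # equal: sd scales with the unit
print(np.std(c, ddof=1) / np.mean(c),
      np.std(f, ddof=1) / np.mean(f))                     # different: CV is not invariant
```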
Paul Thompson provides an example where interval and ratio levels are confused:
... I recently published a paper in Psychiatric Research. We discuss the BPRS, a very common rating scale in psychiatry. Oddly enough, in the BPRS in the US, '1' means NO PATHOLOGY. However, a frequent statistic computed is percent improved: PI = (BPRS(Base) - BPRS(6week)) / BPRS(Base) * 100.
If you use the '1 implies no pathology' model, you are not measuring according to a ratio scale, which requires a true 0. We show that this has very bad characteristics, which include a flat impossibility for a certain % improvement at certain points in the scale. This is pretty trivial, but should have an effect. As one reviewer said, 'This is so obvious that I am surprised that no one has ever thought of it before.' Nonetheless, the scale was being misused.
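To make the point concrete, here is a hypothetical numerical sketch (the scores are invented, not taken from Thompson's paper) of how the percent-improved statistic misbehaves on a scale anchored at 1:

```python
# Hypothetical numbers for illustration: PI = (base - week6) / base * 100
# changes when the scale's arbitrary origin changes, so it is only
# meaningful on a true ratio scale.
def percent_improved(base, week6):
    return (base - week6) / base * 100.0

base, week6 = 4.0, 2.0                          # on the scale where '1' means no pathology
print(percent_improved(base, week6))            # 50.0 on the 1-anchored scale
print(percent_improved(base - 1, week6 - 1))    # 66.7 after re-anchoring at 0
# With the 1-anchored scale, 100% improvement is impossible even for a
# patient who recovers completely (week6 = 1 gives PI = 75, not 100).
```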
It is clear that if we are estimating a parameter that lacks invariance or equivariance under permissible transformations, we are estimating a chimera. The situation for hypothesis testing is more subtle. It is nonsense to test a null hypothesis the truth of which is not invariant under permissible transformations. For example, it would be meaningless to test the null hypothesis that the mean temperature in Cary in July is twice the mean temperature in December using a Fahrenheit or Celsius scale--we would need a ratio scale for that hypothesis to be meaningful.
But it is possible for the null hypothesis to be meaningful even if the error rates for a given test are not invariant. Suppose that we had an ordinal scale of temperature, and the null hypothesis was that the distribution of temperatures in July is identical to the distribution in December. The truth of this hypothesis is invariant under strictly increasing monotone transformations and is therefore meaningful under an ordinal scale. But if we do a t-test of this hypothesis, the error rates will not be invariant under monotone transformations. Hard-core measurement theorists would therefore consider a t-test inappropriate. But given a null hypothesis, there are usually many different tests that can be performed with accurate or conservative significance levels but with different levels of power against different alternatives. The fact that different tests have different error rates does not make any of them correct or incorrect. Hence a soft-core measurement theorist might argue that invariance of error rates is not a prerequisite for a meaningful hypothesis test--only invariance of the null hypothesis is required.
Nevertheless, the hard-core policy rules out certain tests that, while not incorrect in a strict sense, are indisputably poor tests in terms of having absurdly low power. Consider the null hypothesis that two random variables are independent of each other. This hypothesis is invariant under one-to-one transformations of either variable. Suppose we have two nominal variables, say, religion and preferred statistical software product, to which we assign arbitrary numbers. After verifying that at least one of the two variables is approximately normally distributed, we could test the null hypothesis using a Pearson product-moment correlation, and this would be a valid test. However, the power of this test would be so low as to be useless unless we were lucky enough to assign numbers to categories in such a way as to reveal the dependence as a linear relationship. Measurement theory would suggest using a test that is invariant under one-to-one transformations, such as a chi-squared test of independence in a contingency table. Another possibility would be to use a Pearson product-moment correlation after assigning numbers to categories in such a way as to maximize the correlation (although the usual sampling distribution of the correlation coefficient would not apply). In general, we can test for independence by maximizing some measure of dependence over all permissible transformations.
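A rough sketch (with simulated categories and arbitrary numeric codes, invented for illustration) of the contrast between the two tests:

```python
# Sketch: the chi-squared test of independence is invariant under re-coding
# of the categories, whereas a Pearson correlation on arbitrary numeric codes
# is not.
import numpy as np
from scipy.stats import chi2_contingency, pearsonr

rng = np.random.default_rng(0)
n = 500
religion = rng.integers(0, 3, size=n)                      # three nominal categories
software = (religion + rng.integers(0, 2, size=n)) % 3     # a dependent nominal variable

table = np.zeros((3, 3))
for r, s in zip(religion, software):
    table[r, s] += 1
chi2, p, dof, expected = chi2_contingency(table)
print(p)                                        # small; depends only on the cross-classification

codes_a = np.array([0, 1, 2])                   # one arbitrary numeric coding
codes_b = np.array([5, -3, 11])                 # another one-to-one re-coding
r_a, p_a = pearsonr(codes_a[religion], codes_a[software])
r_b, p_b = pearsonr(codes_b[religion], codes_b[software])
print(p_a, p_b)                                 # generally different under re-coding
```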
However, it must be emphasized that there is no need to restrict the transformations in a statistical analysis to those that are permissible. That is not what permissible transformation means. The point is that statistical methods should be used that give invariant results under the class of permissible transformations, because those transformations do not alter the meaning of the measurements. Permissible was undoubtedly a poor choice of words, but Stevens was quite clear about what he meant. For example (Stevens 1959):
In general, the more unrestricted the permissible transformations, the more restricted the statistics. Thus, nearly all statistics are applicable to measurements made on ratio scales, but only a very limited group of statistics may be applied to measurements made on nominal scales.
The connection between measurement level and statistical analysis has been hotly disputed in the psychometric and statistical literature by people who fail to distinguish between inferences regarding the attribute and inferences regarding the measurements. If one is interested only in making inferences about the measurements without regard to their meaning, then measurement level is, of course, irrelevant to choice of statistical method. The classic example is Lord's (1953) article "On the Statistical Treatment of Football Numbers." Lord argued that statistical methods could be applied regardless of level of measurement, and concocted a silly example involving the jersey numbers assigned to football players, which Lord claimed were nominal-level measurements of the football players. Lord contrived a situation in which freshmen claimed they were getting lower numbers than the sophomores, so the purpose of the analysis was to make inferences about the numbers, not about some attribute measured by the numbers. It was therefore quite reasonable to treat the numbers as if they were on an absolute scale. However, this argument completely misses the point by eliminating the measured attribute from the scenario.
The confusion between measurements and attributes was perpetuated by Velleman and Wilkinson (1993), who set up a series of straw men and knocked some of them down, while consistently misunderstanding the meaning of meaning and of permissible transformation. For example, they claimed that the number of cylinders in an automobile engine can be treated, depending on the circumstances, as nominal, ordinal, interval, or ratio, and hence the concept of measurement level "simplifies the matter so far as to be false." In fact, the number of cylinders is at the absolute level of measurement. Thus, measurement theory would dictate that any statistical analysis of number of cylinders must be invariant under an identity transformation. Obviously, any analysis is invariant under an identity transformation, so all of the analyses that Velleman and Wilkinson claimed might be appropriate are acceptable according to measurement theory. What is false is not measurement theory but Velleman and Wilkinson's backwards interpretation of it.
It is important to understand that the level of measurement of a variable does not mandate how that variable must appear in a statistical model. However, the measurement level does suggest reasonable ways to use a variable by default. Consider the analysis of fuel efficiency in automobiles. If we are interested in the average distance that can be driven with a given amount of gas, we should analyze miles per gallon. If we are interested in the average amount of gas required to drive a given distance, we should analyze gallons per mile. Both miles per gallon and gallons per mile are measurements of fuel efficiency, but they may yield quite different results in a statistical analysis, and there may be no clear reason to use one rather than the other. So how can we make inferences regarding fuel efficiency that do not depend on the choice between these two scales of measurement? We can do that by recognizing that both miles per gallon and gallons per mile are measurements of the same attribute on a log-interval scale, and hence that the logarithm of either can be treated as a measurement on an interval scale. Thus, if we were doing a regression, it would be reasonable to begin the analysis using log(mpg). If evidence of nonlinearity were detected, then other transformations could still be considered.
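A minimal sketch (with invented data) showing that regressions on log(mpg) and log(gpm) are the same analysis up to a sign change:

```python
# Sketch: miles per gallon and gallons per mile are reciprocal measurements
# of the same attribute, so log(mpg) and log(gpm) differ only in sign, and a
# regression on either log gives the same fit with the slope's sign flipped.
import numpy as np

rng = np.random.default_rng(0)
weight = rng.uniform(1.5, 4.5, size=50)                 # car weight (invented)
mpg = 60.0 / weight * np.exp(rng.normal(0, 0.1, 50))    # invented fuel-efficiency data
gpm = 1.0 / mpg

slope_mpg, intercept_mpg = np.polyfit(weight, np.log(mpg), 1)
slope_gpm, intercept_gpm = np.polyfit(weight, np.log(gpm), 1)
print(slope_mpg, slope_gpm)   # identical magnitude, opposite sign
```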
Rank tests are broadly useful for ordinal data because ranking often produces the required invariance of test statistics. But ranking is not some sort of ordinal-to-interval conversion. An hypothesis that is meaningless for ordinal data does not become meaningful when the data are ranked. For example, in a two-way factorial design, the hypothesis of additivity (no interaction) of the effects requires an interval or stronger scale. Ranking does not allow tests of interaction for ordinal data because no well-defined hypothesis is being tested--the truth of the hypothesis of additivity of ranks can change depending on how many cases are in each cell of the design.
The cookbook approach to measurement theory and rank tests has yielded some peculiar ideas. Measurement theory certainly does not demand that ordinal data be ranked, since there are other ways of achieving the necessary invariance (e.g., Agresti, 1984; Gifi, 1990). Neither does measurement theory forbid the use of rank tests, as some people have argued under the misguided notion that ranks, being ordinal, cannot be summed; sums of ranks can be used meaningfully when they have the necessary invariance properties.
What has been shown is that various statistical methods are more or less robust to distortions that could arise from smooth monotone transformations; in other words, there are cases where it makes little difference whether we treat a measurement as ordinal or interval. But there can hardly be any doubt that it often makes a huge difference whether we treat a measurement as nominal or ordinal, and confusion between interval and ratio scales is a common source of nonsense.
Suppose we are doing a two-sample t-test; we are sure that the assumptions of ordinal measurement are satisfied, but we are not sure whether an equal-interval assumption is justified. A smooth monotone transformation of the entire data set will generally have little effect on the p value of the t-test. A robust variant of a t-test will likely be affected even less (and, of course, a rank version of a t-test will be affected not at all). It should come as no surprise, then, that a decision between an ordinal and an interval level of measurement is of no great importance in such a situation, but anyone with lingering doubts on the matter may consult the simulations in Baker, Hardyck, and Petrinovich (1966) for a demonstration of the obvious.
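A small simulation (invented data) of this point:

```python
# Sketch: a smooth monotone transformation of the whole data set changes the
# t-test p-value only slightly, and leaves the rank-based (Mann-Whitney)
# p-value exactly unchanged.
import numpy as np
from scipy.stats import ttest_ind, mannwhitneyu

rng = np.random.default_rng(0)
x = rng.normal(10.0, 1.0, size=30)
y = rng.normal(10.7, 1.0, size=30)

def smooth_monotone(v):
    return np.log(v) + 0.05 * v**2   # strictly increasing for v > 0

print(ttest_ind(x, y).pvalue,
      ttest_ind(smooth_monotone(x), smooth_monotone(y)).pvalue)      # similar
print(mannwhitneyu(x, y).pvalue,
      mannwhitneyu(smooth_monotone(x), smooth_monotone(y)).pvalue)   # identical
```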
On the other hand, suppose we were comparing the variability instead of the location of the two samples. The F test for equality of variances is not robust, and smooth monotone transformations of the data could have a large effect on the p value. Even a more robust test could be highly sensitive to smooth monotone transformations if the samples differed in location.
Measurement level is of greatest importance in situations where the meaning of the null hypothesis depends on measurement assumptions. Suppose the data are 1-to-5 ratings obtained from two groups of people, say males and females, regarding how often the subjects have sex: frequently, sometimes, rarely, etc. Suppose that these two groups interpret the term 'frequently' differently as applied to sex; perhaps males consider 'frequently' to mean twice a day, while females consider it to mean once a week. Females may report having sex more 'frequently' than men on the 1-to-5 scale, even if men in fact have sex more frequently as measured by sexual acts per unit of time. Hence measurement considerations are crucial to the interpretation of the results.
As mentioned earlier, it is meaningless to claim that it was twice as warm today as yesterday because it was 40 degrees Fahrenheit today but only 20 degrees yesterday. Fahrenheit is not a ratio scale, and there is no meaningful sense in which 40 degrees is twice as warm as 20 degrees. It would be just as meaningless to compute the geometric mean or coefficient of variation of a set of temperatures in degrees Fahrenheit, since these statistics are not invariant or equivariant under change of origin. There are many other statistics that can be meaningfully applied only to data at a sufficiently strong level of measurement.
Consider some measures of location: the mode requires a nominal or stronger scale, the median requires an ordinal or stronger scale, the arithmetic mean requires an interval or stronger scale, and the geometric mean and the harmonic mean require a ratio or stronger scale.
Consider some measures of variation: entropy requires a nominal or stronger scale, the standard deviation requires an interval or stronger scale, and the coefficient of variation requires a ratio or stronger scale.
Simple linear regression with an intercept requires that both variables be on an interval or stronger scale. Regression through the origin requires that both variables be on a ratio or stronger scale.
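A brief sketch (invented data) of why this distinction matters: the slope from a regression with an intercept is invariant under a change of origin of the predictor, a permissible interval-scale transformation, while the slope from regression through the origin is not:

```python
# Sketch: with an intercept, the estimated slope is unchanged by a shift of
# the predictor's origin; regression through the origin is not.
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0.0, 10.0, size=40)
y = 3.0 * x + 5.0 + rng.normal(0.0, 1.0, size=40)
x_shifted = x + 100.0                      # change of origin

def slope_with_intercept(x, y):
    return np.polyfit(x, y, 1)[0]

def slope_through_origin(x, y):
    return np.sum(x * y) / np.sum(x * x)   # least squares with no intercept

print(slope_with_intercept(x, y), slope_with_intercept(x_shifted, y))   # equal
print(slope_through_origin(x, y), slope_through_origin(x_shifted, y))   # very different
```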
A generalized linear model using a normal distribution requires the dependent variable to be on an interval or stronger scale. A gamma distribution requires a ratio or stronger scale. A Poisson distribution requires an absolute scale.
The general principle is that an appropriate statistical analysis must yield invariant or equivariant results for all permissible transformations. Obviously, we cannot actually conduct an infinite number of analyses of a real data set corresponding to an infinite class of transformations. However, it is often straightforward to verify or falsify the invariance mathematically. The application of this idea to summary statistics such as means and coefficients of variation is fairly widely understood.
Confusion arises when we come to linear or nonlinear models and consider transformations of variables. Recall that Stevens did not say that transformations that are not 'permissible' are prohibited. What Stevens said was that we should consider all 'permissible' transformations and verify that our conclusions are invariant.
Consider, for example, the problem of estimating the parameters of a nonlinear model by maximum likelihood (ML), and comparing various models by likelihood ratio (LR) tests. We would want the LR tests to be invariant under the permissible transformations of the variables. One way to do this is to parameterize the model so that any permissible transformation can be inverted by a corresponding change in the parameter estimates. In other words, we can make the ML and LR tests invariant by making the inverse-permissible transformations mandatory (this is the same set of transformations as the permissible transformations except for a degeneracy here and there which I won't worry about).
To illustrate, suppose we are modeling a variable Y as a function f() of variables N, O, I, L, R, and A at the nominal, ordinal, etc. measurement levels, respectively. Then we can ensure the desired invariance by setting up the model as:
Y = f( arb(N), mon(O), a+bI, cL^d, eR, A, ...)
where arb() is any (estimated) function, mon() is any (estimated) monotone function, and a, b, c, d, and e are parameters. Then any permissible transformations of N, O, I, L, R, and A can be absorbed by the estimation of the arb() and mon() functions and the parameters. The function f() can involve any other transformations such as square roots or logs or whatever. f() can be as complicated as you like--the presence of the permissible transformations as part of the model to be estimated guarantees the desired invariance.
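As a rough sketch of this idea in the simplest linear special case (invented data; this is only an illustration, not the general model above), an affine transformation of an interval-level predictor is absorbed by re-estimating a and b, leaving the fitted values, and hence the likelihood, unchanged:

```python
# Sketch: an affine transformation I' = p + q*I of an interval-scale
# predictor is absorbed into the a + b*I term, so fitted values (and any LR
# test based on them) are invariant.
import numpy as np

rng = np.random.default_rng(0)
I = rng.normal(0.0, 1.0, size=50)
Y = 2.0 + 1.5 * I + rng.normal(0.0, 0.3, size=50)

I_prime = 7.0 + 3.0 * I                     # a permissible interval-scale transformation

X = np.column_stack([np.ones_like(I), I])
X_prime = np.column_stack([np.ones_like(I_prime), I_prime])

beta, *_ = np.linalg.lstsq(X, Y, rcond=None)
beta_prime, *_ = np.linalg.lstsq(X_prime, Y, rcond=None)

print(np.allclose(X @ beta, X_prime @ beta_prime))   # True: identical fitted values
print(beta, beta_prime)                              # the parameters absorb the transformation
```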
If we were designing software for fitting linear or nonlinear models, we might want to provide these 'permissible' or 'mandatory' transformations in a convenient way. This, in fact, was the motivation for numerous programs developed by psychometricians that anticipated many of the features of ACE and generalized additive models.
Michell (1986, 1990; also discussed by Hand, 1996) has distinguished among "representational" measurement theory as described in this FAQ, "operational" theory, and "classical" theory. The representational theory assumes that there exists a "reality" that is being measured, and that scientific theories are about this reality. The operational theory avoids the assumption of an underlying reality, requiring only that measurement consist of precisely specified operations; scientific theories concern only relationships among measurements. The classical theory, like the representational theory, assumes an objective reality, but, unlike the representational theory, holds that only quantitative attributes are measurable, and measurement involves the discovery of the magnitudes of these attributes. In the classical theory, like the operational theory, meaningfulness comes from empirical support for scientific theories describing the interrelationships of various measurements.
It is interesting to consider the dramatic international rise in IQ scores (the "Flynn effect": Flynn, 1987; Neisser, 1997) in light of these three theories of measurement. Has intelligence increased, or just the test scores? The representational theory makes clear the importance of this distinction. The operational theory makes the question impossible to ask, since there is no such thing as "intelligence" as distinct from scores on intelligence tests. The classical theory allows the question, "What do IQ tests really measure?" (Flynn, 1987), but it is difficult to see how intelligence can be regarded as having a magnitude susceptible to classical measurement by Michell's (1986) definition.
The distinctions among Michell's theories of measurement are not always clear. Consider latent variable models, such as a Rasch model. Michell (1986, p. 404) says that "Such an approach to psychological measurement owes more to the classical theory of measurement than to either the operational or the representational theories." Hand (1996, p. 455) says, "It seems to me, however, that the so-called latent variables are operationally defined by their relationships to the observed variables." But Hand later (p. 457) describes a Rasch model as an example of classical measurement. My opinion is that a Rasch model is a form of representational measurement involving probabilistic relationships--an extension, but a natural extension, of Stevens's idea of measurement.
The operational theory has the virtue of discouraging sloppiness. As Hand (1996) points out, operational measurement may be good enough for predictive (as opposed to explanatory) models. But operationalism has some severe philosophical disadvantages. Robert Klee (1997) says about operationalism (pp. 53-54):
The most influential doctrine about [correspondence rules] ever to circulate among practicing scientists themselves (and not just philosophers of science) was operationalism. Operationalism was first introduced to a wide scientific audience by the physicist Percy Bridgman in 1927, just as logical positivism was starting up in central Europe. Operationalism did not last long in the physical sciences; but, for reasons that continue to puzzle philosophers of science, it survives to this day with considerable influence in the social and behavioral sciences (especially psychology), where the methodological war cry to "operationalize your variables!" persists among practitioners in certain quarters despite the problems with operationalism that we are about to investigate.
Michell (1990, p. 28) says that operationalism is logically false, hence the operational theory of measurement must be rejected. He also claims (p. 49) that the representational theory, while not logically incoherent, is nevertheless empirically false. However, this conclusion seems to be based more on philosophical convictions than empirical results, primarily on the claim that numbers are empirical entities, not abstractions. Most of Michell's (1990) book is devoted to promoting the classical theory of measurement, in which numbers are empirical, but which in other respects seems very similar to the representational theory. However, in the discussion of Hand's (1996, pp. 481-482) paper, Michell emphasizes that the three theories of measurement are mutually contradictory.
While I find most of Michell's (1990) conclusions unconvincing, I think his emphasis on philosophy is enlightening with respect to the ongoing arguments about measurement. In the debate about the implications of measurement theory for statistical practice, it often seems that the two sides are arguing past each other, each side considering their own position to be self-evident. Klee (1997) points out that a similar situation exists with regard to arguments between realist and anti-realist philosophers of science. Operationalism is dead, but operational measurement could still be supported by some varieties of anti-realism, especially the pragmatic school. Perhaps the arguments about measurement theory go nowhere because of unstated philosophical assumptions.
Measurement theory shows that strong assumptions are required for certain statistics to provide meaningful information about reality. Measurement theory encourages people to think about the meaning of their data. It encourages critical assessment of the assumptions behind the analysis. It encourages responsible real-world data analysis.
Agresti, A. (1984), Analysis of Ordinal Categorical Data. NY: Wiley.
Baker, B. O., Hardyck, C., and Petrinovich, L. F. (1966), "Weak measurement vs. strong statistics: An empirical critique of S.S. Stevens' proscriptions on statistics," Educational and Psychological Measurement, 26, 291-309.
Bridgman, P. (1927), The Logic of Modern Physics, NY: Macmillan.
Flynn, J.R. (1987), "Massive IQ gains in 14 nations: What IQ tests really measure," Psychological Bulletin, 101, 171-191.
Gifi, A. (1990), Nonlinear Multivariate Analysis, Chichester: Wiley.
Hand, D.J. (1996), "Statistics and the theory of measurement," with discussion, J. of the Royal Statistical Society, Series A, 159, 445-492.
Klee, R. (1997), Introduction to the Philosophy of Science: Cutting Nature at Its Seams, NY: Oxford University Press.
Krantz, D. H., Luce, R. D., Suppes, P., and Tversky, A. (1971), Foundations of measurement, Vol. I: Additive and polynomial representations, New York: Academic Press.
Lord, F.M. (1953), "On the Statistical Treatment of Football Numbers," American Psychologist, 8, 750-751.
Luce, R. D., Krantz, D. H., Suppes, P., and Tversky, A. (1990), Foundations of measurement, Vol. III: Representation, axiomatization, and invariance, New York: Academic Press.
Michell, J. (1986), "Measurement scales and statistics: a clash of paradigms," Psychological Bulletin, 100, 398-407.
Michell, J. (1990), An Introduction to the Logic of Psychological Measurement, Hillsdale: Erlbaum.
Neisser, U. (1997), "Rising scores on intelligence tests," American Scientist, 85, 440-447.
Suppes, P., Krantz, D. H., Luce, R. D., and Tversky, A. (1989), Foundations of measurement, Vol. II: Geometrical, threshold, and probabilistic representations, New York: Academic Press.
Stevens, S. S. (1946), "On the theory of scales of measurement," Science, 103, 677-680.
Stevens, S. S. (1951), "Mathematics, measurement, and psychophysics," in S. S. Stevens (ed.), Handbook of experimental psychology, pp. 1-49, New York: Wiley.
Stevens, S. S. (1959), "Measurement," In C. W. Churchman, ed., Measurement: Definitions and Theories, pp. 18-36, New York: Wiley. Reprinted in G. M. Maranell, ed., (1974) Scaling: A Sourcebook for Behavioral Scientists, pp. 22-41, Chicago: Aldine.
Stevens, S. S. (1968), "Measurement, statistics, and the schemapiric view," Science, 161, 849-856.
Velleman, P.F., and Wilkinson, L. (1993), "Nominal, Ordinal, Interval, and Ratio Typologies Are Misleading," The American Statistician, 47, 65-72.