cumi6497

源数据和数据源_这些是任何人都可以使用的最佳免费开放数据源

源数据和数据源

by Hiren Patel

希伦·帕特尔(Hiren Patel)

什么是开放数据？ (What is Open Data?)

In simple terms, Open Data means the kind of data which is open for anyone and everyone for access, modification, reuse, and sharing.

简而言之，“ 开放数据”是指对任何人和所有人开放以供访问，修改，重用和共享的数据类型。

Open Data derives its base from various “open movements” such as open source, open hardware, open government, open science etc.

开放数据源于各种“开放运动”，例如开放源代码，开放硬件，开放政府，开放科学等。

Governments, independent organizations, and agencies have come forward to open the floodgates of data to create more and more open data for free and easy access.

各国政府，独立组织和机构已经挺身而出，打开数据的闸门，以创建越来越多的开放数据，以供免费和轻松访问。

为什么开放数据很重要？ (Why Is Open Data Important?)

Open data is important because the world has grown increasingly data-driven. But if there are restrictions on the access and use of data, the idea of data-driven business and governance will not be materialized.

开放数据非常重要，因为世界越来越以数据为驱动力。但是，如果对数据的访问和使用有限制，那么数据驱动型业务和治理的想法将无法实现。

Therefore, open data has its own unique place. It can allow a fuller understanding of the global problems and universal issues. It can give a big boost to businesses. It can be a great impetus for machine learning. It can help fight global problems such as disease or crime or famine. Open data can empower citizens and hence can strengthen democracy. It can streamline the processes and systems that the society and governments have built. It can help transform the way we understand and engage with the world.

因此，开放数据有其独特的位置。它可以使人们对全球问题和普遍问题有更全面的了解。它可以极大地促进企业发展。这可能是机器学习的强大动力。它可以帮助解决疾病，犯罪或饥荒等全球性问题。开放数据可以增强公民权能，因此可以加强民主。它可以简化社会和政府建立的流程和系统。它可以帮助改变我们理解和与世界互动的方式。

So here’s my list of 15 awesome Open Data sources:

因此，这是我列出的15个很棒的开放数据源的清单：

1. 世界银行公开数据 (1. World Bank Open Data)

As a repository of the world’s most comprehensive data regarding what’s happening in different countries across the world, World Bank Open Data is a vital source of Open Data. It also provides access to other datasets as well which are mentioned in the data catalog.

作为有关世界不同国家正在发生的事情的全球最全面数据的存储库，世界银行开放数据是开放数据的重要来源。它还提供对数据目录中提到的其他数据集的访问。

World Bank Open Data is massive because it has got 3000 datasets and 14000 indicators encompassing microdata, time series statistics, and geospatial data.

世界银行开放数据之所以庞大，是因为它拥有3000个数据集和14000个指标，其中包括微数据，时间序列统计信息和地理空间数据。

Accessing and discovering the data you want is also quite easy. All you need to do is to specify the indicator names, countries or topics and it will open up the treasure-house of Open Data for you. It also allows you to download data in different formats such as CSV, Excel, and XML.

访问和发现所需的数据也非常容易。您所需要做的就是指定指标名称，国家或主题，这将为您打开开放数据的宝库。它还允许您下载不同格式的数据，例如CSV，Excel和XML。

If you are a journalist or academic, you will be enthralled by the array of tools available to you. You can get access to analysis and visualization tools that can bolster your research. It can felicitate a deeper and better understanding of global problems.

如果您是新闻工作者或学术界人士，那么您将被一系列可用的工具所吸引。您可以访问可以增强您的研究的分析和可视化工具。它可以促进对全球问题的更深入和更好的理解。

You can get access to the API which can help you create the data visualizations you need, live combinations with other data sources and many more such features.

您可以访问API，该API可以帮助您创建所需的数据可视化，与其他数据源的实时组合以及更多此类功能。

Therefore, it’s no surprise that World Bank Open Data tops any list of Open Data sources!

因此，世界银行开放数据在开放数据源的任何列表中居于首位也就不足为奇了！

2. 世卫组织(世界卫生组织)—开放数据仓库 (2. WHO (World Health Organization) — Open data repository)

WHO’s Open Data repository is how WHO keeps track of health-specific statistics of its 194 Member States.

世卫组织的开放数据存储库是世卫组织跟踪其194个会员国特定于健康的统计数据的方式。

The repository keeps the data systematically organized. It can be accessed as per different needs. For instance, whether it is mortality or burden of diseases, one can access data classified under 100 or more categories such as the Millennium Development Goals (child nutrition, child health, maternal and reproductive health, immunization, HIV/AIDS, tuberculosis, malaria, neglected diseases, water and sanitation), non communicable diseases and risk factors, epidemic-prone diseases, health systems, environmental health, violence and injuries, equity etc.

该存储库可以系统地组织数据。可以根据不同需求进行访问。例如，无论是死亡还是疾病负担，人们都可以访问100类或更多类别的数据，例如千年发展目标(儿童营养，儿童健康，孕产妇和生殖健康，免疫，艾滋病毒/艾滋病，结核病，疟疾，被忽视的疾病，水和卫生设施)，非传染性疾病和危险因素，易流行的疾病，卫生系统，环境健康，暴力和伤害，公平等。

For your specific needs, you can go through the datasets according to themes, category, indicator, and country.

根据您的特定需求，您可以根据主题，类别，指标和国家/地区浏览数据集。

The good thing is that it is possible to download whatever data you need in Excel Format. You can also monitor and analyze data by making use of its data portal.

好处是可以以Excel格式下载所需的任何数据。您还可以通过其数据门户监视和分析数据。

The API to the World Health Organization’s data and statistics content is also available.

也可以使用世界卫生组织的数据和统计内容的API。

3. Google Public Data Explorer (3. Google Public Data Explorer)

Launched in 2010, Google Public Data Explorer can help you explore vast amounts of public-interest datasets. You can visualize and communicate the data for your respective uses.

Google公共数据资源管理器于2010年启动，可帮助您探索大量的公共利益数据集。您可以可视化并交流数据以供各自使用。

It makes the data from different agencies and sources available. For instance, you can access data from World Bank, U. S. Bureau of Labor Statistics and U.S. Bureau, OECD, IMF, and others.

它使来自不同机构和来源的数据可用。例如，您可以访问来自世界银行，美国劳工统计局和美国局，经合组织，国际货币基金组织等的数据。

Different stakeholders access this data for a variety of purposes. Whether you are a student or a journalist, whether you are a policy maker or an academic, you can leverage this tool in order to create visualizations of public data.

不同的利益相关者出于各种目的访问此数据。无论您是学生还是新闻工作者，无论您是决策者还是学者，都可以利用此工具来创建公共数据的可视化。

You can deploy various ways of representing the data such as line graphs, bar graphs, maps and bubble charts with the help of Data Explorer.

您可以借助数据资源管理器部署各种表示数据的方式，例如折线图，条形图，地图和气泡图。

The best part is that you would find these visualizations quite dynamic. It means that you will see them change over time. You can change topics, focus on different entries and modify the scale.

最好的部分是您会发现这些可视化非常动态。这意味着您将看到它们随时间变化。您可以更改主题，关注不同的条目并修改比例。

It is easily shareable too. As soon as you get the chart ready, you can embed it on your website or blog or simply share a link with your friends.

它也很容易共享。一旦您准备好图表，就可以将其嵌入到您的网站或博客中，或者简单地与您的朋友共享链接。

4. 在AWS(RODA)上注册开放数据 (4. Registry of Open Data on AWS (RODA))

This is a repository containing public datasets. It is data which is available from AWS resources.

这是一个包含公共数据集的存储库。它是可从AWS资源中获得的数据。

As far as RODA is concerned, you can discover and share the data which is publicly available.

就RODA而言，您可以发现和共享公开可用的数据。

In RODA, you can use keywords and tags for common types of data such as genomic, satellite imagery and transportation in order to search whatever data that you are looking for. All of this is possible on a simple web interface.

在RODA中，可以将关键字和标签用于常见的数据类型，例如基因组，卫星图像和运输，以搜索所需的数据。所有这些都可以在简单的Web界面上实现。

For every dataset, you will discover detail page, usage examples, license information and tutorials or applications that use this data.

对于每个数据集，您将发现详细信息页面，用法示例，许可信息以及使用此数据的教程或应用程序。

By making use of a broad range of compute and data analytics products, you can analyze the open data and build whatever services you want.

通过使用各种计算和数据分析产品，您可以分析开放数据并构建所需的任何服务。

While the data you access is available through AWS resources, you need to bear in mind that it is not provided by AWS. This data belongs to different agencies, government organizations, researchers, businesses and individuals.

尽管您可以通过AWS资源访问您访问的数据，但请记住，它不是由AWS提供的。此数据属于不同的机构，政府组织，研究人员，企业和个人。

5. 欧盟开放数据门户 (5. European Union Open Data Portal)

You can access whatever open data EU institutions, agencies and other organizations publish on a single platform namely European Union Open Data Portal.

您可以访问欧盟机构，机构和其他组织在单一平台(即欧盟开放数据门户)上发布的所有开放数据。

The EU Open Data Portal is home to vital open data pertaining to EU policy domains. These policy domains include economy, employment, science, environment, and education.

欧盟开放数据门户网站是与欧盟政策领域相关的重要开放数据的所在地。这些政策领域包括经济，就业，科学，环境和教育。

Around 70 EU institutions, organizations or departments such as Eurostat, the European Environment Agency, the Joint Research Centre and other European Commission Directorates General and EU Agencies have made their datasets public and allowed access. These datasets have crossed the number of 11700 till date.

大约70个欧盟机构，组织或部门，例如欧盟统计局(Eurostat)，欧洲环境署，联合研究中心以及其他欧盟委员会总局和欧盟机构已将其数据集公开并允许访问。迄今为止，这些数据集的数量已超过11700。

The portal enables easy access. You can easily search, explore, link, download and reuse the data through a catalog of common metadata. You can do so for your specific purposes. It could be commercial or non-commercial purposes.

门户使访问变得容易。您可以通过常见的元数据目录轻松地搜索，浏览，链接，下载和重用数据。您可以根据自己的特定目的进行操作。它可以是商业目的，也可以是非商业目的。

You can search the metadata catalog through an interactive search engine (Data tab) and SPARQL queries (Linked data tab).

您可以通过交互式搜索引擎(“数据”选项卡)和SPARQL查询(“链接的数据”选项卡)搜索元数据目录。

By making use of this catalog, you can gain access to the data stored on the different websites of the EU institutions, agencies and organizations.

通过使用此目录，您可以访问存储在欧盟机构，机构和组织的不同网站上的数据。

6. 五十八 (6. FiveThirtyEight)

It is a great site for data-driven journalism and story-telling.

这是一个以数据为驱动的新闻和故事讲述的好网站。

It provides its various sources of data for a variety of sectors such as politics, sports, science, economics etc. You can download the data as well.

它为政治，体育，科学，经济学等各个领域提供各种数据源。您也可以下载数据。

When you access the data, you will come across a brief explanation regarding each dataset with respect to its source. You will also get to know what it stands for and how to use it.

访问数据时，您会遇到关于每个数据集及其来源的简短说明。您还将了解它代表什么以及如何使用它。

In order to render this data user-friendly, it provides datasets in as simple, non-proprietary formats such as CSV files as possible. Needless to say, these formats can be easily accessed and processed by humans as well as machines.

为了使此数据易于使用，它以尽可能简单，非专有的格式(例如CSV文件)提供数据集。不用说，人类和机器都可以轻松访问和处理这些格式。

With the help of these datasets, you can create stories and visualizations as per your own requirements and preference.

借助这些数据集，您可以根据自己的要求和偏好创建故事和可视化文件。

7. 美国人口普查局 (7. U.S. Census Bureau)

U.S. Census Bureau is the biggest statistical agency of the federal government. It stores and provides reliable facts and data regarding people, places, and economy of America.

美国人口普查局是联邦政府最大的统计机构。它存储并提供有关美国人，地方和经济的可靠事实和数据。

The Census Bureau considers its noble mission to extend its services as the most reliable provider of quality data.

人口普查局认为其扩展服务的崇高使命是最可靠的质量数据提供者。

Whether it is a federal, state, local or tribal government, all of them make use of census data for a variety of purposes. These governments use this data to determine the location of new housing and public facilities. They also make use of it at the time of examining the demographic characteristics of communities, states, and the USA.

无论是联邦政府，州政府，地方政府还是部落政府，他们都出于各种目的使用普查数据。这些政府使用这些数据来确定新房屋和公共设施的位置。他们在检查社区，州和美国的人口统计学特征时也会使用它。

This data is also made use of in planning of transportation systems and roadways. When it comes to deciding quotas and creating police and fire precincts, this data comes in handy. When governments create localized areas of elections, schools, utilities etc, they make use of this data. It is a practice to compile population information once a decade and this data are quite useful in accomplishing the same.

此数据也用于运输系统和道路的规划中。在确定配额以及创建警察和消防区时，此数据非常有用。当政府创建选举，学校，公用事业等的本地化区域时，它们将使用此数据。十年一次汇编人口信息是一种惯例，这些数据对于完成人口信息非常有用。

There are various tools such as American Fact Finder, Census Data Explorer and Quick Facts which are useful in case you want to search, customize and visualize data.

有多种工具，例如American Fact Finder，Census Data Explorer和Quick Facts，在您想要搜索，自定义和可视化数据时非常有用。

For instance, Quick Facts alone contains statistics for all the states, counties, cities and even towns with a population of 5000 or more.

例如，仅《事实》便包含所有州，县，城市甚至人口超过5000的城镇的统计信息。

Likewise, American Fact Finder can help you discover popular facts such as population, income etc. It provides information that is frequently requested.

同样，American Fact Finder可以帮助您发现流行的事实，例如人口，收入等。它提供了经常需要的信息。

The good thing is that you can search, interact with the data, get to know about popular statistics and see the related charts through Census Data Explorer. Moreover, you can also use visual tool to customize data on an interactive maps experience.

好处是，您可以通过Census Data Explorer搜索，与数据进行交互，了解流行的统计信息并查看相关的图表。此外，您还可以使用可视化工具来自定义交互式地图体验中的数据。

8. Data.gov (8. Data.gov)

Data.gov is the treasure-house of US government’s open data. It was only recently that the decision was made to make all government data available for free.

Data.gov是美国政府开放数据的宝库。直到最近才决定免费提供所有政府数据。

When it was launched, there were only 47. There are now 180,000 datasets.

当它启动时，只有47个。现在有180,000个数据集。

Why Data.gov is a great resource is because you can find data, tools, and resources that you can deploy for a variety of purposes. You can conduct your research, develop your web and mobile applications and even design data visualizations.

之所以将Data.gov用作强大的资源，是因为您可以找到可以部署用于各种目的的数据，工具和资源。您可以进行研究，开发Web和移动应用程序，甚至设计数据可视化。

All you need to do is enter keywords in the search box and browse through types, tags, formats, groups, organization types, organizations, and categories. This will facilitate easy access to data or datasets that you need.

您需要做的就是在搜索框中输入关键字，然后浏览类型，标签，格式，组，组织类型，组织和类别。这将有助于轻松访问所需的数据或数据集。

Data.gov follows the Project Open Data Schema — a set of requisite fields (Title, Description, Tags, Last Update, Publisher, Contact Name, etc.) for every data set displayed on Data.gov.

Data.gov遵循项目开放数据架构— Data.gov上显示的每个数据集的一组必填字段(标题，描述，标签，最新更新，发布者，联系人姓名等)。

9. DBpedia (9. DBpedia)

As you know, Wikipedia is a great source of information. DBpedia aims at getting structured content from the valuable information that Wikipedia created.

如您所知，维基百科是一个很好的信息来源。 DBpedia旨在从Wikipedia创建的有价值的信息中获取结构化内容。

With DBpedia, you can semantically search and explore relationships and properties of Wikipedia resource. This includes links to other related datasets as well.

使用DBpedia，您可以在语义上搜索和探索Wikipedia资源的关系和属性。这也包括到其他相关数据集的链接。

There are around 4.58 million entities in the DBpedia dataset. 4.22 million are classified in ontology, including 1,445,000 persons, 735,000 places, 123,000 music albums, 87,000 films, 19,000 video games, 241,000 organizations, 251,000 species and 6,000 diseases.

DBpedia数据集中大约有458万个实体。本体中有422万种，包括1,445,000人，735,000个位置，123,000个音乐专辑，87,000个电影，19,000个视频游戏，241,000个组织，251,000种和6,000种疾病。

There are labels and abstracts for these entities in around 125 languages. There are 25.2 million links to images. There are 29.8 million links to external web pages.

这些实体有大约125种语言的标签和摘要。有2520万个图像链接。有2980万个指向外部网页的链接。

All you need to do in order to use DBpedia is write SPARQL queries against endpoint or by downloading their dumps.

要使用DBpedia，您需要做的就是针对端点编写SPARQL查询或通过下载其转储。

DBpedia has benefitted several enterprises, such as Apple (via Siri), Google (via Freebase and Google Knowledge Graph), and IBM (via Watson), and particularly their respective prestigious projects associated with artificial intelligence.

DBpedia使数家企业受益，例如Apple(通过Siri)，Google(通过Freebase和Google Knowledge Graph)和IBM(通过Watson)，特别是与人工智能相关的著名项目。

10. freeCodeCamp打开数据 (10. freeCodeCamp Open Data)

It is an open source community. Why it matters is because it enables you to code, build pro bono projects after nonprofits and grab a job as a developer.

这是一个开源社区。之所以如此重要，是因为它使您能够编码，在非营利组织之后建立公益项目并获得开发人员的职位。

In order to make this happen, the freeCodeCamp.org community makes available enormous amounts of data every month. They have turned it into open data.

为了实现这一目标，freeCodeCamp.org社区每月都会提供大量数据。他们已将其转换为开放数据。

You will find a variety of things in this repository. You can find datasets, analysis of the same and even demos of projects based on the freeCodeCamp data. You can also find links to external projects involving the freeCodeCamp data.

您将在此存储库中找到各种东西。您可以基于freeCodeCamp数据查找数据集，对项目的相同甚至演示进行分析。您还可以找到涉及freeCodeCamp数据的外部项目的链接。

It can help you with a diversity of projects and tasks that you may have in mind. Whether it is web analytics, social media analytics, social network analysis, education analysis, data visualization, data-driven web development or bots, the data offered by this community can extremely useful and effective.

它可以帮助您解决各种项目和任务。无论是Web分析，社交媒体分析，社交网络分析，教育分析，数据可视化，数据驱动的Web开发还是漫游器，此社区提供的数据都非常有用和有效。

11. Yelp开放数据集 (11. Yelp Open Datasets)

The Yelp dataset is basically a subset of nothing but our own businesses, reviews and user data for use in personal, educational and academic pursuits.

Yelp数据集基本上只是我们自己的业务，评论和用户数据的一个子集，用于个人，教育和学术追求。

There are 5,996,996 reviews, 188,593 businesses, 280,991 pictures and 10 metropolitan areas included in Yelp Open Datasets.

Yelp开放数据集包含5,996,996条点评，188,593家企业，280,991张图片和10个大城市区域。

You can use them for different purposes. Since they are available as JSON files, you can use them in order to teach students about databases. You can use them to learn NLP or for sample production data while you understand how to design mobile apps.

您可以将它们用于不同的目的。由于它们以JSON文件形式提供，因此您可以使用它们来向学生传授有关数据库的知识。在了解如何设计移动应用程序的同时，您可以使用它们来学习NLP或获取示例生产数据。

In this dataset, you will find each file composed of a single object type, one JSON-object per-line.

在此数据集中，您将找到每个由单一对象类型组成的文件，每行一个JSON对象。

12. 联合国儿童基金会数据集 (12. UNICEF Dataset)

Since UNICEF concerns itself with a wide variety of critical issues, it has compiled relevant data on education, child labor, child disability, child mortality, maternal mortality, water and sanitation, low birth-weight, antenatal care, pneumonia, malaria, iodine deficiency disorder, female genital mutilation/cutting, and adolescents.

由于儿童基金会关注各种各样的关键问题，因此它收集了有关教育，童工，儿童残疾，儿童死亡率，孕产妇死亡率，水和卫生，低出生体重，产前保健，肺炎，疟疾，碘缺乏症的相关数据。疾病，女性生殖器残割/切割以及青少年。

UNICEF’s open datasets published on the IATI Registry: http://www.iatiregistry.org/publisher/unicef has been extracted directly from UNICEF’s operating system (VISION) and other data systems, and it reflects inputs made by individual UNICEF offices.

联合国儿童基金会在IATI注册中心( http://www.iatiregistry.org/publisher/unicef)上公开的数据集是直接从联合国儿童基金会的操作系统(VISION)和其他数据系统中提取的，反映了联合国儿童基金会各个办事处的投入。

The good thing is that there is a regular update when it comes to these datasets. Every month, the data is updated in order to make it more comprehensive, reliable and accurate.

好消息是这些数据集会定期更新。每个月，数据都会更新一次，以使其更加全面，可靠和准确。

You can freely and easily access this data. In order to do so, you can download this data in CSV format. You can also preview sample data prior to downloading it.

您可以自由，轻松地访问此数据。为此，您可以CSV格式下载此数据。您还可以在下载样本数据之前对其进行预览。

While anybody can explore and visualize UNICEF’s datasets, there are three principal publishers:

尽管任何人都可以浏览和可视化联合国儿童基金会的数据集，但主要的发布者有以下三个：

UNICEF’s AID TRANSPARENCY PORTAL : You can far more easily access the datasets if you use this portal. It also includes details for each country that UNICEF works in.

联合国儿童基金会的援助透明门户：如果您使用此门户，则可以更加轻松地访问数据集。它还包括儿童基金会工作所在的每个国家的详细信息。

Publisher d-portal : It is, at the moment, in BETA. With this, portal, you can explore IATI data.

Publisher d-portal ：目前在BETA中。有了这个门户，您可以浏览IATI数据。

You can search the information related to development activities, budgets etc. You can explore this information country-wise.

您可以搜索与开发活动，预算等有关的信息。可以在全国范围内探索该信息。

Publisher’s data platform : On this platform, you can easily access statistics, charts, and metrics on data accessed via the IATI Registry. If you click on the headers, you can also sort many of the tables that you see on the platform. You will also find many of the datasets in the platforms in machine-readable JSON format.

发布者的数据平台：在此平台上，您可以轻松访问通过IATI注册中心访问的数据的统计信息，图表和度量。如果单击标题，还可以对平台上看到的许多表进行排序。您还将在平台中找到许多机器可读的JSON格式的数据集。

13. Kaggle (13. Kaggle)

Kaggle is great because it promotes the use of different dataset publication formats. However, the better part is that it strongly recommends that the dataset publishers share their data in an accessible, non-proprietary format.

Kaggle很棒，因为它促进了不同数据集发布格式的使用。但是，更好的是，它强烈建议数据集发布者以一种可访问的非专有格式共享其数据。

The platform supports open and accessible data formats. It is important not just for access but also for whatever you want to do with this data. Therefore, Kaggle Dataset clearly defines the file formats which are recommended while sharing data.

该平台支持开放和可访问的数据格式。这不仅对访问很重要，而且对于您要使用此数据进行的任何操作都非常重要。因此，Kaggle数据集明确定义了共享数据时建议使用的文件格式。

The unique thing about Kaggle datasets is that it is not just a data repository. Each dataset stands for a community that enables you to discuss data, find out public codes and techniques, and conceptualize your own projects in Kernels.

关于Kaggle数据集的独特之处在于，它不仅仅是一个数据存储库。每个数据集代表一个社区，使您可以讨论数据，查找公共代码和技术，以及在内核中概念化自己的项目。

CSV, JSON, SQLite, Archive, Big Query etc. are files types that Kaggle supports. You can find a variety of resources in order to start working on your open data project.

CSV，JSON，SQLite，Archive，Big Query等是Kaggle支持的文件类型。您可以找到各种资源，以开始进行开放数据项目。

The best part is that Kaggle allows you to publish and share datasets privately or publicly.

最好的部分是Kaggle允许您私下或公开发布和共享数据集。

14. LODUM (14. LODUM)

It is the Open Data initiative of the University of Münster. Under this initiative, it is made possible for anyone to access any public information about the university in machine-readable formats. You can easily access and reuse it as per your needs.

这是明斯特大学的开放数据倡议。在此倡议下，任何人都可以以机器可读的格式访问有关大学的任何公共信息。您可以根据需要轻松访问和重用它。

Open data about scientific artifacts and encoded as linked data is made available under this project.

在此项目下，可以获得有关科学人工制品的开放数据并被编码为链接数据。

With the help of Linked Data, it is possible to share and use data, ontologies and various metadata standards. It is, in fact, envisaged that it will be the accepted standard for providing metadata, and the data itself on the Web.

借助链接数据，可以共享和使用数据，本体和各种元数据标准。实际上，可以预见它将成为提供元数据和Web上数据本身的公认标准。

The LODUM team has co-initiated LinkedUniversities.org and LinkedScience.org.

LODUM团队共同发起了LinkedUniversities.org和LinkedScience.org 。

You can use SPARQL editor or SPARQL package of R to analyze data.

您可以使用SPARQL编辑器或R的SPARQL包来分析数据。

SPARQL Package enables to connect to a SPARQL endpoint over HTTP, pose a SELECT query or an update query (LOAD, INSERT, DELETE).

SPARQL软件包使您可以通过HTTP连接到SPARQL端点，进行SELECT查询或更新查询(LOAD，INSERT，DELETE)。

15. UCI机器学习存储库 (15. UCI Machine Learning Repository)

It serves as a comprehensive repository of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms.

它充当数据库，领域理论和数据生成器的综合存储库，机器学习社区使用它们来对机器学习算法进行实证分析。

In this repository, there are, at present, 463 datasets as a service to the machine learning community.

在该存储库中，目前有463个数据集作为对机器学习社区的服务。

The Center for Machine Learning and Intelligent Systems at the University of California, Irvine hosts and maintains it. David Aha had originally created it as a graduate student at UC Irvine.

加利福尼亚大学欧文分校的机器学习和智能系统中心负责托管和维护该中心。 David Aha最初是在加州大学尔湾分校(UC Irvine)的研究生创建的。

Since then, students, educators, and researchers all over the world make use of it as a reliable source of machine learning datasets.

从那时起，全世界的学生，教育者和研究人员都将其用作可靠的机器学习数据集来源。

How it works is that each dataset has its distinct webpage which enlists all the known details including any relevant publications that investigate it. You can download these datasets as ASCII files, often the useful CSV format.

它的工作方式是每个数据集都有其独特的网页，其中列出了所有已知的详细信息，包括进行调查的所有相关出版物。您可以将这些数据集下载为ASCII文件，通常是有用的CSV格式。

The details of datasets are summarized by aspects like attribute types, number of instances, number of attributes and year published that can be sorted and searched.

数据集的详细信息按属性类型，实例数量，属性数量和可以分类和搜索的发布年份等方面进行了汇总。

打开数据门户和搜索引擎： (Open Data Portals and Search Engines:)

While there are plenty of datasets published by numerous agencies every year, very few datasets become recognized and established.

尽管每年都有许多机构发布大量的数据集，但很少有数据集得到认可和建立。

The reason why very few such datasets sustain as useful resource is that it is a challenge to develop, manage and provide the data in a way that people and organizations find it useful and easy to use.

这样的数据集只能作为有用资源来维持的原因是，以人们和组织认为有用和易于使用的方式来开发，管理和提供数据是一个挑战。

However, please find below a list of other few important open data portals and platforms that permit users to access open data quite easily, study the impact and glean valuable insights.

但是，请在下面找到其他一些重要的开放数据门户和平台的列表，这些门户和平台使用户可以非常轻松地访问开放数据，研究其影响并收集有价值的见解。

Google dataset search
Google资料集搜寻
Dataverse
数据宇宙
Open Data Kit
开放数据套件
Ckan
kan
Open Data Monitor
打开数据监控器
Plenar.io
Plenar.io
Open Data Impact Map
开放数据影响图

结论 (Conclusion)

Open data is the order of the day. The world has gradually started moving towards open systems and open data is rightly in sync with that.

开放数据是每天的工作。世界逐渐开始向开放系统迈进，开放数据正与此同步。

The business and organizations which leverage open data will gain a competitive edge and will be able to dominate the future.

利用开放数据的企业和组织将获得竞争优势，并能够支配未来。

翻译自: https://www.freecodecamp.org/news/https-medium-freecodecamp-org-best-free-open-data-sources-anyone-can-use-a65b514b0f2d/

源数据和数据源

你可能感兴趣的:(可视化,大数据,编程语言,python,机器学习)

Python入门(函数) 高育良00003 python 开发语言
一.基础认识一种映射关系1.1什么是函数呢？概念函数是可以重复执行的语句块，可以重复调用作用用于封装语句块，提高代码的重用性1.2函数的定义语法：deffunction():#def为关键字，function为函数名#语句想要执行的操作returnre#re为返回值二.函数的调用函数名后+小括号()表示函数的执行2.1基本用法语法：函数名(实际调用的参数)2.2调用传参2.2.1位置传参最为常见，
python本地连接minio 伶星37 python 网络服务器
在你浏览器能成功访问到你的minio网页，并且成功登录之后。接下来如果你想用python连接数据库，并且想用python连接minio，就可以用这个blog。连接代码client=Minio("localhost:9000",#9000是默认端口号access_key="admin",#你的账户secret_key="password",#你的密码secure=False,#这点我会详细说明)为什
梯度下降法理论理解伶星37 机器学习人工智能
梯度下降法：看似原始却透露着机器学习的本质前提：在研究梯度下降方法之前，你要理解矩阵运算（解析解）的方法矩阵运算目前的缺点只能进行对线性函数经行分析，无法对复杂的函数经行分析什么是梯度，以及梯度向量梯度下降的形象例子以及基本思想有三个兄弟被困在山上，得要死，他们目标是看谁尽快找到山谷中的水源老大比较后选择最陡的方向随便探索一下，就朝较低处走去探测几下就走陡峭的方向梯度下降算法的核心思想就是沿着负梯
头歌实践教学平台 Python程序设计实训答案（三）学习的锅头哥实践教学平台实训答案 python
第七阶段文件实验一文本文件的读取第1关：学习-Python文件之文本文件的读取任务描述本关任务：使用open函数以只写的方式打开文件，打印文件的打开方式。相关知识为了完成本关任务，你需要掌握：文本文件；open函数及其参数；文件打开模式；文件对象常用属性；关闭文件close函数。#请在下面的Begin-End之间按照注释中给出的提示编写正确的代码##########Begin###########
C++开发内存监控工具推荐点云SLAM 开发工具开发环境 c++开发语言 AddProperty gperftools Address 内存监控访问越界
在C++开发中，内存管理是至关重要的，尤其是当程序处理大数据或长时间运行时，内存泄漏或不当使用可能导致性能下降或崩溃。以下是几种常见且有效的内存监控工具，它们可以帮助开发者实时分析、诊断和优化程序的内存使用。1.ValgrindValgrind是一个广泛使用的内存调试和性能分析工具，它的Memcheck工具可以帮助你检查程序中的内存泄漏、内存越界、未初始化内存使用等问题。特点：检测内存泄漏。检查内
python基础之--面相对象--OOP基本特性暴龙胡乱写博客 python 开发语言人工智能
python基础之–面相对象–OOP基本特性文章目录python基础之--面相对象--OOP基本特性一，OOP基本特性1.1封装1.2继承/派生1.2.1基础概念1.2.3继承实现1.3多态1.4对象对成员的操作（补充）1.5私有属性1.6重写魔术方法二，super函数2.1基本使用2.2super().\__init__()一，OOP基本特性OOP的四大基本特性是封装、继承、多态和抽象。1.1封
Dify1.01版本vscode 本地环境搭建运行实践 hamish-wu vscode 编辑器 dify 大模型 python flask
dify是python编写的低代码AI开发平台，是常用的大模型开发平台。本文基于最新的1.0.1版本实践完成，有需要的可以私信交流。咨询免费，详细文档及视频需要一定成本，大概相当于节约的时间成本。搭建环境windows11开发工具vscode搭建步骤：1.Startthedocker-composestackwindow环境下运行docker命令，需要下载docker官网镜像，会遇到timeout
vscode python 入门教程(一) window 10 环境下安装pyenv hamish-wu Python python 开发语言 pyenv
python的环境配置方法很多，由于python有两个大版本，很多时候需要切换某个固定的版本才能运行三方包，所以推荐使用pyenv配置python环境变量pyenv的安装安装方法：Invoke-WebRequest-UseBasicParsing-Uri"https://raw.githubusercontent.com/pyenv-win/pyenv-win/master/pyenv-win/i
Java 大视界 -- Java 大数据在智慧农业精准灌溉与施肥决策中的应用（144）青云交大数据新视界 Java 大视界 java 大数据智慧农业精准灌溉施肥决策数据分析机器学习
亲爱的朋友们，热烈欢迎来到青云交的博客！能与诸位在此相逢，我倍感荣幸。在这飞速更迭的时代，我们都渴望一方心灵净土，而我的博客正是这样温暖的所在。这里为你呈上趣味与实用兼具的知识，也期待你毫无保留地分享独特见解，愿我们于此携手成长，共赴新程！一、欢迎加入【福利社群】点击快速加入：青云交灵犀技韵交响盛汇福利社群点击快速加入2：2024CSDN博客之星创作交流营（NEW)二、本博客的精华专栏：大数据新视
Java 大视界 -- 基于 Java 的大数据机器学习模型的多模态融合技术与应用（143）青云交大数据新视界 Java 大视界 java 大数据机器学习多模态融合智能安防智能客服数据处理
亲爱的朋友们，热烈欢迎来到青云交的博客！能与诸位在此相逢，我倍感荣幸。在这飞速更迭的时代，我们都渴望一方心灵净土，而我的博客正是这样温暖的所在。这里为你呈上趣味与实用兼具的知识，也期待你毫无保留地分享独特见解，愿我们于此携手成长，共赴新程！一、欢迎加入【福利社群】点击快速加入：青云交灵犀技韵交响盛汇福利社群点击快速加入2：2024CSDN博客之星创作交流营（NEW)二、本博客的精华专栏：大数据新视
1-5 Python 入门之运算符的使用 Sa_sa_ki_Haise python
第1关：算术、比较、赋值运算符100任务要求参考答案评论201任务描述相关知识算术运算符比较(关系)运算符赋值运算符编程要求测试说明任务描述在编程时，我们常常需要对数值或对象进行算术、比较运算和赋值运算，以此来实现我们的功能需求。本关介绍Python中的一些基本运算符，并要求对给定的苹果和梨的数量进行算术运算、比较、赋值运算，然后输出相应的结果。相关知识要实现上述功能，需要用到Python中的各种
2025年第二届机器学习与神经网络国际学术会议(MLNN 2025) 分享学术科研与论文的禁小默机器学习神经网络人工智能
重要信息官网：www.icmlnn.org时间：2025年4月22-24日地点：中国-重庆简介2025年第二届机器学习与神经网络国际学术会议（MLNN2025）围绕学习系统与神经网络的核心理论、关键技术和应用展开讨论，涵盖深度学习、计算机视觉、自然语言处理、强化学习等多个子领域，通过特邀报告、主题演讲、海报展示等形式，展示相关领域的最新研究成果和技术创新。征稿主题神经网络机器学习深度学习算法及应用
rabbitmq + minio +python 上传文件伶星37 rabbitmq python ruby
功能实现RabbitMq接收hello里面传来的消息根据消息在MobileFile里面新建文件新建文件上传到miniopython新建文件importospath='./MobileFile'file_path=os.path.join(path,"new_file.txt")withopen(file_path,"w")asfile:pass转换成函数格式importosdefcreatefil
vscode python 入门教程(二) vscode使用gti 管理代码 hamish-wu vscode ide 编辑器
vscode代码管理需要用管道git的命令，这点和idea的代码管理区别比较大。作为java开发需要自己熟悉适应一下。一、GitHub新建一个仓库过程略二、本地git项目初始化gitinitvscode中可以看到文件状态gitstatus使用gitremote命令吧本地git仓库和远程git仓库链接起来[email protected]提交代码gitcommit-m"评论
【监控系列】open-falcon yunqi1215 Monitor 自动化
Open-Falcon是一款由小米开源的分布式监控系统，具备高性能、高可用性和易扩展的特点。以下从多个维度对其进行详细解析：1.核心特点分布式架构：模块化设计，各组件独立部署，支持水平扩展。高性能：单实例可处理百万级监控指标，采用RPC通信和数据分片优化。灵活的数据模型：支持Tag（标签）标记数据，便于多维查询。实时告警：支持多条件策略、表达式告警及依赖管理。可视化：提供Dashboard和图表，
Python进阶之-加密库cryptography使用详解夏天Aileft Python python 网络加密
✨前言cryptography库是一个强大的Python加密库，提供了对加密算法和协议的高层和低层访问。它是用来实现数据加密、签名、密钥管理等功能的。以下是一些常见用法的详解，帮助你理解如何使用这个库。✨安装首先，你需要确保安装了cryptography库：pipinstallcryptography✨1.对称加密对称加密是指加密和解密使用相同的密钥。Fernet是cryptography库中提供
python列表添加元素的三种方法定义集合数据对象_python 学习第三天可迭代对象（列表，字典，元组和集合）... weixin_39852491
列表，字典，元组和集合列表list列表是由一系列特定元素组成的，元素和元素之间没有任何关联关系，但他们之间有先后顺序关系列表是一种容器列表是序列的一种列表是可以被改变的序列Python中的序列类型简介（sequence）字符串（str）列表（list）元组（tuple）字节串（bytes）字节数组（bytearray）创建空列表的字面值L=[]#L绑定空列表创建非空列表：L=[1,’two’,3,
python~集合详解鱼跃龙 python python集合详解 set集合
集合的基本操作首先需要明确的是：集合(set)是一个无序的不重复元素序列，多用来进行排重；不支持切片和索引取值！1.创建集合>>>a={1,2,4,4}>>>a{1,2,4}>>>type(a)**创建空集合时需要注意：不能直接用大括号，只能用set()；否则创建的是一个字典>>>b=set()>>>type(b)>>>c={}>>>type(c)2.添加元素add()方法是将要添加的元素作为一个
Elasticsearch 搜索引擎原理与实践 AI天才研究院 Python实战自然语言处理人工智能语言模型编程实践开发语言架构设计
作者：禅与计算机程序设计艺术1.简介Elasticsearch是开源分布式搜索引擎，提供搜素、分析、数据可视化等功能。它是一个基于Lucene的全文搜索服务器，能够把结构化或非结构化的数据经过索引生成一个索引库，使其可以被搜索到。在现代Web应用中，搜索功能已经成为不可或缺的一项功能。但是传统上，传统搜索方式需要依赖于数据库查询或者其他复杂的查询接口。而Elasticsearch提供了一种高效、稳
Python密码学：cryptography库零度° python python 密码学
在数字时代，确保数据的安全性和隐私至关重要。Python中的cryptography库是一个全面的包，为Python开发者提供了密码学原语和配方。它支持高级配方和常见密码学算法的低级接口。cryptography库概述cryptography库旨在易于使用且默认安全。它包括各种密码学操作的高级和低级API，如：对称加密非对称加密哈希函数消息认证码（MAC）数字签名密钥管理cryptography库
Python---frozenset集合爱听雨声的北方汉快快乐乐学Python Python
frozenset是set的不可变版本，因此set集合中所有能改变集合本身的方法（如add、remove、discard、xxx_update等），frozenset都不支持；set集合中不改变集合本身的方法，fronzenset都支持。frozenset的作用主要有以下两点：1、当集合元素不需要改变时，使用frozenset代替set更安全。2、当某些API需要不可变对象时，必须用frozens
(python)保障信息安全的加密库-cryptography Marst·Zhang 基础知识实用工具 python
前言cryptography是一个广泛使用的Python加密库，提供了各种加密、哈希和签名算法的实现。它支持多种加密算法，如AES、RSA、ECC等，以及哈希函数（如SHA-256、SHA-384等）和数字签名算法(如DSA、ECDSA等).目录常见用途密码学函数主要功能优点缺点总结常见用途数据加密使用对称加密算法（如AES）对数据进行加密，确保数据在传输或存储过程中的机密性。数字签名生成和验证数
Python if-else对缩进的要求宇寒风暖 python编程 python 开发语言学习笔记
在Python中，缩进是语法的一部分，用于表示代码块的层次结构。if-else语句的代码块必须通过缩进来定义，缩进不正确会导致语法错误或逻辑错误。1.缩进的基本规则1.1缩进的作用缩进用于表示代码块的层次结构。同一代码块中的语句必须具有相同的缩进级别。缩进通常使用4个空格，这是Python官方推荐的风格。1.2示例x=10ifx>5:print("x大于5")#缩进4个空格print("这是if代
一文弄懂 Python assert 断言宇寒风暖 python编程 python 开发语言学习笔记
在Python中，assert是一种用于调试的语句，用于检查某个条件是否为True。如果条件为False，assert会抛出AssertionError异常，并可选地输出错误信息。assert通常用于在开发阶段验证程序的假设条件，确保代码的正确性。1.assert的基本语法1.1语法assertcondition,messagecondition：需要检查的条件表达式。message：可选参数，当
开源项目常见问题解决方案——cryptography 周屹隽
开源项目常见问题解决方案——cryptographycryptographycryptographyisapackagedesignedtoexposecryptographicprimitivesandrecipestoPythondevelopers.项目地址:https://gitcode.com/gh_mirrors/cr/cryptography项目基础介绍cryptography是一个
python 利用pandas实现从CSV导出并格式化后写入.jsonl文件风_流沙 python工具备忘录 python pandas 开发语言
你可以使用pandas库来读取CSV文件，然后通过一些格式化操作将数据转换为JSONL格式并写入文件。JSONL（JSONLines）格式是一种每行一个JSON对象的文件格式。下面是一个示例，演示了如何使用pandas读取CSV文件，处理数据并将其导出到JSONL文件中：示例代码：importpandasaspdimportjson#读取CSV文件df=pd.read_csv('data.csv'
Python文件加密库之cryptography使用详解 Rocky006 python 开发语言
概要在现代信息社会中，数据的安全性变得越来越重要。为了保护敏感信息，文件加密技术被广泛应用。Python的cryptography库提供了强大的加密功能，可以轻松实现文件加密和解密。本文将详细介绍如何使用cryptography库进行文件加密，包含具体的示例代码。cryptography库简介cryptography是Python中一个功能强大且易用的加密库，提供了对称加密、非对称加密、哈希算法、
国内外的网络安全成难题，IPLOOK 2022年用产品筑起“护城墙” 爱浦路 IPLOOK 网络安全安全架构
《爱尔兰时报》和爱尔兰国家广播电台（RTE）于12月31日对2021年爱尔兰科技行业的赢家和弱点进行了年终盘点。双方纷纷表示，2021年爱尔兰科技行业最大的弱点是爱尔兰的网络安全，这一年是一场前所未有的灾难。随着人工智能、大数据、5G等新兴技术的发展，企业面临的威胁日益增加，信息安全的重要性变得越来越突显。现在我们把视线从爱尔兰的网络安全问题拉回到国内的网络安全现状。我国对网络安全问题保持时刻警惕
【Python系列】高效Parquet数据处理策略：合并与分析实践小团团0 python 开发语言
在大数据时代，数据的存储、处理和分析变得尤为重要。Parquet作为一种高效的列存储格式，被广泛应用于大数据处理框架中，如ApacheSpark、ApacheHive等。Parquet是一个开源的列存储格式，它被设计用于支持复杂的嵌套数据结构，同时提供高效的压缩和编码方案，以优化存储空间和查询性能。以下将详细介绍如何使用Python对Parquet文件进行数据处理与合并，并提供相应的源码示例。一、
cryptography，一个神奇的 Python 库！ Sitin涛哥 Python python 开发语言
更多资料获取个人网站：ipengtao.com大家好，今天为大家分享一个神奇的Python库-cryptography。Github地址：https://github.com/pyca/cryptography在当今数字化时代，信息安全越来越受到重视。数据加密是保护数据安全的重要手段之一，而Python的cryptography库提供了丰富的功能来支持各种加密算法和协议。本文将深入探讨crypto
js动画html标签（持续更新中） 843977358 html js 动画 media opacity
1.jQuery 效果 - animate() 方法改变 "div" 元素的高度： $(".btn1").click(function(){ $("#box").animate({height:"300px
springMVC学习笔记 caoyong springMVC
1、搭建开发环境 a>、添加jar文件，在ioc所需jar包的基础上添加spring-web.jar,spring-webmvc.jar b>、在web.xml中配置前端控制器 <servlet> &nbs
POI中设置Excel单元格格式 107x poi style 列宽合并单元格自动换行
引用：http://apps.hi.baidu.com/share/detail/17249059 POI中可能会用到一些需要设置EXCEL单元格格式的操作小结：先获取工作薄对象: HSSFWorkbook wb = new HSSFWorkbook(); HSSFSheet sheet = wb.createSheet(); HSSFCellStyle setBorder = wb.
jquery 获取A href 触发js方法的this参数无效的情况一炮送你回车库 jquery
html如下： <td class=\"bord-r-n bord-l-n c-333\"> <a class=\"table-icon edit\" onclick=\"editTrValues(this);\">修改</a> </td>" j
md5 3213213333332132 MD5
import java.security.MessageDigest; import java.security.NoSuchAlgorithmException; public class MDFive { public static void main(String[] args) { String md5Str = "cq
完全卸载干净Oracle11g sophia天雪 orale数据库卸载干净清理注册表
完全卸载干净Oracle11g A、存在OUI卸载工具的情况下：第一步：停用所有Oracle相关的已启动的服务；第二步：找到OUI卸载工具：在“开始”菜单中找到“oracle_OraDb11g_home”文件夹中 &
apache 的access.log 日志文件太大如何解决 darkranger apache
CustomLog logs/access.log common 此写法导致日志数据一致自增变大。直接注释上面的语法 #CustomLog logs/access.log common 增加： CustomLog "|bin/rotatelogs.exe -l logs/access-%Y-%m-d.log
Hadoop单机模式环境搭建关键步骤 aijuans 分布式
Hadoop环境需要sshd服务一直开启，故，在服务器上需要按照ssh服务，以Ubuntu Linux为例，按照ssh服务如下： sudo apt-get install ssh sudo apt-get install rsync 编辑HADOOP_HOME/conf/hadoop-env.sh文件，将JAVA_HOME设置为Java
PL/SQL DEVELOPER 使用的一些技巧 atongyeye java sql
1 记住密码这是个有争议的功能，因为记住密码会给带来数据安全的问题。但假如是开发用的库，密码甚至可以和用户名相同，每次输入密码实在没什么意义，可以考虑让PLSQL Developer记住密码。位置：Tools菜单－－Preferences－－Oracle－－Logon HIstory－－Store with password 2 特殊Copy 在SQL Window
PHP：在对象上动态添加一个新的方法 bardo 方法动态添加闭包
有关在一个对象上动态添加方法，如果你来自Ruby语言或您熟悉这门语言，你已经知道它是什么...... Ruby提供给你一种方式来获得一个instancied对象，并给这个对象添加一个额外的方法。好！不说Ruby了，让我们来谈谈PHP PHP未提供一个“标准的方式”做这样的事情，这也是没有核心的一部分... 但无论如何，它并没有说我们不能做这样
ThreadLocal与线程安全 bijian1013 java java多线程 threadLocal
首先来看一下线程安全问题产生的两个前提条件： 1.数据共享，多个线程访问同样的数据。 2.共享数据是可变的，多个线程对访问的共享数据作出了修改。实例：定义一个共享数据： public static int a = 0;
Tomcat 架包冲突解决征客丶 tomcat Web
环境： Tomcat 7.0.6 win7 x64 错误表象：【我的冲突的架包是：catalina.jar 与 tomcat-catalina-7.0.61.jar 冲突，不知道其他架包冲突时是不是也报这个错误】严重: End event threw exception java.lang.NoSuchMethodException: org.apache.catalina.dep
【Scala三】分析Spark源代码总结的Scala语法一 bit1129 scala
Scala语法 1. classOf运算符 Scala中的classOf[T]是一个class对象，等价于Java的T.class,比如classOf[TextInputFormat]等价于TextInputFormat.class 2. 方法默认值 defaultMinPartitions就是一个默认值，类似C++的方法默认值
java 线程池管理机制 BlueSkator java线程池管理机制
编辑 Add Tools jdk线程池一、引言第一：降低资源消耗。通过重复利用已创建的线程降低线程创建和销毁造成的消耗。第二：提高响应速度。当任务到达时，任务可以不需要等到线程创建就能立即执行。第三：提高线程的可管理性。线程是稀缺资源，如果无限制的创建，不仅会消耗系统资源，还会降低系统的稳定性，使用线程池可以进行统一的分配，调优和监控。
关于hql中使用本地sql函数的问题（问-答） BreakingBad HQL 存储函数
转自于：http://www.iteye.com/problems/23775 问：我在开发过程中，使用hql进行查询（mysql5）使用到了mysql自带的函数find_in_set()这个函数作为匹配字符串的来讲效率非常好，但是我直接把它写在hql语句里面（from ForumMemberInfo fm,ForumArea fa where find_in_set(fm.userId,f
读《研磨设计模式》-代码笔记-迭代器模式-Iterator bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ import java.util.Arrays; import java.util.List; /** * Iterator模式提供一种方法顺序访问一个聚合对象中各个元素，而又不暴露该对象内部表示 * * 个人觉得，为了不暴露该
常用SQL chenjunt3 oracle sql C++c C#
--NC建库 CREATE TABLESPACE NNC_DATA01 DATAFILE 'E:\oracle\product\10.2.0\oradata\orcl\nnc_data01.dbf' SIZE 500M AUTOEXTEND ON NEXT 50M EXTENT MANAGEMENT LOCAL UNIFORM SIZE 256K ; CREATE TABLESPA
数学是科学技术的语言 comsci 工作活动领域模型
从小学到大学都在学习数学，从小学开始了解数字的概念和背诵九九表到大学学习复变函数和离散数学，看起来好像掌握了这些数学知识，但是在工作中却很少真正用到这些知识，为什么？最近在研究一种开源软件-CARROT2的源代码的时候，又一次感觉到数学在计算机技术中的不可动摇的基础作用，CARROT2是一种用于自动语言分类（聚类）的工具性软件，用JAVA语言编写，它
Linux系统手动安装rzsz 软件包 daizj linux sz rz
1、下载软件 rzsz-3.34.tar.gz。登录linux，用命令 wget http://freeware.sgi.com/source/rzsz/rzsz-3.48.tar.gz下载。 2、解压 tar zxvf rzsz-3.34.tar.gz 3、安装 cd rzsz-3.34 ; make posix 。注意：这个软件安装与常规的GNU软件不
读源码之:ArrayBlockingQueue dieslrae java
ArrayBlockingQueue是concurrent包提供的一个线程安全的队列,由一个数组来保存队列元素.通过 takeIndex和 putIndex来分别记录出队列和入队列的下标,以保证在出队列时不进行元素移动. //在出队列或者入队列的时候对takeIndex或者putIndex进行累加,如果已经到了数组末尾就又从0开始,保证数
C语言学习九枚举的定义和应用 dcj3sjt126com c
枚举的定义 # include <stdio.h> enum WeekDay { MonDay, TuesDay, WednesDay, ThursDay, FriDay, SaturDay, SunDay }; int main(void) { //int day; //day定义成int类型不合适 enum WeekDay day = Wedne
Vagrant 三种网络配置详解 dcj3sjt126com vagrant
Forwarded port Private network Public network Vagrant 中一共有三种网络配置，下面我们将会详解三种网络配置各自优缺点。端口映射(Forwarded port)，顾名思义是指把宿主计算机的端口映射到虚拟机的某一个端口上，访问宿主计算机端口时，请求实际是被转发到虚拟机上指定端口的。Vagrantfile中设定语法为： c
16.性能优化-完结 frank1234 性能优化
性能调优是一个宏大的工程，需要从宏观架构(比如拆分，冗余，读写分离，集群，缓存等)，软件设计（比如多线程并行化，选择合适的数据结构），数据库设计层面（合理的表设计，汇总表，索引，分区，拆分，冗余等）以及微观（软件的配置，SQL语句的编写，操作系统配置等）根据软件的应用场景做综合的考虑和权衡，并经验实际测试验证才能达到最优。性能水很深，笔者经验尚浅，赶脚也就了解了点皮毛而已，我觉得
Word Search hcx2013 search
Given a 2D board and a word, find if the word exists in the grid. The word can be constructed from letters of sequentially adjacent cell, where "adjacent" cells are those horizontally or ve
Spring4新特性——Web开发的增强 jinnianshilongnian spring spring mvc spring4
Spring4新特性——泛型限定式依赖注入 Spring4新特性——核心容器的其他改进 Spring4新特性——Web开发的增强 Spring4新特性——集成Bean Validation 1.1(JSR-349)到SpringMVC Spring4新特性——Groovy Bean定义DSL Spring4新特性——更好的Java泛型操作API Spring4新
CentOS安装配置tengine并设置开机启动 liuxingguome centos
yum install gcc-c++ yum install pcre pcre-devel yum install zlib zlib-devel yum install openssl openssl-devel Ubuntu上可以这样安装 sudo aptitude install libdmalloc-dev libcurl4-opens
第14章工具函数（上） onestopweb 函数
index.html <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/
Xelsius 2008 and SAP BW at a glance blueoxygen BO Xelsius
Xelsius提供了丰富多样的数据连接方式，其中为SAP BW专属提供的是BICS。那么Xelsius的各种连接的优缺点比较以及Xelsius是如何直接连接到BEx Query的呢？以下Wiki文章应该提供了全面的概览。 http://wiki.sdn.sap.com/wiki/display/BOBJ/Xcelsius+2008+and+SAP+NetWeaver+BW+Co
oracle表空间相关 tongsh6 oracle
在oracle数据库中，一个用户对应一个表空间，当表空间不足时，可以采用增加表空间的数据文件容量，也可以增加数据文件，方法有如下几种： 1.给表空间增加数据文件 ALTER TABLESPACE "表空间的名字" ADD DATAFILE '表空间的数据文件路径' SIZE 50M; &nb
.Net framework4.0安装失败 yangjuanjava .net windows
上午的.net framework 4.0，各种失败，查了好多答案，各种不靠谱，最后终于找到答案了和Windows Update有关系，给目录名重命名一下再次安装，即安装成功了！下载地址：http://www.microsoft.com/en-us/download/details.aspx?id=17113 方法： 1.运行cmd，输入net stop WuAuServ 2.点击开