qq_27390023

mmCIF 格式字符串解析

mmCIF（macromolecular Crystallographic Information File）是一种用于存储生物大分子结构数据的标准文件格式。它是 PDB（Protein Data Bank）数据文件格式的一种扩展，用于存储 X 射线晶体学和核磁共振测定的生物大分子的结构信息。

mmCIF 使用一种名为 CIF（Crystallographic Information File）的文本格式，但在生物大分子领域中，通常将其称为 mmCIF。与传统的 PDB 格式相比，mmCIF 提供了更灵活、结构化的数据表示方式，允许更详细和准确地描述生物大分子的结构、实验条件、结晶学信息等。

mmCIF 文件包含数据和元数据，包括原子坐标、晶体学信息、实验条件、分子结构、结合物信息等。每个数据项都有相应的描述，使得文件更容易理解和解释。mmCIF 还支持字典定义，用于规定可用的数据项和它们的含义，使得不同研究中的数据可以一致地描述和共享。

https://mmcif.wwpdb.org/docs/pdb_to_pdbx_correspondences.html

from Bio import PDB
from typing import Mapping, Any, Optional, Sequence, Tuple

#from typing import Any, Mapping, Optional, Sequence, Tuple
import dataclasses
import functools
import io
from absl import logging
import collections
from Bio.Data import SCOPData

### mmCIF string 解析

PdbStructure = PDB.Structure.Structure
PdbHeader = Mapping[str, Any]
ChainId = str
SeqRes = str
MmCIFDict = Mapping[str, Sequence[str]]

# 使用 @dataclasses.dataclass 装饰器，
# 可以轻松地创建一个包含字段、__init__ 方法、__repr__ 方法等的类，而不必手动编写这些代码。
# 当使用 frozen=True 参数时，生成的类将不再允许修改其实例的字段值。

# Used to map SEQRES index to a residue in the structure.
@dataclasses.dataclass(frozen=True)
class ResiduePosition:
    chain_id: str
    residue_number: int
    insertion_code: str
        

@dataclasses.dataclass(frozen=True)
class ResidueAtPosition:
    position: Optional[ResiduePosition]
    name: str
    is_missing: bool
    hetflag: str


@dataclasses.dataclass(frozen=True)
class Monomer:
    id: str
    num: int
        
        
# Note - mmCIF format provides no guarantees on the type of author-assigned
# sequence numbers. They need not be integers.
@dataclasses.dataclass(frozen=True)
class AtomSite:
    residue_name: str
    author_chain_id: str
    mmcif_chain_id: str
    author_seq_num: str
    mmcif_seq_num: int
    insertion_code: str
    hetatm_atom: str
    model_num: int


@dataclasses.dataclass(frozen=True)        
class MmcifObject:
    """Representation of a parsed mmCIF file.

    Contains:
        file_id: A meaningful name, e.g. a pdb_id. Should be unique amongst all
          files being processed.
        header: Biopython header.
        structure: Biopython structure.
        chain_to_seqres: Dict mapping chain_id to 1 letter amino acid sequence. E.g.
          {'A': 'ABCDEFG'}
        seqres_to_structure: Dict; for each chain_id contains a mapping between
           SEQRES index and a ResidueAtPosition. e.g. {'A': {0: ResidueAtPosition,
                                                        1: ResidueAtPosition,
                                                        ...}}
    raw_string: The raw string used to construct the MmcifObject.
    """
    file_id: str
    header: PdbHeader
    structure: PdbStructure
    chain_to_seqres: Mapping[ChainId, SeqRes]
    seqres_to_structure: Mapping[ChainId, Mapping[int, ResidueAtPosition]]
    raw_string: Any
        
    
@dataclasses.dataclass(frozen=True)
class ParsingResult:
    """Returned by the parse function.

    Contains:
        mmcif_object: A MmcifObject, may be None if no chain could be successfully
          parsed.
        errors: A dict mapping (file_id, chain_id) to any exception generated.
    """
    mmcif_object: Optional[MmcifObject]
    errors: Mapping[Tuple[str, str], Any]
    

def _get_atom_site_list(parsed_info: MmCIFDict) -> Sequence[AtomSite]:
    """Returns list of atom sites; contains data not present in the structure."""
    return [AtomSite(*site) for site in zip(  # pylint:disable=g-complex-comprehension
            parsed_info['_atom_site.label_comp_id'],
            parsed_info['_atom_site.auth_asym_id'],
            parsed_info['_atom_site.label_asym_id'],
            parsed_info['_atom_site.auth_seq_id'],
            parsed_info['_atom_site.label_seq_id'],
            parsed_info['_atom_site.pdbx_PDB_ins_code'],
            parsed_info['_atom_site.group_PDB'],
            parsed_info['_atom_site.pdbx_PDB_model_num'],
        )] 


def _get_first_model(structure: PdbStructure) -> PdbStructure:
    """Returns the first model in a Biopython structure."""
    return next(structure.get_models())


def _get_header(parsed_info: MmCIFDict) -> PdbHeader:
    """Returns a basic header containing method, release date and resolution."""
    header = {}

    experiments = mmcif_loop_to_list('_exptl.', parsed_info)
    header['structure_method'] = ','.join([
        experiment['_exptl.method'].lower() for experiment in experiments])

    # Note: The release_date here corresponds to the oldest revision. We prefer to
    # use this for dataset filtering over the deposition_date.
    if '_pdbx_audit_revision_history.revision_date' in parsed_info:
        header['release_date'] = get_release_date(parsed_info)
    else:
        logging.warning('Could not determine release_date: %s',
                        parsed_info['_entry.id'])

    header['resolution'] = 0.00
    for res_key in ('_refine.ls_d_res_high', '_em_3d_reconstruction.resolution',
                    '_reflns.d_resolution_high'):
        if res_key in parsed_info:
            try:
                raw_resolution = parsed_info[res_key][0]
                header['resolution'] = float(raw_resolution)
            except ValueError:
                logging.debug('Invalid resolution format: %s', parsed_info[res_key])

    return header
        


def mmcif_loop_to_list(prefix: str,
                       parsed_info: MmCIFDict) -> Sequence[Mapping[str, str]]:
    """Extracts loop associated with a prefix from mmCIF data as a list.

    Reference for loop_ in mmCIF:
        http://mmcif.wwpdb.org/docs/tutorials/mechanics/pdbx-mmcif-syntax.html

    Args:
        prefix: Prefix shared by each of the data items in the loop.
          e.g. '_entity_poly_seq.', where the data items are _entity_poly_seq.num,
          _entity_poly_seq.mon_id. Should include the trailing period.
        parsed_info: A dict of parsed mmCIF data, e.g. _mmcif_dict from a Biopython
          parser.

    Returns:
        Returns a list of dicts; each dict represents 1 entry from an mmCIF loop.
    """
    cols = []
    data = []
    for key, value in parsed_info.items():
        if key.startswith(prefix):
            cols.append(key)
            data.append(value)

    assert all([len(xs) == len(data[0]) for xs in data]), (
        'mmCIF error: Not all loops are the same length: %s' % cols)

    return [dict(zip(cols, xs)) for xs in zip(*data)]


def mmcif_loop_to_dict(prefix: str,
                       index: str,
                       parsed_info: MmCIFDict,
                       ) -> Mapping[str, Mapping[str, str]]:
    """Extracts loop associated with a prefix from mmCIF data as a dictionary.

    Args:
        prefix: Prefix shared by each of the data items in the loop.
          e.g. '_entity_poly_seq.', where the data items are _entity_poly_seq.num,
        _entity_poly_seq.mon_id. Should include the trailing period.
        index: Which item of loop data should serve as the key.
        parsed_info: A dict of parsed mmCIF data, e.g. _mmcif_dict from a Biopython
          parser.

    Returns:
        Returns a dict of dicts; each dict represents 1 entry from an mmCIF loop,
          indexed by the index column.
    """
    entries = mmcif_loop_to_list(prefix, parsed_info)
    return {entry[index]: entry for entry in entries}


def get_release_date(parsed_info: MmCIFDict) -> str:
    """Returns the oldest revision date."""
    revision_dates = parsed_info['_pdbx_audit_revision_history.revision_date']
    return min(revision_dates)

    
def _get_protein_chains(
      *, parsed_info: Mapping[str, Any]) -> Mapping[ChainId, Sequence[Monomer]]:
    """Extracts polymer information for protein chains only.

    Args:
        parsed_info: _mmcif_dict produced by the Biopython parser.

    Returns:
        A dict mapping mmcif chain id to a list of Monomers.
    """
    # Get polymer information for each entity in the structure.
    entity_poly_seqs = mmcif_loop_to_list('_entity_poly_seq.', parsed_info)

    polymers = collections.defaultdict(list)
    for entity_poly_seq in entity_poly_seqs:
        polymers[entity_poly_seq['_entity_poly_seq.entity_id']].append(
          Monomer(id=entity_poly_seq['_entity_poly_seq.mon_id'],
                  num=int(entity_poly_seq['_entity_poly_seq.num'])))

    # Get chemical compositions. Will allow us to identify which of these polymers
    # are proteins.
    chem_comps = mmcif_loop_to_dict('_chem_comp.', '_chem_comp.id', parsed_info)

    # Get chains information for each entity. Necessary so that we can return a
    # dict keyed on chain id rather than entity.
    struct_asyms = mmcif_loop_to_list('_struct_asym.', parsed_info)

    entity_to_mmcif_chains = collections.defaultdict(list)
    for struct_asym in struct_asyms:
        chain_id = struct_asym['_struct_asym.id']
        entity_id = struct_asym['_struct_asym.entity_id']
        entity_to_mmcif_chains[entity_id].append(chain_id)

    # Identify and return the valid protein chains.
    valid_chains = {}
    for entity_id, seq_info in polymers.items():
        chain_ids = entity_to_mmcif_chains[entity_id]

        # Reject polymers without any peptide-like components, such as DNA/RNA.
        if any(['peptide' in chem_comps[monomer.id]['_chem_comp.type'].lower()
                for monomer in seq_info]):
            for chain_id in chain_ids:
                valid_chains[chain_id] = seq_info
    return valid_chains


def _is_set(data: str) -> bool:
    """Returns False if data is a special mmCIF character indicating 'unset'."""
    return data not in ('.', '?')



@functools.lru_cache(16, typed=False)
def parse(*,
          file_id: str,
          mmcif_string: str,
          catch_all_errors: bool = True) -> ParsingResult:
    """Entry point, parses an mmcif_string.

    Args:
        file_id: A string identifier for this file. Should be unique within the
            collection of files being processed.
        mmcif_string: Contents of an mmCIF file.
        catch_all_errors: If True, all exceptions are caught and error messages are
            returned as part of the ParsingResult. If False exceptions will be allowed
            to propagate.

    Returns:
        A ParsingResult.
    """
    errors = {}
    try:
        # PDB.MMCIFParser 类用于解析 mmCIF（macromolecular Crystallographic Information File）格式
        # 的蛋白质结构数据。
        parser = PDB.MMCIFParser(QUIET=True)
        #print(type(parser))
        #print(dir(parser))
        
        # 在内存中创建一个类文件对象，该对象可以像文件一样进行读写操作，
        # 但实际上是在内存中进行操作，而不是在磁盘上创建文件。
        handle = io.StringIO(mmcif_string)
        #print("handle value:",handle.getvalue())
        
        # parser.get_structure(structure_id, file_path) 方法用于
        # 从一个 PDB 文件中读取结构数据并创建一个 Structure 对象。
        full_structure = parser.get_structure('', handle)
        
        #print(type(full_structure))
        #print("full_structure:",full_structure)

    
        #for structure in next(full_structure.get_models()):
        #    print(type(structure))
        #    print(structure)
        
        #model = full_structure.get_models()
        #models = list(model)
        #print("模型：")
        #print(models)
        #print("模型的数量：")
        #print(len(models))
        
        first_model_structure = _get_first_model(full_structure)
        #print("first_model_structure:",first_model_structure)
        
        # Extract the _mmcif_dict from the parser, which contains useful fields not
        # reflected in the Biopython structure.
        parsed_info = parser._mmcif_dict  # pylint:disable=protected-access
        #print("parsed_info:",parsed_info)
        
        # Ensure all values are lists, even if singletons.
        for key, value in parsed_info.items():           
            if not isinstance(value, list):
                parsed_info[key] = [value]

        header = _get_header(parsed_info) # 方法、日期、分辨率
        #print(header) 
        
        # Determine the protein chains, and their start numbers according to the
        # internal mmCIF numbering scheme (likely but not guaranteed to be 1).
        valid_chains = _get_protein_chains(parsed_info=parsed_info)
        # valid_chains 字典类型，键为链名，值为Monomer实例（氨基酸名称和编号）
        # {'A': [Monomer(id='MET', num=1), Monomer(id='GLY', num=2),...],...}
        
        if not valid_chains:
            return ParsingResult(
              None, {(file_id, ''): 'No protein chains found in this file.'})
        
        # 每条链的起始氨基酸编号
        seq_start_num = {chain_id: min([monomer.num for monomer in seq])
                         for chain_id, seq in valid_chains.items()}
        
        # Loop over the atoms for which we have coordinates. Populate two mappings:
        # -mmcif_to_author_chain_id (maps internal mmCIF chain ids to chain ids used
        # the authors / Biopython).
        # -seq_to_structure_mappings (maps idx into sequence to ResidueAtPosition).
        mmcif_to_author_chain_id = {}
        seq_to_structure_mappings = {}
        
        
        for atom in _get_atom_site_list(parsed_info):
            if atom.model_num != '1':
                # We only process the first model at the moment.
                continue

            mmcif_to_author_chain_id[atom.mmcif_chain_id] = atom.author_chain_id
            
            # HETATM 表示非标准的、异构的、或非生物分子的原子。
            # ATOM 标记表示蛋白质或核酸原子
            if atom.mmcif_chain_id in valid_chains:  #蛋白质链中的原子
                hetflag = ' '
                if atom.hetatm_atom == 'HETATM':
                    # Water atoms are assigned a special hetflag of W in Biopython. We
                    # need to do the same, so that this hetflag can be used to fetch
                    # a residue from the Biopython structure by id.
                    if atom.residue_name in ('HOH', 'WAT'):
                        hetflag = 'W'
                    else:
                        hetflag = 'H_' + atom.residue_name
                insertion_code = atom.insertion_code
                if not _is_set(atom.insertion_code):
                    insertion_code = ' '
                position = ResiduePosition(chain_id=atom.author_chain_id,
                                           residue_number=int(atom.author_seq_num),
                                           insertion_code=insertion_code)
                
                # 重新索引（从0开始编码），蛋白质结构数据中每条链的起始氨基酸编码不总是从1开始
                # 如 6->5, 1->0
                #print("atom",atom)
            
                # 查看AtomSite类， atom.mmcif_seq_num 为 _atom_site.label_seq_id
                seq_idx = int(atom.mmcif_seq_num) - seq_start_num[atom.mmcif_chain_id]
                #print("seq_start_num[atom.mmcif_chain_id]",seq_start_num[atom.mmcif_chain_id])
                #print(int(atom.mmcif_seq_num))
                #print(seq_idx)
                
                current = seq_to_structure_mappings.get(atom.author_chain_id, {})
                current[seq_idx] = ResidueAtPosition(position=position,
                                                     name=atom.residue_name,
                                                     is_missing=False,
                                                     hetflag=hetflag)
                seq_to_structure_mappings[atom.author_chain_id] = current
         
        #print("seq_to_structure_mappings",seq_to_structure_mappings)
        # {'A': {5: ResidueAtPosition(position=ResiduePosition(chain_id='A', residue_number=13, insertion_code=' '), name='ARG', is_missing=False, hetflag=' '), 
        #        6: ResidueAtPosition(position=ResiduePosition(chain_id='A', residue_number=14, insertion_code=' '), name='ASN', is_missing=False, hetflag=' '),
        #       ...}, ...}
        
        # Add missing residue information to seq_to_structure_mappings.
        for chain_id, seq_info in valid_chains.items():
            # 包括不在结构中的氨基酸
            # print(seq_info)
            # break
            
            author_chain = mmcif_to_author_chain_id[chain_id]
            current_mapping = seq_to_structure_mappings[author_chain]
            
            #print("current_mapping:",current_mapping)
            for idx, monomer in enumerate(seq_info):
                #print ("idx:",idx)
                #print (monomer)
                
                if idx not in current_mapping:
                    current_mapping[idx] = ResidueAtPosition(position=None,
                                                             name=monomer.id,
                                                             is_missing=True,
                                                             hetflag=' ')
                
                #print(current_mapping[idx]) 
                #print("----")
            
            # seq_to_structure_mappings[author_chain] = current_mapping ## 为什么不加这一句也可以？？
            # print("seq_to_structure_mappings['A'][0]:",seq_to_structure_mappings['A'][0])
            
        author_chain_to_sequence = {}
        for chain_id, seq_info in valid_chains.items():
            author_chain = mmcif_to_author_chain_id[chain_id]
            seq = []
            for monomer in seq_info:
                code = SCOPData.protein_letters_3to1.get(monomer.id, 'X')           
                
                seq.append(code if len(code) == 1 else 'X')
            seq = ''.join(seq)
            
            #print("seq:", seq) 
            
            author_chain_to_sequence[author_chain] = seq
            #print(author_chain_to_sequence)
        
        #print("mmcif_object construction")      
        #print("file_id:", file_id)
        #print("header:", header)
        #print("first_model_structure:", first_model_structure)
        #print("author_chain_to_sequence:", author_chain_to_sequence)
        # print("seq_to_structure_mappings:", seq_to_structure_mappings)
        #print("parsed_info:", parsed_info)
        
        mmcif_object = MmcifObject(file_id=file_id,
                                   header=header,
                                   structure=first_model_structure,
                                   chain_to_seqres=author_chain_to_sequence,
                                   seqres_to_structure=seq_to_structure_mappings,
                                   raw_string=parsed_info)
        
        # print("mmcif_object:",mmcif_object)
        
        return ParsingResult(mmcif_object=mmcif_object, errors=errors)
    except Exception as e:  # pylint:disable=broad-except
        errors[(file_id, '')] = e
        if not catch_all_errors:
            raise
        return ParsingResult(mmcif_object=None, errors=errors)


### 从 https://www.rcsb.org 下载好8jlp.cif 文件
with open("8jlp.cif") as f:
    mmcif_str = f.read()
                        
result = parse(file_id = '8jlp.cif', mmcif_string = mmcif_str)
print(result)

如何使用 Python+Flask+win32print 实现简易网络打印服务江梦寻 python flask 开发语言后端 pytest web3.py win32
Python实现网络打印机：Flask+win32print在工作场景中，我们可能需要一个简单的网页接口，供他人上传文档并自动打印到指定打印机。本文将演示如何使用Python+Flask+win32print库来实现这一需求。代码详见：https://github.com/poboll/webprint1.环境准备Windows10/11Python3.8+打印机（已安装并可用）Flaskpywi
Python 文档测试赔罪 Python 系统学习 python 服务器前端
目录文档测试练习小结文档测试如果你经常阅读Python的官方文档，可以看到很多文档都有示例代码。比如re模块就带了很多示例代码：>>>importre>>>m=re.search('(?>>m.group(0)'def'可以把这些示例代码在Python的交互式环境下输入并执行，结果与文档中的示例代码显示的一致。这些代码与其他说明可以写在注释中，然后，由一些工具来自动生成文档。既然这些代码本身就可以
Python Web开发（三）：HTTP请求的url路由是Dream呀 python 前端 http django 后端
本文目录：一、要实现的目标二、创建项目app1.APP介绍2.创建APP三、返回页面内容给浏览器四、url路由1.添加路由记录1.1解决ERROR:Couldnotfindaversionthatsatisfiestherequirementxxx1.2启动web服务2.路由子表`【系列好文推荐】`前言：作者简介：是Dream呀，华为云享专家、CSDN原力计划作者、Python领域优质创作者，专注
深入理解 Python 中的 copy 与 deepcopy 的使用 web安全工具库 python 开发语言
各类资料学习下载合集https://pan.quark.cn/s/8c91ccb5a474在Python中，数据的复制是一个重要的操作，尤其是在处理复杂数据结构（如列表、字典、集合等）时。copy和deepcopy是Python标准库copy模块提供的两种复制方法。它们之间有着明显的区别，理解这些区别对于避免潜在的错误和数据问题至关重要。本文将详细介绍copy和deepcopy的用法，包括代码示例
python前景和待遇-Python就业前景怎么样？薪资待遇多少 weixin_37988176
Python就业前景怎么样？薪资待遇多少？Python上手容易，入门简单Python是一门面向对象的编程语言，编译速度超快。它具有丰富和强大的库，常被称为"胶水语言”，能够把用其他语言编写的各种模块（尤其是C/C）很轻松地联结在一起。其特点在于灵活运用，因为其拥有大量第三方库，所以开发人员不必重复造轮子，就像搭积木一样，只要擅于利用这些库就可以完成绝大部分工作。如果你想选择一种语言来入门编程，那么
Python开发行业薪资多少？ Java大师兄-威哥 Python 编程 IT技术程序员 IT
大家都知道，人工智能越来越受欢迎了。而Python由于简单易用，是人工智能领域中使用最广泛的编程语言之一，它可以无缝地与数据结构和其他常用的AI算法一起使用。Python开发行业薪资多少？我们看看图片就能知道个大概。无论是国内还是国外对于编程语言的热度调查中，Python都是数得上名的。Python热度的持续升温，自然也引起了开源团队的项目。由于OSI认可的开放源码许可，程序员可以使用Python
UI自动化：Python + Selenium4.6+版本环境搭建双子测试自动化 python
以下是Python+Selenium4.12+环境搭建的详细步骤（无需手动下载浏览器驱动，利用SeleniumManager自动管理驱动）：1.安装Python1.1下载并安装Python官网下载地址：DownloadPython|Python.org安装时勾选AddPythontoPATH（自动配置环境变量）。1.2验证Python安装bash复制python--version#输出Python
python工资一般多少-Python开发的工资一般多少编程大乐趣
原标题：Python开发的工资一般多少Python开发的工资一般多少？要想知道Python开发的工资，就要先看看Python开发工程师的发展前景怎么样。Python的用武之地很多，它可读性好且开发效率很高、有着丰富的第三方库。（如GUI、API、开发框架）随着Python的流行，带动的是它的普及以及市场需求量。Python的未来薪资，究竟会朝怎样的方向发展呢？薪资的变化始终符合经济学原理：价格由供
Python就业薪资怎么样？前景如何？田野猫咪 Python 计算机 python 人工智能数据挖掘
Python是一种全栈的开发语言，你如果能学好Python，前端，后端，测试，大数据分析，爬虫等这些工作你都能胜任。那么Python现在在国内的就业薪资高吗？Python就业薪资怎么样？前景如何？对于这些问题，下面小编整理相关内容为大家详情解析，一起来了解吧~如果你也对Python感兴趣，想通过学习Python转行、做副业或者提升工作效率，我也为大家整理了一份【最新全套Python学习资料】一定对
python程序员工资高吗？ lmseo5hy python培训 python程序员
据统计数据显示，北京Python平均薪资为18860元，Python不同岗位薪资范围为：Python全栈开发工程师（10k-20K）、Python运维开发工程师（15k-20K）、Python高级开发工程师（15k-30K）、Python大数据工程师（15K-30K）、Python机器学习工程师（15k-30K）、Python架构师（20k-40k）等，相比于Java、PHP、C#等其他的编程语言
Python代码缩进及Pycharm中代码缩进 Hi~晴天大圣 Python python pycharm 缩进
1、代码缩进是编写Python代码时非常重要的部分，因为Python使用缩进来表示代码块。你可以选择使用Tabs或Spaces来进行缩进。2、在Python中，不建议将使用Tab键快捷缩进和点击使用Space（空格）进行缩进混用，虽然在很多时候Tab键为使用Space缩进4个空格的快捷方式，如Pycharm中Tab键为使用Space缩进4个空格的快捷方式：不同的编辑器或IDE对Tab和Space的
Python爬取58同城广州房源+可视化分析 R3eE9y2OeFcU40
感谢关注天善智能，走好数据之路↑↑↑欢迎关注天善智能，我们是专注于商业智能BI，人工智能AI，大数据分析与挖掘领域的垂直社区，学习，问答、求职一站式搞定！对商业智能BI、大数据分析挖掘、机器学习，python，R等数据领域感兴趣的同学加微信：tstoutiao，邀请你进入数据爱好者交流群，数据爱好者们都在这儿。消失了一段时间，这段时间在CSDN阅读了不少关于Python爬虫的文章，也学习了秦璐老师
如何用Python爬取Google新闻 2501_90631432 谷歌 python 人工智能开发语言
什么是Google新闻？Google新闻是Google推出的一项新闻聚合服务。它收集、整理和展示来自全球主要新闻网站的最新新闻报道。用户可以按关键词、主题、地区、发布来源等进行筛选，Google新闻算法会根据用户的兴趣和浏览习惯推荐个性化的新闻内容。Google新闻数据主要来自权威新闻机构、博客、政府公告等，因此它是获取全球实时信息的重要来源。你可以从Google新闻中获取哪些数据？新闻标题(ti
python 面向对象(类和对象)（详细版）帅维维 python面向对象 python 开发语言后端
学习任务1.理解面向过程编程和面向对象编程思想2.明确类和对象的关系，会独立设计和使用类3.会使用类创建对象，并添加属性4.掌握类的属性和方法5.掌握构造方法和析构方法的使用重点1.self的使用2.构造方法和析构方法3.类属性和实例属性4.方法的重载引入面向过程：先分析解决问题的步骤，使用函数把这些步骤以此实现，使用的时候需要逐个调用函数。面向对象：把解决问题的事物分为多个对象，对象具备解决问题
【Python运维】实现高效的自动化备份与恢复：Python脚本从入门到实践蒙娜丽宁 Python杂谈运维运维 python 自动化
《PythonOpenCV从菜鸟到高手》带你进入图像处理与计算机视觉的大门！解锁Python编程的无限可能：《奇妙的Python》带你漫游代码世界在信息化时代，数据备份和恢复的有效性对企业和个人来说至关重要。本文将带领读者深入了解如何使用Python编写自动化备份与恢复脚本，确保重要数据的安全。本篇文章涵盖了文件系统的备份、MySQL数据库的备份与恢复、定期任务的自动化调度等内容。我们将通过大量的
智能交通违章处理系统：AI赋能下的智慧交通解决方案 Echo_Wish Python 笔记 Python 算法人工智能
友友们好！我是Echo_Wish，我的的新专栏《Python进阶》以及《Python！实战！》正式启动啦！这是专为那些渴望提升Python技能的朋友们量身打造的专栏，无论你是已经有一定基础的开发者，还是希望深入挖掘Python潜力的爱好者，这里都将是你不可错过的宝藏。在这个专栏中，你将会找到：●深入解析：每一篇文章都将深入剖析Python的高级概念和应用，包括但不限于数据分析、机器学习、Web开发
1745. 分割回文串 IV 咔咔咔的 leetcode c++
1745.分割回文串IV题目链接：1745.分割回文串IV代码如下：//参考链接：https://leetcode.cn/problems/palindrome-partitioning-iv/solutions/3589992/zhi-jie-diao-yong-1278-ti-dai-ma-pythonj-u7pwclassSolution{public:boolcheckPartitioni
Python基础教程学习笔记第九章魔法方法，特性，迭代器只想开始 python
文章目录一，构造函数：\_\_init\_\_二，重写普通方法和特殊的构造函数拓展三，元素访问注意五，函数property5.1property特性5.2静态方法和类方法5.3\_\_getattr__、\_\_setattr__等方法注意六，迭代器iter6.1迭代器协议七，生成器7.1简单生成器7.2递归式生成器注意7.3通用生成器7.4生成器的方法拓展：7.5模拟生成器一，构造函数：__in
Python 中的异步与同步：解析与实践子墨将大数据 python
Python中的异步与同步：深度解析与实践在Python编程世界里，异步和同步的概念是理解程序执行流程和性能优化的关键。这篇文章将带你深入了解它们的差异，以及阻塞和非阻塞的特性，同时通过实际代码示例来加深理解。异步与同步的定义异步异步意味着多任务处理，任务之间的执行没有严格的先后顺序，甚至可以同时运行。这就好比你一边听音乐，一边浏览网页，听音乐和浏览网页这两个任务之间互不干扰，多条任务的执行路径同
数据结构：python实现最大堆算法 cqbelt python 算法数据结构
概念最大堆是一种完全二叉树，父节点的值总是大于或等于其子节点的值。通常，最大堆可以用数组来实现。最大堆的主要操作包括插入元素和提取最大值。在Python中，可以用一个列表来存储堆的元素。索引从0开始的话，父节点和子节点的位置关系需要确定。对于索引i的节点，其左子节点是2i+1，右子节点是2i+2，父节点则是(i-1)//2。插入元素时，需要将新元素添加到数组的末尾，然后进行上浮操作（percola
PyThon最详细入门语法笔记带ta去蒙古国 python 字符串列表机器学习深度学习
写在前面：这篇笔记是由本人原创(也是第一篇原创，萌新.jpg)，兄弟萌如果觉得不错的话，可以点个关注或收藏，方便以后查阅呀。文章目录前言一、PyThon数据：常量与变量1.常量1.1整型1.2浮点型1.3字符串1.4布尔型2.变量2.1查看变量类型：type(变量名)2.2强制类型转换：类型名(变量名)二、PyThon数据运算1.数学运算1.1加：``var1+var2``1.2减：``var1-
二.Python开发环境搭建许理 001python python 开发语言
1.环境搭建开发环境搭建（Python3环境搭建|菜鸟教程(runoob.com)）主要就是安装Python的解释器2.解释器分类Python的解释器分类：CPython（官方）用c语言编写的Python解释器PyPy用Python语言编写的Python解释器IronPython用.net编写的Python解释器Jython用Java编写的Python解释器3.步骤：1.下载安装包python-3
《Python基础教程》第2-4章笔记：列表和元组、字符串、字典 WalkingComputer python 笔记开发语言教程入门
《Python基础教程》第1章笔记https://blog.csdn.net/holeer/article/details/143052930目录第2章列表和元组2.1序列概述2.2通用的序列操作2.3列表：Python的主力2.3.1函数list2.3.2基本的列表操作2.3.3列表方法2.4元组：不可修改的序列第3章使用字符串3.2设置字符串的格式：精简版3.3设置字符串的格式：完整版3.3.
实现NTLM relay攻击工具的Python代码示例 go5463158465 python python 开发语言
以下是一个实现NTLMrelay攻击工具的Python代码示例，该工具可以完成自动扫描IP、配置相关协议、获取hash、自动化设置和执行攻击步骤等功能。代码思路IP扫描：使用scapy库进行IP扫描，找出活跃的IP地址。Responder配置：自动配置Responder工具，监听指定的协议。攻击执行：使用ntlmrelayx工具执行NTLMrelay攻击。日志处理：记录每个步骤的日志和错误信息，并
python经济模型，用于模拟不同政策对财富分配 Atlas Shepherd python python 人工智能算法
用python实现的一个经济模型，用于模拟不同政策对财富分配的影响。它主要包含以下几个部分：类和初始化方法：HierarchyLevel类：代表一个层级，具有层级ID、区域、资源、资产和属性。Hierarchy类：代表整个层级结构，包含多个HierarchyLevel实例，以及用于模拟的方法。层级结构管理：add_level：向层级结构中添加一个新的层级。calculate_total_resou
c#视觉应用开发中如何在C#中处理多光谱图像？ openwin_top C#视觉应用开发问题系列 c#开发语言计算机视觉视觉检测
microPythonPython最小内核源码解析NI-motion运动控制c语言示例代码解析python编程示例系列python编程示例系列二python的Web神器Streamlit如何应聘高薪职位在C#中处理多光谱图像（MultispectralImaging,MSI）通常涉及多个步骤，包括图像读取、处理和显示。多光谱图像包含多个频带（通常超过人类视觉的RGB频带），需要特殊处理才能进行分析
python使用flask框架ORM操作mysql oracle QMQ2021 flask python mysql
python使用flask框架ORM操作mysqloracle示例一：python调用flask框架调用方法输出示例二：python调用flask连接MySQL示例三：oracle连接需要指定instant_clientoracle需要下载instant_client示例四:mysqloracle共存(多库连接)扩展本文章记录着python使用flaskORM连接mysqloracle数据库的方法
numpy版本踩坑总结持续更新 AI算法网奇 python宝典 python基础 numpy
目录1.23版本报错module'numpy'hasnoattribute'bool'.协方差矩阵第2次优化：1.23版本影响库smplx报错module'numpy'hasnoattribute'bool'.解决方法：pipinstallnumpy==1.23.2测试版本命令：python-c"importnumpyasnp;print(np.__version__)"
blender python 不同的obj alpha设置不同颜色并保存 AI算法网奇 3d渲染深度学习宝典 jvm html
目录生成遮罩层，并且渲染保存：生成蓝色遮罩层并保存遮罩结果：blender不同的obj设置不同的alpha颜色节点模式实现：生成遮罩层，并且渲染保存：importbpyimportosbpy.context.view_layer.objects.active=bpy.context.selected_objects[0]bpy.ops.object.delete()bpy.ops.import_s
【Python】Python中的heapq模块详解，令人费解的_siftup与_siftdown函数观海胸襟阔 Python 数据结构与算法 python 算法数据结构
【Python】Python中的heapq模块详解，令人费解的_siftup与_siftdown函数heapq模块基本介绍小顶堆的定义和特性heapq模块的作用heappush函数heappop函数heappush函数与_siftdown函数改良后的heap_push函数heappop函数与_siftup函数改良后的heap_pop函数heapq模块与改进后的HeapLittle的验证heapq模
对于规范和实现，你会混淆吗？ yangshangchuan HotSpot
昨晚和朋友聊天，喝了点咖啡，由于我经常喝茶，很长时间没喝咖啡了，所以失眠了，于是起床读JVM规范，读完后在朋友圈发了一条信息： JVM Run-Time Data Areas：The Java Virtual Machine defines various run-time data areas that are used during execution of a program. So
android 网络百合不是茶网络
android的网络编程和java的一样没什么好分析的都是一些死的照着写就可以了,所以记录下来方便查找 , 服务器使用的是TomCat 服务器代码; servlet的使用需要在xml中注册 package servlet; import java.io.IOException; import java.util.Arr
[读书笔记]读法拉第传 comsci 读书笔记
1831年的时候,一年可以赚到1000英镑的人..应该很少的... 要成为一个科学家,没有足够的资金支持,很多实验都无法完成但是当钱赚够了以后....就不能够一直在商业和市场中徘徊......
随机数的产生沐刃青蛟随机数
c++中阐述随机数的方法有两种：一是产生假随机数（不管操作多少次，所产生的数都不会改变）这类随机数是使用了默认的种子值产生的，所以每次都是一样的。 //默认种子 for (int i = 0; i < 5; i++) { cout<<
PHP检测函数所在的文件名 IT独行者 PHP 函数
很简单的功能，用到PHP中的反射机制，具体使用的是ReflectionFunction类，可以获取指定函数所在PHP脚本中的具体位置。创建引用脚本。代码： [php] view plain copy // Filename: functions.php <?php&nbs
银行各系统功能简介文强chu 金融
银行各系统功能简介　业务系统核心业务系统业务功能包括：总账管理、卡系统管理、客户信息管理、额度控管、存款、贷款、资金业务、国际结算、支付结算、对外接口等清分清算系统以清算日期为准，将账务类交易、非账务类交易的手续费、代理费、网络服务费等相关费用，按费用类型计算应收、应付金额，经过清算人员确认后上送核心系统完成结算的过程国际结算系
Python学习1(pip django 安装以及第一个project) 小桔子 python django pip
最近开始学习python,要安装个pip的工具。听说这个工具很强大，安装了它，在安装第三方工具的话so easy!然后也下载了，按照别人给的教程开始安装，奶奶的怎么也安装不上！第一步：官方下载pip-1.5.6.tar.gz, https://pypi.python.org/pypi/pip easy! 第二部：解压这个压缩文件，会看到一个setup.p
php 数组 aichenglong PHP 排序数组循环多维数组
1 php中的创建数组 $product = array('tires','oil','spark');//array()实际上是语言结构而不是函数 2 如果需要创建一个升序的排列的数字保存在一个数组中，可以使用range()函数来自动创建数组 $numbers=range(1,10)//1 2 3 4 5 6 7 8 9 10 $numbers=range(1,10,
安装python2.7 AILIKES python
安装python2.7 1、下载可从 http://www.python.org/进行下载#wget https://www.python.org/ftp/python/2.7.10/Python-2.7.10.tgz 2、复制解压 #mkdir -p /opt/usr/python #cp /opt/soft/Python-2
java异常的处理探讨百合不是茶 JAVA异常
//java异常 /* 1，了解java 中的异常处理机制，有三种操作 a,声明异常 b,抛出异常 c,捕获异常 2，学会使用try-catch-finally来处理异常 3，学会如何声明异常和抛出异常 4，学会创建自己的异常 */ //2，学会使用try-catch-finally来处理异常
getElementsByName实例 bijian1013 element
实例1： <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/x
探索JUnit4扩展：Runner bijian1013 java 单元测试 JUnit
参加敏捷培训时，教练提到Junit4的Runner和Rule，于是特上网查一下，发现很多都讲的太理论，或者是举的例子实在是太牵强。多搜索了几下，搜索到两篇我觉得写的非常好的文章。文章地址：http://www.blogjava.net/jiangshachina/archive/20
[MongoDB学习笔记二]MongoDB副本集 bit1129 mongodb
1. 副本集的特性 1)一台主服务器(Primary),多台从服务器(Secondary) 2)Primary挂了之后，从服务器自动完成从它们之中选举一台服务器作为主服务器，继续工作，这就解决了单点故障，因此，在这种情况下，MongoDB集群能够继续工作 3)挂了的主服务器恢复到集群中只能以Secondary服务器的角色加入进来 2
【Spark八十一】Hive in the spark assembly bit1129 assembly
Spark SQL supports most commonly used features of HiveQL. However, different HiveQL statements are executed in different manners: 1. DDL statements (e.g. CREATE TABLE, DROP TABLE, etc.)
Nginx问题定位之监控进程异常退出 ronin47
nginx在运行过程中是否稳定，是否有异常退出过？这里总结几项平时会用到的小技巧。 1. 在error.log中查看是否有signal项，如果有，看看signal是多少。比如，这是一个异常退出的情况： $grep signal error.log 2012/12/24 16:39:56 [alert] 13661#0: worker process 13666 exited on s
No grammar constraints (DTD or XML schema).....两种解决方法 byalias xml
方法一：常用方法关闭XML验证工具栏：windows => preferences => xml => xml files => validation => Indicate when no grammar is specified:选择Ignore即可。方法二：（个人推荐）添加内容如下 <?xml version=
Netty源码学习-DefaultChannelPipeline bylijinnan netty
package com.ljn.channel; /** * ChannelPipeline采用的是Intercepting Filter 模式 * 但由于用到两个双向链表和内部类，这个模式看起来不是那么明显，需要仔细查看调用过程才发现 * * 下面对ChannelPipeline作一个模拟，只模拟关键代码： */ public class Pipeline {
MYSQL数据库常用备份及恢复语句 chicony mysql
备份MySQL数据库的命令，可以加选不同的参数选项来实现不同格式的要求。 mysqldump -h主机 -u用户名 -p密码数据库名 > 文件备份MySQL数据库为带删除表的格式，能够让该备份覆盖已有数据库而不需要手动删除原有数据库。 mysqldump -–add-drop-table -uusername -ppassword databasename > ba
小白谈谈云计算--基于Google三大论文 CrazyMizzz Google 云计算 GFS
之前在没有接触到云计算之前，只是对云计算有一点点模糊的概念，觉得这是一个很高大上的东西，似乎离我们大一的还很远。后来有机会上了一节云计算的普及课程吧，并且在之前的一周里拜读了谷歌三大论文。不敢说理解，至少囫囵吞枣啃下了一大堆看不明白的理论。现在就简单聊聊我对于云计算的了解。我先说说GFS &n
hadoop 平衡空间设置方法 daizj hadoop balancer
在hdfs-site.xml中增加设置balance的带宽，默认只有1M： <property> <name>dfs.balance.bandwidthPerSec</name> <value>10485760</value> <description&g
Eclipse程序员要掌握的常用快捷键 dcj3sjt126com 编程
判断一个人的编程水平，就看他用键盘多，还是鼠标多。用键盘一是为了输入代码（当然了，也包括注释），再有就是熟练使用快捷键。曾有人在豆瓣评《卓有成效的程序员》：“人有多大懒，才有多大闲”。之前我整理了一个程序员图书列表，目的也就是通过读书，让程序员变懒。程序员作为特殊的群体，有的人可以这么懒，懒到事情都交给机器去做，而有的人又可以那么勤奋，每天都孜孜不倦得
Android学习之路 dcj3sjt126com Android学习
转自：http://blog.csdn.net/ryantang03/article/details/6901459 以前有J2EE基础，接触JAVA也有两三年的时间了，上手Android并不困难，思维上稍微转变一下就可以很快适应。以前做的都是WEB项目，现今体验移动终端项目，让我越来越觉得移动互联网应用是未来的主宰。下面说说我学习Android的感受，我学Android首先是看MARS的视
java 遍历Map的四种方法 eksliang java HashMap java 遍历Map的四种方法
转载请出自出处： http://eksliang.iteye.com/blog/2059996 package com.ickes; import java.util.HashMap; import java.util.Iterator; import java.util.Map; import java.util.Map.Entry; /** * 遍历Map的四种方式
【精典】数据库相关相关 gengzg 数据库
package C3P0; import java.sql.Connection; import java.sql.SQLException; import java.beans.PropertyVetoException; import com.mchange.v2.c3p0.ComboPooledDataSource; public class DBPool{
自动补全 huyana_town 自动补全
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"><html xmlns="http://www.w3.org/1999/xhtml&quo
jquery在线预览PDF文件，打开PDF文件天梯梦 jquery
最主要的是使用到了一个jquery的插件jquery.media.js，使用这个插件就很容易实现了。核心代码 <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.
ViewPager刷新单个页面的方法 lovelease android viewpager tag 刷新
使用ViewPager做滑动切换图片的效果时，如果图片是从网络下载的，那么再子线程中下载完图片时我们会使用handler通知UI线程，然后UI线程就可以调用mViewPager.getAdapter().notifyDataSetChanged()进行页面的刷新，但是viewpager不同于listview，你会发现单纯的调用notifyDataSetChanged()并不能刷新页面
利用按位取反（~）从复合枚举值里清除枚举值草料场 enum
以 C# 中的 System.Drawing.FontStyle 为例。如果需要同时有多种效果，如：“粗体”和“下划线”的效果，可以用按位或（|） FontStyle style = FontStyle.Bold | FontStyle.Underline; 如果需要去除 style 里的某一种效果，
Linux系统新手学习的11点建议刘星宇编程工作 linux 脚本
　　随着Linux应用的扩展许多朋友开始接触Linux，根据学习Windwos的经验往往有一些茫然的感觉：不知从何处开始学起。这里介绍学习Linux的一些建议。　　一、从基础开始：常常有些朋友在Linux论坛问一些问题，不过，其中大多数的问题都是很基础的。例如：为什么我使用一个命令的时候，系统告诉我找不到该目录，我要如何限制使用者的权限等问题，这些问题其实都不是很难的，只要了解了 Linu
hibernate dao层应用之HibernateDaoSupport二次封装 wangzhezichuan DAO Hibernate
/** * 方法描述:sql语句查询返回List<Class> * 方法备注: Class 只能是自定义类 * @param calzz * @param sql * @return * 创建人：王川 * 创建时间：Jul

mmCIF 格式字符串解析

你可能感兴趣的:(python,生物信息学)