More than 5 years have passed since last update.

Biopython Tutorial and Cookbook和訳(4.1)

Last updated at 2020-07-01Posted at 2020-05-26

Biopythonのチュートリアルを訳する目的と理由：

生物の学習
Biopythonの勉強
英語の復習
少しでもこの分野に貢献したい

どっちも全くの初心者なので、意味を通じない部分があればぜひご指摘ください。
1週間で１節のペースで訳して行きます。
間違いを発見しやすいように英文も一緒に載せます。
一章を終えたら一つの記事にまとめます。

　Chapter 4 Sequence annotation objects

Chapter 3 introduced the sequence classes.
chp3はシーケンスクラスを紹介しました。

Immediately “above” the Seq class is the Sequence Record or SeqRecord class, defined in the Bio.SeqRecord module.
Seq classの真上(継承の意味？)はSequence Record、あるいはSeqRecordクラス、Bio.SeqRecordモジュール内に定義されています。

This class allows higher level features such as identifiers and features (as SeqFeature objects) to be associated with the sequence, and is used throughout the sequence input/output interface Bio.SeqIO described fully in Chapter 5.
このクラスはidentifiersやfeature（SeqFeatureオブジェクト）のようなハイレベルの特徴をシーケンスと結びつける、そしてchap5に紹介するシーケンスのinput/outputインタフェース - Bio.SeqIOに広く使われています。

If you are only going to be working with simple data like FASTA files, you can probably(おそらく) skip this chapter for now.
もしあなたがFASTAのようなシンプルなデータしか扱わないなら、この章を飛ばしてもたぶん問題ないでしょう。

While this chapter should cover most things to do with the SeqRecord and SeqFeature objects in this chapter,
you may also want to read the SeqRecord wiki page (http://biopython.org/wiki/SeqRecord), and the built in documentation (also online – SeqRecord and SeqFeature):
この章ではSeqRecord や SeqFeature objectsを使って多くのことをこなす必要があり、
SeqRecord wiki pageや関数のドキュメンテーションも参照したほうがいいかもしれないです。

>>> from Bio.SeqRecord import SeqRecord
>>> help(SeqRecord)

　4.1 The SeqRecord object

The SeqRecord (Sequence Record) class is defined in the Bio.SeqRecord module.
SeqRecord (Sequence Record)クラスはBio.SeqRecordモジュール内に定義されています。

This class allows higher level features such as identifiers and features to be associated with a sequence (see Chapter 3),
and is the basic data type for the Bio.SeqIO sequence input/output interface (see Chapter 5).
このクラスはシーケンスに高いレベルの特徴と修飾子を付ける機能を提供しています。
そして、それはBio.SeqIOシーケンスinput/outputインタフェースの基本のデータとなります（chap5を参照）

The SeqRecord class itself is quite simple, and offers the following information as attributes:
SeqRecordクラス自体はとてもシンプルで、以下の情報を属性として使います。

.seq
– The sequence itself, typically a Seq object.
.id
– The primary ID used to identify the sequence – a string. In most cases this is something like an accession number.
.name
– A “common” name/id for the sequence – a string. In some cases this will be the same as the accession number, but it could also be a clone name. I think of this as being analogous to the LOCUS id in a GenBank record.
.description
– A human readable description or expressive name for the sequence – a string.
.letter_annotations
– Holds per-letter-annotations using a (restricted) dictionary of additional information about the letters in the sequence. The keys are the name of the information, and the information is contained in the value as a Python sequence (i.e. a list, tuple or string) with the same length as the sequence itself. This is often used for quality scores (e.g. Section 20.1.6) or secondary structure information (e.g. from Stockholm/PFAM alignment files).
.annotations
– A dictionary of additional information about the sequence. The keys are the name of the information, and the information is contained in the value. This allows the addition of more “unstructured” information to the sequence.
.features
– A list of SeqFeature objects with more structured information about the features on a sequence (e.g. position of genes on a genome, or domains on a protein sequence). The structure of sequence features is described below in Section 4.3.
.dbxrefs
- A list of database cross-references as strings.

.seq
– シーケンス自分自身, 一般的にSeqオブジェクト.
.id
– シーケンスを識別するためのID – 文字列. 多くの場合はアクセッション番号.
.name
– シーケンスの通称 – 文字列. たまにはアクセッション番号かクローンネームとなります('Clone name'が'IR'で始まるクローンが完全長クローンです). GenBank recordの遺伝子座IDに似ています.
.description
– 人間が可読な情報あるいはシーケンスの意味を表す名前 – 文字列.
.letter_annotations
– （制限された）シーケンスの文字に関する付加情報（辞書型）を利用することで文字ごとのアノテーションを保持します. キーは情報の名前で, 情報は値に含まれます (i.e. リスト、タプルあるいは文字列) シーケンスと同じ長さになります. 品質スコア (e.g. Section 20.1.6) あるいは二次構造情報によく使われます (e.g. from Stockholm/PFAM alignment files).
.annotations
– シーケンス付加情報（辞書型）. キーは情報の名前, 値は情報. シーケンスにより多くの付加情報を追加することができます.
.features
– SeqFeature objectsのリスト型でより多くの構造化情報が含まれるシーケンスの特徴データ (e.g. ゲノム上の遺伝子位置情報, あるいはたんぱく質シーケンスのドメイン). Section 4.3では構造化したシーケンスの特徴を言及します.
.dbxrefs
– データベースの相互参照リスト(文字列).
4.2へ

You get articles that match your needs
You can efficiently read back useful information
You can use dark theme

What you can do with signing up