Polymorphisms (SNPs) > haplotypes(P

Polymorphisms (SNPs) > haplotypes
(Polymorphisms within the same locus that passes the criteria threshold) > Genotypes
(Haplotypes that passes the criteria threshold)
6- Correct, Export and Reload data with Stacks (optional)
6a- Stacks - Export_sql.pl
This program will allow to export data from the MySQL database into a Excel or TSV file.
The same filters accessible from the website can be enter in the command line.
6a.1- Create directories “db_export” in the directory
“stacks_analysis” to output the catalog files generated from export_sql.pl.
6a.2- Run export_sql.pl using the command line (3 examples)
To get a full spectrum of the data for all samples, 3 files should be exported: (1) to get the
haplotype, (2) to get the number of reads per haplotype (3) to get the genotype. The best loci
to be use as markers have to be found in a majority of progeny and have only 1 allele in the
parent. Two distinct type of usable markers can be extract from the data: (I) Genotype AA/--
(not listed as genotype by Stacks) which correspond to haplotypes with 1 allele present in 1
parent and about 75% of the F2 progeny. (II) Genotype AA/BB which correspond to 2
alleles each present in a different parent and about 75% of the F2 progeny.
> cd /usr/local/share/stacks/scripts
(1)> export_sql.pl -D db_name_radtags -b 1 -a haplo
-f /path/to/db_export/db_name_radtags_export_haplo
-o xls -L 3
-F snps_u=1 -F alle_u=2 -F pare_l=1 -F prog=50
(2)>export_sql.pl -D db_name_radtags -b 1 -a haplo
-f /path/to/db_export/db_name_radtags_export_haplo_depth
-o xls -L 3
-F snps_u=1 -F alle_u=2 -F pare_l=1 -F prog=50
-d
(3)> export_sql.pl -D db_name_radtags -b 1 -a geno
-f /path/to/db_export/db_name_radtags_export_geno
-o xls -m GEN -c
-F snps_u=1 -F alle_u=2 -F pare_l=1 -F prog=50
• D — database to export from.
• b — batch ID of the dataset to export.
• a — type of data to export, 'geno' or 'haplo', for genotypes or observed haplotypes.
• f — file to output data (specify name of the new output file).
• o — type of data to export: 'tsv' or 'xls'.
• d — if exporting haplotype, output depth of alleles instead of the allele values.
• L — if exporting haplotypes, specify a stack depth limit.
• m —if exporting genotypes, specify the map type.
• c — if exporting genotypes, include manual corrections made online.
• F — include filters in the format name=value.
Important Filters (more are available on Stacks tutorial):
[-F snps_u=5] – locus with no more than 5 SNPs.
[-F snps_l=2] – locus with at least 2 SNPs.
[-F alle_u=1] – locus with no more than 1 allele.
[-F alle_l=2] – locus with at least 2 alleles.
[-F pare_u=1] – locus present in no more than 1 parent.
[-F pare_l=2] – locus present in at least 2 parents.
[-F prog=50] – rlocus present in at least 50 progeny.
[-F mark=aa/bb] – Export only a particular marker type.
6b- Stacks – Genotype
The genotypes.pl program can be performed on catalog files resulting from a denovo_map
or a ref_map analysis. It will perform corrections according to new criteria for the calling
of genotypes at each indiv_locus. These criteria are based on the haplotypes quality (mostly
define by the number of sequences composing each alleles). It will generate a new list of
mappable markers in the directory containing the original files of the catalog
(stacks_denovo or stacks_ref) that can be then use to upload into a new database.
denovo_map/ref_map > genotypes > load_radtags > index_radtags
6b.1- Run genotype using the command line
> cd /usr/local/share/stacks/
> genotypes -P /path/to/stacks_ref -b 1 -r 50 -m 3 -t GEN -c -s –min_hom_seqs 3 --
min_het_seqs 0.1
# In this example: minimum stack depth of 3 reads + limit markers that are found in at least
50 progeny + minimum 3 reads to constitute an homozygous locus + MAF minimum to call
homozygous raise to 0.1 (if MAF above the locus will be call hetero, so no more unknown)
• P – path to the directory containing the files of the catalog to examine.
• b — Batch ID to examine.
• r — minimum number of progeny required to print a marker.
• m – minimum stack depth to consider a locus.
.
• s — output a file to import results into an SQL database.
• t – map type: 'CP', 'DH', 'F2', 'BC1', and 'GEN'. Note that to upload markers back into the
database, they must be of the generic type “GEN".
• c — make automated corrections to the data.
Automated corrections options:
• --min_hom_seqs [num] — minimum number of reads to call a homozygous genotype
(default 5).
• --min_het_seqs [num] — if MAF below this, locus is called homozygous (default 0.05).
• --max_het_seqs [num] — if MAF above this, locus is called heterozygous (default 0.1).
MAF = Minor Allele Frequency = nb minor alleles / nb major alleles. Note that if the MAF
is between min_het_seqs and max_het_seqs, the locus is called unknown.
Filtering options:
• --lnl_lim [num] — filter loci with log likelihood values below this threshold.
6b.2- Load the new markers list into the database
> cd /usr/local/share/stacks/scripts
> load_radtags.pl -p /path/to/stacks_ref/ -D db _name_radtags -b 2 -e “Ref Map New
Genotypes” -c -B
• p – path to input catalog files
• D – database name.
• b — Batch ID.
• e – Catalog description.
• c – Load the catalog into the database.
• B – Load information into batch table.
6b.3- Index the database
> cd /usr/local/share/stacks/scripts
> index_radtags.pl -D db_name_radtags -c
• c – generate the catalog index
• D – database name
6c- Stacks – rxstacks
When sequencing error is present at a particular site it can be difficult to differentiate a
second allele from error. However, by looking at the same nucleotide site across a
population of individuals it is much easier to tell if an error is present. RXstacks program to
Correct genotypse and haplotypes based on population-wide examination. RXstacks can be
performed after a denovo_map or ref_map analysis. The corrected data are used to create a
new catalog of markers using Cstacks > Sstacks and then load into a new database.
Denovo_map/Ref_map > RXstacks > Cstacks > Sstacks > load_radtags > index_radtags
6c.1- Create directories “stacks_denovo_corr” in the directory
“stacks_analysis” to output the corrected catalog files generated from rxstacks.
6c.2- Run rxstacks from the Command Line
> cd /usr/local/share/stacks
> rxstacks -b 1 -t 15 -P /path/to/stacks_denovo/
-o /path/to/stacks_denovo_corr/
--prune_haplo --model_type bounded
--lnl_lim -10.0 --conf_lim 0.5
· b — Batch ID to examine when exporting from the catalog.
· P — path to the denovo_map/ref_map output files.
· o — output path to write results.
· t — number of threads to run in parallel sections of code.
Important Filters (more are available on Stacks tutorial):
[--prune_haplo] Haplotype pruning: the rxstacks program will remove excess of
haplotypes from individual loci according to their prevalence in the population. A threshold
can be defined using [--max_haplo].
[--model_type bounded] Sequencing errors are removed using the bounded SNP
model and SNPs are recalled. RXstacks uses the catalog to look at the alleles for a particular
nucleotide position in the population, filter out putative sequencing errors in each individual
and finally recall the nucleotide position as a heterozygote or homozygote using the bounded
SNP model. –model_type [type] can be either ‘snp’ (default), ‘bounded’ or ‘fixed’.
[--lnl_lim -10.0] Catalog loci with an average log likelihood below -10.0 will be
removed. A log likelihood is assign for each stacks (by Ustacks or Pstacks) based on depth
of coverage (number of reads) and presence of sequencing errors. Good log likelihoods are
close to zero, bad log likelihoods are highly negative. A log likelihood limit can be set in the
rxstacks program to remove loci that have a log likelihood below the threshold. To help
determining the threshold, the set of mean log likelihoods for each catalog locus can be
printed using RXstacks by specifying the [--lnl_dist] parameter (which will create the
batch_X.rxstacks_lnls.tsv file in the output directory).
[--conf_lim 0.5] Any catalog locus in which 50% or more individuals have a
confounded match to the catalog will be removed. For each catalog in the locus, we expect
each individual in the analysis to have a single, matching locus. When multiple loci from an
individual match a single catalog locus, we refer to the locus as confounded. These loci are
typically composed of repetitive sequence. By giving a percentage limit of confounded
individuals for each catalog locus, the program will remove the loci that exceed this limit.
6c.3- Redoing the catalog (Cstacks, Sstacks)
> cd /usr/local/share/stacks
> cstacks -b 1 -n 3 -p 15
-o /path/to/stacks_denovo_corr
-s /path/to/stacks_denovo_corr/parent1
-s /path/to/stacks_denovo_corr/parent2
-s /path/to/stacks_denovo_corr/progeny1
-s /path/to/stacks_denovo_corr/progeny2
…… enter all samples......
• b ! — batch.!ID
• s! — TSV!file!from!which!to!load!radtags.!
• o! — output!path!to!write!results.!
• n! — number!of!mismatches!allowed!between!sample!tags!when!generating!the!
catalog.
• P! – enable!parallel!execution!with!num_threads
> cd /usr/local/share/stacks
> sstacks -b 1 -p 15
-c ./path/to/stacks_denovo_corr/batch_1
-o /path/to/stacks_denovo_corr
-s /path/to/stacks_denovo_corr/parent1
-s /path/to/stacks_denovo_corr/parent2
-s /path/to/stacks_denovo_corr/progeny1
-s /path/to/stacks_denovo_corr/progeny2
…… enter all samples......
• b! — MySQL!ID!of!this!batch.!
• c! — TSV!file!from!which!to!load!the!catalog!RADFTags.!
• s! — TSV!file!from!which!to!load!sample!RADFTags.!
• o! — output!path!to!write!results.
• P! – enable!parallel! execution!with!num_threads
6c.4- Loading and Indexing the database

Polymorphisms (SNPs) > haplotypes
(Polymorphisms within the same locus that passes the criteria threshold) > Genotypes
(Haplotypes that passes the criteria threshold)
6- Correct, Export and Reload data with Stacks (optional)
6a- Stacks - Export_sql.pl
This program will allow to export data from the MySQL database into a Excel or TSV file.
The same filters accessible from the website can be enter in the command line.
6a.1- Create directories “db_export” in the directory
“stacks_analysis” to output the catalog files generated from export_sql.pl.
6a.2- Run export_sql.pl using the command line (3 examples)
To get a full spectrum of the data for all samples, 3 files should be exported: (1) to get the
haplotype, (2) to get the number of reads per haplotype (3) to get the genotype. The best loci
to be use as markers have to be found in a majority of progeny and have only 1 allele in the
parent. Two distinct type of usable markers can be extract from the data: (I) Genotype AA/--
(not listed as genotype by Stacks) which correspond to haplotypes with 1 allele present in 1
parent and about 75% of the F2 progeny. (II) Genotype AA/BB which correspond to 2
alleles each present in a different parent and about 75% of the F2 progeny.
> cd /usr/local/share/stacks/scripts
(1)> export_sql.pl -D db_name_radtags -b 1 -a haplo 
-f /path/to/db_export/db_name_radtags_export_haplo 
-o xls -L 3 
-F snps_u=1 -F alle_u=2 -F pare_l=1 -F prog=50
(2)>export_sql.pl -D db_name_radtags -b 1 -a haplo 
-f /path/to/db_export/db_name_radtags_export_haplo_depth 
-o xls -L 3 
-F snps_u=1 -F alle_u=2 -F pare_l=1 -F prog=50 
-d
(3)> export_sql.pl -D db_name_radtags -b 1 -a geno 
-f /path/to/db_export/db_name_radtags_export_geno 
-o xls -m GEN -c 
-F snps_u=1 -F alle_u=2 -F pare_l=1 -F prog=50
• D — database to export from.
• b — batch ID of the dataset to export.
• a — type of data to export, 'geno' or 'haplo', for genotypes or observed haplotypes.
• f — file to output data (specify name of the new output file).
• o — type of data to export: 'tsv' or 'xls'.
• d — if exporting haplotype, output depth of alleles instead of the allele values.
• L — if exporting haplotypes, specify a stack depth limit.
• m —if exporting genotypes, specify the map type.
• c — if exporting genotypes, include manual corrections made online.
• F — include filters in the format name=value.
Important Filters (more are available on Stacks tutorial):
[-F snps_u=5] – locus with no more than 5 SNPs.
[-F snps_l=2] – locus with at least 2 SNPs.
[-F alle_u=1] – locus with no more than 1 allele.
[-F alle_l=2] – locus with at least 2 alleles.
[-F pare_u=1] – locus present in no more than 1 parent.
[-F pare_l=2] – locus present in at least 2 parents.
[-F prog=50] – rlocus present in at least 50 progeny.
[-F mark=aa/bb] – Export only a particular marker type.
6b- Stacks – Genotype
The genotypes.pl program can be performed on catalog files resulting from a denovo_map
or a ref_map analysis. It will perform corrections according to new criteria for the calling
of genotypes at each indiv_locus. These criteria are based on the haplotypes quality (mostly
define by the number of sequences composing each alleles). It will generate a new list of
mappable markers in the directory containing the original files of the catalog
(stacks_denovo or stacks_ref) that can be then use to upload into a new database.
denovo_map/ref_map > genotypes > load_radtags > index_radtags
6b.1- Run genotype using the command line
> cd /usr/local/share/stacks/
> genotypes -P /path/to/stacks_ref -b 1 -r 50 -m 3 -t GEN -c -s –min_hom_seqs 3 --
min_het_seqs 0.1
# In this example: minimum stack depth of 3 reads + limit markers that are found in at least
50 progeny + minimum 3 reads to constitute an homozygous locus + MAF minimum to call
homozygous raise to 0.1 (if MAF above the locus will be call hetero, so no more unknown)
• P – path to the directory containing the files of the catalog to examine.
• b — Batch ID to examine.
• r — minimum number of progeny required to print a marker.
• m – minimum stack depth to consider a locus.
.
• s — output a file to import results into an SQL database.
• t – map type: 'CP', 'DH', 'F2', 'BC1', and 'GEN'. Note that to upload markers back into the
database, they must be of the generic type “GEN".
• c — make automated corrections to the data.
Automated corrections options:
• --min_hom_seqs [num] — minimum number of reads to call a homozygous genotype
(default 5).
• --min_het_seqs [num] — if MAF below this, locus is called homozygous (default 0.05).
• --max_het_seqs [num] — if MAF above this, locus is called heterozygous (default 0.1).
MAF = Minor Allele Frequency = nb minor alleles / nb major alleles. Note that if the MAF
is between min_het_seqs and max_het_seqs, the locus is called unknown.
Filtering options:
• --lnl_lim [num] — filter loci with log likelihood values below this threshold.
6b.2- Load the new markers list into the database
> cd /usr/local/share/stacks/scripts
> load_radtags.pl -p /path/to/stacks_ref/ -D db _name_radtags -b 2 -e “Ref Map New
Genotypes” -c -B
• p – path to input catalog files
• D – database name.
• b — Batch ID.
• e – Catalog description.
• c – Load the catalog into the database.
• B – Load information into batch table.
6b.3- Index the database
> cd /usr/local/share/stacks/scripts
> index_radtags.pl -D db_name_radtags -c
• c – generate the catalog index
• D – database name
6c- Stacks – rxstacks
When sequencing error is present at a particular site it can be difficult to differentiate a
second allele from error. However, by looking at the same nucleotide site across a
population of individuals it is much easier to tell if an error is present. RXstacks program to
Correct genotypse and haplotypes based on population-wide examination. RXstacks can be
performed after a denovo_map or ref_map analysis. The corrected data are used to create a
new catalog of markers using Cstacks > Sstacks and then load into a new database.
Denovo_map/Ref_map > RXstacks > Cstacks > Sstacks > load_radtags > index_radtags
6c.1- Create directories “stacks_denovo_corr” in the directory
“stacks_analysis” to output the corrected catalog files generated from rxstacks.
6c.2- Run rxstacks from the Command Line
> cd /usr/local/share/stacks
> rxstacks -b 1 -t 15 -P /path/to/stacks_denovo/ 
-o /path/to/stacks_denovo_corr/ 
--prune_haplo --model_type bounded 
--lnl_lim -10.0 --conf_lim 0.5
· b — Batch ID to examine when exporting from the catalog.
· P — path to the denovo_map/ref_map output files.
· o — output path to write results.
· t — number of threads to run in parallel sections of code.
Important Filters (more are available on Stacks tutorial):
[--prune_haplo] Haplotype pruning: the rxstacks program will remove excess of
haplotypes from individual loci according to their prevalence in the population. A threshold
can be defined using [--max_haplo].
[--model_type bounded] Sequencing errors are removed using the bounded SNP
model and SNPs are recalled. RXstacks uses the catalog to look at the alleles for a particular
nucleotide position in the population, filter out putative sequencing errors in each individual
and finally recall the nucleotide position as a heterozygote or homozygote using the bounded
SNP model. –model_type [type] can be either ‘snp’ (default), ‘bounded’ or ‘fixed’.
[--lnl_lim -10.0] Catalog loci with an average log likelihood below -10.0 will be
removed. A log likelihood is assign for each stacks (by Ustacks or Pstacks) based on depth
of coverage (number of reads) and presence of sequencing errors. Good log likelihoods are
close to zero, bad log likelihoods are highly negative. A log likelihood limit can be set in the
rxstacks program to remove loci that have a log likelihood below the threshold. To help
determining the threshold, the set of mean log likelihoods for each catalog locus can be
printed using RXstacks by specifying the [--lnl_dist] parameter (which will create the
batch_X.rxstacks_lnls.tsv file in the output directory).
[--conf_lim 0.5] Any catalog locus in which 50% or more individuals have a
confounded match to the catalog will be removed. For each catalog in the locus, we expect
each individual in the analysis to have a single, matching locus. When multiple loci from an
individual match a single catalog locus, we refer to the locus as confounded. These loci are
typically composed of repetitive sequence. By giving a percentage limit of confounded
individuals for each catalog locus, the program will remove the loci that exceed this limit.
6c.3- Redoing the catalog (Cstacks, Sstacks)
> cd /usr/local/share/stacks
> cstacks -b 1 -n 3 -p 15 
-o /path/to/stacks_denovo_corr 
-s /path/to/stacks_denovo_corr/parent1 
-s /path/to/stacks_denovo_corr/parent2 
-s /path/to/stacks_denovo_corr/progeny1 
-s /path/to/stacks_denovo_corr/progeny2 
…… enter all samples......
• b ! — batch.!ID
• s! — TSV!file!from!which!to!load!radtags.!
• o! — output!path!to!write!results.!
• n! — number!of!mismatches!allowed!between!sample!tags!when!generating!the!
catalog.
• P! – enable!parallel!execution!with!num_threads
> cd /usr/local/share/stacks
> sstacks -b 1 -p 15
-c ./path/to/stacks_denovo_corr/batch_1 
-o /path/to/stacks_denovo_corr 
-s /path/to/stacks_denovo_corr/parent1 
-s /path/to/stacks_denovo_corr/parent2 
-s /path/to/stacks_denovo_corr/progeny1 
-s /path/to/stacks_denovo_corr/progeny2 
…… enter all samples......
• b! — MySQL!ID!of!this!batch.!
• c! — TSV!file!from!which!to!load!the!catalog!RADFTags.!
• s! — TSV!file!from!which!to!load!sample!RADFTags.!
• o! — output!path!to!write!results.
• P! – enable!parallel! execution!with!num_threads
6c.4- Loading and Indexing the database

0/5000

Từ: -

Sang: -

Kết quả (Việt) 1: [Sao chép]

Sao chép!

Polymorphisms (SNPs) > haplotypes(Polymorphisms trong cùng một locus mà đi tiêu chí ngưỡng) > kiểu gen(Các Haplotypes vượt qua ngưỡng tiêu chuẩn)6 - sửa chữa, xuất khẩu và tải lại dữ liệu với ngăn xếp (tùy chọn)6A-ngăn xếp - Export_sql.plChương trình này sẽ cho phép để xuất dữ liệu từ cơ sở dữ liệu MySQL vào một tập tin Excel hoặc TSV.Các bộ lọc tương tự có thể truy cập từ các trang web có thể được nhập vào dòng lệnh.6A.1-tạo thư mục "db_export" trong thư mục"stacks_analysis" để đầu ra tệp catalô được tạo ra từ export_sql.pl.export_sql.pl 6A.2-chạy bằng cách sử dụng dòng lệnh (3 chiếc)Để có được một phổ đầy đủ của dữ liệu cho tất cả mẫu, 3 tập tin nên được xuất khẩu: (1) để có được cáchaplotype, (2) để có được số lần đọc một haplotype (3) để có được kiểu gen. Loci tốt nhấtđược sử dụng như dấu hiệu phải tìm thấy trong một đa số con cháu và có chỉ có 1 allele trong cácphụ huynh. Hai loại riêng biệt của các dấu hiệu có thể sử dụng có thể là chiết xuất từ các dữ liệu: (I) kiểu gen AA /--(không liệt kê như là kiểu gen của ngăn xếp) mà tương ứng với haplotypes với 1 allele hiện diện trong 1phụ huynh và khoảng 75% con cháu F2. (II) kiểu gen AA/BB mà tương ứng với 2allele mỗi trình bày trong một phụ huynh khác nhau và khoảng 75% con cháu F2.> cd /usr/local/share/stacks/scripts(1) > export_sql.pl -D db_name_radtags -b 1 - một haplo được viết bởi admin-f /path/to/db_export/db_name_radtags_export_haplo được viết bởi admin-o xls -L 3 -F snps_u = 1 -F alle_u = 2 -F pare_l = 1 -F prog = 50(2) > export_sql.pl -D db_name_radtags -b 1 - một haplo được viết bởi admin-f /path/to/db_export/db_name_radtags_export_haplo_depth được viết bởi admin-o xls -L 3 -F snps_u = 1 -F alle_u = 2 -F pare_l = 1 -F prog = 50 -d(3) > export_sql.pl -D db_name_radtags -b 1 - một geno -f /path/to/db_export/db_name_radtags_export_geno được viết bởi admin-o xls -m GEN - c được viết bởi admin-F snps_u = 1 -F alle_u = 2 -F pare_l = 1 -F prog = 50• D-cơ sở dữ liệu để xuất khẩu từ.• b-hàng loạt các ID của bộ dữ liệu để xuất khẩu.• một — loại dữ liệu để xuất khẩu, 'geno' hay 'haplo', cho kiểu gen hoặc quan sát haplotypes.• f-các tập tin dữ liệu đầu ra (chỉ định tên của tập tin đầu ra mới).• o-loại dữ liệu để xuất khẩu: 'tsv' hoặc 'xls'.• d — Nếu xuất khẩu haplotype, đầu ra chiều sâu của alen thay vì các giá trị allele.• L — nếu xuất khẩu haplotypes, chỉ định một giới hạn độ sâu ngăn xếp.• m — nếu xuất khẩu kiểu gen, chỉ định các loại bản đồ.• c — nếu xuất khẩu kiểu gen, bao gồm hướng dẫn sử dụng sửa chữa được thực hiện trực tuyến.• F-bao gồm bộ lọc định dạng tên = giá_trị.Quan trọng các bộ lọc (nhiều hơn nữa có sẵn trên hướng dẫn ngăn xếp):[-F snps_u = 5]-locus với không quá 5 SNPs.[-F snps_l = 2]-locus với ít nhất 2 SNPs.[-F alle_u = 1]-locus với không quá 1 allele.[-F alle_l = 2]-locus với ít nhất 2 allele.[-F pare_u = 1]-locus hiện diện trong không quá 1 phụ huynh.[-F pare_l = 2]-locus hiện diện trong ít nhất 2 phụ huynh.[-F prog = 50]-rlocus trình bày trong ít nhất 50 con cháu.[-F mark = aa/bb]-xuất khẩu chỉ là một loại đặc biệt đánh dấu.6B-ngăn xếp-kiểu genChương trình genotypes.pl có thể được thực hiện trên các tập tin vào cửa hàng gây ra bởi một denovo_maphoặc một phân tích ref_map. Nó sẽ thực hiện các chỉnh sửa theo tiêu chuẩn mới cho các cuộc gọicủa kiểu gen tại mỗi indiv_locus. Các tiêu chí này được dựa trên chất lượng haplotypes (chủ yếuxác định bởi số lượng các trình tự sáng tác alen mỗi). Nó sẽ tạo ra một danh sách mớiCác đánh dấu mappable trong thư mục chứa các tập tin ban đầu của các cửa hàng(stacks_denovo hoặc stacks_ref) mà có thể là sử dụng sau đó tải lên vào cơ sở dữ liệu mới.denovo_map/ref_map > kiểu gen > load_radtags > index_radtagskiểu gen 6B.1-chạy bằng cách sử dụng dòng lệnh> cd/usr/local/share/ngăn xếp /> kiểu gen -P /path/to/stacks_ref -b 1 - r 50 - m 3 -t GEN - c -s-min_hom_seqs 3--min_het_seqs 0,1# Trong ví dụ này: tối thiểu ngăn xếp chiều sâu của 3 lần đọc + dấu hiệu giới hạn được tìm thấy ở tối thiểu50 con cháu + tối thiểu 3 lần đọc để tạo thành một màu locus + MAF tối thiểu để gọimàu nâng cao để 0,1 (nếu MAF trên locus sẽ là cuộc gọi dị, vì vậy không có nhiều không rõ)• P-đường dẫn đến thư mục chứa các tập tin của các cửa hàng để kiểm tra.• b-lô ID để kiểm tra.• r-số lượng tối thiểu của con cháu phải in một điểm đánh dấu.• ngăn xếp m-tối thiểu chiều sâu để xem xét một locus..• s-đầu ra tập tin cần nhập kết quả vào một cơ sở dữ liệu SQL.• t-bản đồ loại: 'CP', 'DH', 'F2', 'BC1' và 'GEN'. Lưu ý rằng để tải lên đánh dấu trở lại vào cáccơ sở dữ liệu, họ phải loại chung, "GEN".• c-thực hiện các sửa chữa tự động để các dữ liệu.Tự động chỉnh tùy chọn:•--min_hom_seqs [số] — tối thiểu số lần đọc để gọi một kiểu gen màu(mặc định 5).•--min_het_seqs [số] — Nếu MAF dưới đây, locus được gọi là màu (mặc định 0,05).•--max_het_seqs [số] — Nếu MAF trên đây, locus được gọi là hổ (mặc định 0.1).MAF = nhỏ Allele tần số = nb nhỏ alen / nb lớn allele. Lưu ý rằng nếu MAFlà giữa min_het_seqs và max_het_seqs, locus được gọi là không rõ.Bộ lọc tùy chọn:•--lnl_lim [số] — lọc loci với giá trị khả năng đăng nhập dưới ngưỡng này.6B.2 – tải danh sách đánh dấu mới vào cơ sở dữ liệu> cd /usr/local/share/stacks/scripts> load_radtags.pl -p/path/to/stacks_ref / -D db _name_radtags -b 2 -e "Ref bản đồ mớiKiểu gen"- c -B• p-đường dẫn đến nhập tệp catalô• D-cơ sở dữ liệu tên.• b-lô của bạn.• e-cửa hàng mô tả.• c-tải các cửa hàng vào cơ sở dữ liệu.• B-tải thông tin lô bàn.6B.3-chỉ số cơ sở dữ liệu> cd /usr/local/share/stacks/scripts> index_radtags.pl -D db_name_radtags - c• c-tạo mục danh mục• D-cơ sở dữ liệu tên6c-ngăn xếp-rxstacksKhi trình tự lỗi là hiện nay tại một trang web cụ thể, nó có thể được khó khăn để phân biệt mộtThứ hai allele từ lỗi. Tuy nhiên, bằng cách nhìn vào cùng một trang nucleotide trên mộtdân số cá nhân là dễ dàng hơn nhiều để cho biết nếu một lỗi được trình bày. Chương trình RXstacksChính xác genotypse và haplotypes dựa trên dân toàn kiểm tra. RXstacks có thểthực hiện sau khi phân tích denovo_map hoặc ref_map. Sửa chữa dữ liệu được sử dụng để tạo ra mộtcác cửa hàng mới của các dấu hiệu bằng cách sử dụng Cstacks > Sstacks và sau đó nạp vào cơ sở dữ liệu mới.Denovo_map/Ref_map > RXstacks > Cstacks > Sstacks > load_radtags > index_radtags6c.1-tạo thư mục "stacks_denovo_corr" trong thư mục"stacks_analysis" với sản lượng các tập tin vào cửa hàng sửa chữa được tạo ra từ rxstacks.rxstacks 6c.2 – chạy từ dòng lệnh> cd /usr/local/share/stacks> rxstacks -b 1 -t 15 - P/path/to/stacks_denovo / -o/path/to/stacks_denovo_corr / --prune_haplo--model_type bao bọc --lnl_lim-10.0--conf_lim 0,5· b-lô ID để kiểm tra khi xuất khẩu từ các cửa hàng.· P-các đường dẫn đến tập tin đầu ra denovo_map/ref_map.· o-đầu ra đường dẫn đến viết kết quả.· t-số của chủ đề để chạy trong song song phần của mã.Quan trọng các bộ lọc (nhiều hơn nữa có sẵn trên hướng dẫn ngăn xếp):[-prune_haplo] Cắt tỉa Haplotype: chương trình rxstacks sẽ loại bỏ dư thừa củahaplotypes từ cá nhân loci theo của phổ biến trong dân số. Một ngưỡngcó thể được xác định bằng cách sử dụng [-max_haplo].[--model_type bao bọc] Trình tự lỗi được loại bỏ bằng cách sử dụng các SNP bị chặnMô hình và SNPs đang nhớ lại. RXstacks sử dụng các cửa hàng để xem xét alen cho một đặc biệtnucleotide position in the population, filter out putative sequencing errors in each individualand finally recall the nucleotide position as a heterozygote or homozygote using the boundedSNP model. –model_type [type] can be either ‘snp’ (default), ‘bounded’ or ‘fixed’.[--lnl_lim -10.0] Catalog loci with an average log likelihood below -10.0 will beremoved. A log likelihood is assign for each stacks (by Ustacks or Pstacks) based on depthof coverage (number of reads) and presence of sequencing errors. Good log likelihoods areclose to zero, bad log likelihoods are highly negative. A log likelihood limit can be set in therxstacks program to remove loci that have a log likelihood below the threshold. To helpdetermining the threshold, the set of mean log likelihoods for each catalog locus can beprinted using RXstacks by specifying the [--lnl_dist] parameter (which will create thebatch_X.rxstacks_lnls.tsv file in the output directory).[--conf_lim 0.5] Any catalog locus in which 50% or more individuals have aconfounded match to the catalog will be removed. For each catalog in the locus, we expecteach individual in the analysis to have a single, matching locus. When multiple loci from anindividual match a single catalog locus, we refer to the locus as confounded. These loci aretypically composed of repetitive sequence. By giving a percentage limit of confoundedindividuals for each catalog locus, the program will remove the loci that exceed this limit.6c.3- Redoing the catalog (Cstacks, Sstacks)> cd /usr/local/share/stacks> cstacks -b 1 -n 3 -p 15 -o /path/to/stacks_denovo_corr -s /path/to/stacks_denovo_corr/parent1 -s /path/to/stacks_denovo_corr/parent2 -s /path/to/stacks_denovo_corr/progeny1 -s /path/to/stacks_denovo_corr/progeny2 …… enter all samples......• b ! — batch.!ID• s! — TSV!file!from!which!to!load!radtags.!• o! — output!path!to!write!results.!• n! — number!of!mismatches!allowed!between!sample!tags!when!generating!the!catalog.• P! – enable!parallel!execution!with!num_threads> cd /usr/local/share/stacks> sstacks -b 1 -p 15-c ./path/to/stacks_denovo_corr/batch_1 -o /path/to/stacks_denovo_corr -s /path/to/stacks_denovo_corr/parent1 -s /path/to/stacks_denovo_corr/parent2 -s /path/to/stacks_denovo_corr/progeny1 -s /path/to/stacks_denovo_corr/progeny2 …… enter all samples......• b! — MySQL!ID!of!this!batch.!• c! — TSV!file!from!which!to!load!the!catalog!RADFTags.!• s! — TSV!file!from!which!to!load!sample!RADFTags.!• o! — output!path!to!write!results.• P! – enable!parallel! execution!with!num_threads6c.4- Loading and Indexing the database

đang được dịch, vui lòng đợi..

Kết quả (Việt) 2:[Sao chép]

Sao chép!

Các đa hình (SNPs)> haplotype
(đa hình trong cùng một locus đã vượt qua ngưỡng tiêu chuẩn)> Kiểu gen
(haplotype đã vượt qua ngưỡng tiêu chuẩn)
6- đúng, xuất khẩu và Nạp dữ liệu với Stacks (tùy chọn)
6a- Stacks - Export_sql.pl
chương trình này sẽ cho phép xuất dữ liệu từ cơ sở dữ liệu MySQL vào một Excel hoặc TSV tập tin.
Các bộ lọc giống như truy cập từ các trang web có thể được nhập vào dòng lệnh.
6a.1- Tạo thư mục "db_export" trong thư mục
"stacks_analysis" để sản xuất các danh mục . file được tạo ra từ export_sql.pl
6a.2- export_sql.pl Run bằng cách sử dụng dòng lệnh (ví dụ 3)
Để có được một quang phổ đầy đủ các dữ liệu cho tất cả các mẫu, 3 file sẽ xuất khẩu: (1) để có được các
haplotype, ( 2) để có được số lượt đọc mỗi haplotype (3) để có được kiểu gen. Các locus tốt nhất
để được sử dụng như là dấu hiệu phải được tìm thấy trong phần lớn các thế hệ con cháu và chỉ có 1 allele ở
bố mẹ. Hai loại riêng biệt của các dấu hiệu có thể sử dụng có thể được chiết xuất từ các dữ liệu: (I) kiểu gen AA / -
(không được liệt kê như kiểu gen của Stacks) tương ứng với các haplotype với 1 alen có trong 1
mẹ và khoảng 75% của các thế hệ con cháu F2. (II) kiểu gen AA / BB tương ứng với 2
alen từng hiện diện trong một mẹ khác nhau và khoảng 75% của các thế hệ con cháu F2.
> cd / usr / local / share / stacks / scripts
(1)> export_sql.pl -D db_name_radtags - b 1 -a haplo
f / path / to / db_export / db_name_radtags_export_haplo
-o xls -L 3
-F snps_u = 1 -F alle_u = 2 -F pare_l = 1 -F prog = 50
(2)> export_sql. pl -D db_name_radtags -b 1 -a haplo
f / path / to / db_export / db_name_radtags_export_haplo_depth
-o xls -L 3
-F snps_u = 1 -F alle_u = 2 -F pare_l = 1 -F prog = 50
-d
(3)> export_sql.pl -D db_name_radtags -b 1 -a Geno
f / path / to / db_export / db_name_radtags_export_geno
-o xls -m GEN -c
-F snps_u = 1 -F alle_u = 2 - F pare_l = 1 -F prog = 50
• D - cơ sở dữ liệu kết xuất ra.
• b - lô ID của tập dữ liệu để xuất khẩu.
• a - loại dữ liệu để xuất khẩu, "Geno 'hoặc' haplo ', cho kiểu gen hoặc haplotype quan sát .
• f - tập tin dữ liệu đầu ra (ghi rõ tên của tập tin đầu ra mới).
• o - loại dữ liệu để xuất khẩu: 'tsv' hoặc 'xls'.
• d - nếu xuất khẩu haplotype, đầu ra chiều sâu của các alen thay vì các allele giá trị.
• L -. nếu xuất khẩu haplotype, chỉ định giới hạn độ sâu ngăn xếp
• m -Nếu xuất khẩu kiểu gen, xác định loại bản đồ.
• c - nếu xuất khẩu kiểu gen, bao gồm chỉnh dẫn thực hiện trực tuyến.
• F - bao gồm bộ lọc trong tên format = giá trị.
Bộ lọc quan trọng (hơn có sẵn trên Stacks hướng dẫn):
. [-F snps_u = 5] - locus với không quá 5 SNPs
. [-F snps_l = 2] - locus với ít nhất 2 SNPs
[-F alle_u = 1 ] -. locus với không nhiều hơn 1 allele
[-F alle_l = 2] -. locus với ít nhất 2 alen
[-F pare_u = 1] -. locus hiện tại không quá 1 mẹ
[-F pare_l = 2] - . locus có mặt trong ít nhất 2 cha mẹ
[-F prog = 50] - rlocus hiện diện trong ít nhất 50 con cháu.
[-F dấu = aa / bb] -. Xuất khẩu chỉ là một loại marker đặc biệt
6b- Stacks - kiểu gen
Các genotypes.pl chương trình có thể được thực hiện trên tệp danh mục kết quả từ một denovo_map
hoặc một phân tích ref_map. Nó sẽ thực hiện cải chính theo tiêu chuẩn mới cho sự kêu gọi
của các kiểu gen ở mỗi indiv_locus. Các tiêu chí này dựa trên các haplotype chất lượng (chủ yếu là
xác định bởi số lượng của các trình tự sáng tác mỗi alen). Nó sẽ tạo ra một danh sách mới của
dấu mappable trong thư mục chứa các tập tin ban đầu của các cửa hàng
(stacks_denovo hoặc stacks_ref) mà có thể được sử dụng sau đó tải lên thành một cơ sở dữ liệu mới.
denovo_map / ref_map> kiểu gen> load_radtags> index_radtags
6b.1- Run kiểu gen bằng cách sử dụng dòng lệnh
> cd / usr / local / share / stacks /
> kiểu gen -P / path / to / stacks_ref -b 1 -r -t 50 -m 3 GEN -c -s -min_hom_seqs 3 -
min_het_seqs 0.1
# Trong ví dụ này: tối thiểu ngăn xếp sâu 3 lần đọc + đánh dấu giới hạn đó được tìm thấy trong ít nhất
50 con cháu + tối thiểu 3 lần đọc để tạo thành một locus đồng hợp tử + MAF tối thiểu để gọi
đồng hợp tử tăng đến 0,1 (nếu MAF trên các locus sẽ được gọi dị, vì vậy không lạ hơn)
• P - đường dẫn đến thư mục chứa các tập tin của các cửa hàng để kiểm tra.
• b - Batch ID để kiểm tra.
• r -. Số lượng tối thiểu của các thế hệ con cháu bắt buộc phải in một dấu
• m - tối thiểu ngăn xếp sâu xem xét . một locus
.
• s - đầu ra một tập tin để nhập kết quả vào một cơ sở dữ liệu SQL.
• t - loại bản đồ: "CP", "DH", "F2", "BC1 ', và' GEN '. Lưu ý rằng để tải lên các dấu hiệu trở lại vào
cơ sở dữ liệu, chúng phải có kiểu chung chung "GEN".
• c - sửa chữa tự động để các dữ liệu.
chỉnh tự động tùy chọn:
• --min_hom_seqs [num] - số lượng tối thiểu là đọc để gọi một kiểu gen đồng hợp tử
(mặc định 5).
• --min_het_seqs [num] -. nếu MAF bên dưới này, locus được gọi là đồng hợp tử (mặc định 0.05)
• --max_het_seqs [num] - nếu MAF ở trên này, locus được gọi là dị hợp tử (mặc định 0.1) .
MAF = Tiểu Allele Tần số alen = nb nhỏ / alen lớn nb Lưu ý rằng nếu MAF.
là giữa min_het_seqs và max_het_seqs, các locus được gọi là không rõ.
tùy chọn lọc:
• --lnl_lim [num] - lọc loci với log giá trị khả năng dưới đây ngưỡng này.
6b.2- Nạp danh sách các cột mốc mới vào cơ sở dữ liệu
> cd / usr / local / share / stacks / script
> load_radtags.pl -p / path / to / stacks_ref / -D db _name_radtags -b 2 -e " Ref Bản đồ New
Kiểu gen "-c -B
• p - đường dẫn đến tệp danh mục đầu vào
• D - Tên cơ sở dữ liệu.
• b - ID hàng loạt.
• e - Mô tả Catalog.
• c - Nạp danh mục vào các cơ sở dữ liệu.
• B - thông tin tải vào bảng hàng loạt.
Index 6b.3- cơ sở dữ liệu
> cd / usr / local / share / stacks / script
> index_radtags.pl -D db_name_radtags -c
• c - tạo chỉ mục
• D - Tên cơ sở dữ liệu
6c- Stacks - rxstacks
Khi lỗi sequencing có mặt tại một trang web cụ thể nó có thể được khó khăn để phân biệt một
allele thứ hai từ lỗi. Tuy nhiên, bằng cách nhìn vào các trang web của nucleotide này trên
dân số của cá nhân đó là dễ dàng hơn nhiều để nói nếu một lỗi xuất hiện. Chương trình RXstacks để
Đúng genotypse và haplotype dựa trên kiểm tra dân số toàn. RXstacks có thể được
thực hiện sau khi một denovo_map hoặc phân tích ref_map. Các dữ liệu điều chỉnh được sử dụng để tạo ra một
cửa hàng mới của các dấu hiệu sử dụng Cstacks> Sstacks và sau đó tải vào một cơ sở dữ liệu mới.
Denovo_map / Ref_map> RXstacks> Cstacks> Sstacks> load_radtags> index_radtags
6c.1- Tạo thư mục "stacks_denovo_corr" trong thư mục
" stacks_analysis "để sản xuất các tệp danh mục sửa chữa được tạo ra từ rxstacks.
6c.2- rxstacks Run từ Command Line
> cd / usr / local / share / ngăn xếp
> rxstacks -b 1 -t 15 -P / path / to / stacks_denovo /
-o / path / to / stacks_denovo_corr /
--prune_haplo --model_type bounded
--lnl_lim -10.0 --conf_lim 0.5
· b - Batch ID để kiểm tra khi xuất khẩu từ các cửa hàng.
· P - đường dẫn đến denovo_map / đầu ra ref_map . file
· o - đường dẫn đầu ra để viết kết quả.
· t - số chủ đề để chạy trong phần song song mã.
Bộ lọc quan trọng (hơn có sẵn trên Stacks hướng dẫn):
[--prune_haplo] haplotype tỉa: chương trình rxstacks sẽ loại bỏ dư thừa của
haplotype từ loci cá nhân theo tỷ lệ của họ trong dân số. Một ngưỡng
có thể được xác định bằng cách sử dụng [--max_haplo].
[--model_type giáp] lỗi Sequencing được loại bỏ bằng cách sử dụng SNP giáp
mô hình và SNPs được nhắc lại. RXstacks sử dụng danh mục để xem xét các alen cho một cụ thể
vị trí nucleotide trong dân số, lọc ra các lỗi lập trình tự giả định trong mỗi cá nhân
và cuối cùng nhớ lại những vị trí nucleotide như một dị hợp tử hoặc homozygote sử dụng các giới hạn
mô hình SNP. -model_type [type] có thể là 'SNP' (mặc định), 'giáp' hoặc 'cố định'.
[--lnl_lim -10.0] Catalog loci với một khả năng đăng nhập trung bình dưới -10.0 sẽ được
gỡ bỏ. Một khả năng đăng nhập là gán cho mỗi ngăn xếp (bởi Ustacks hoặc Pstacks) dựa vào độ sâu
của vùng phủ sóng (số lần đọc) và sự hiện diện của các lỗi lập trình tự. Các khả năng log tốt là
gần bằng không, các khả năng log xấu là rất tiêu cực. Một giới hạn khả năng ghi có thể được thiết lập trong
chương trình rxstacks để loại bỏ loci rằng có một khả năng log dưới ngưỡng. Để giúp
xác định các ngưỡng, các thiết lập của các khả năng đăng nhập trung bình của mỗi catalog locus có thể được
in sử dụng RXstacks bằng cách xác định [--lnl_dist] tham số (mà sẽ tạo ra các
tập tin batch_X.rxstacks_lnls.tsv trong thư mục đầu ra).
[- conf_lim 0,5] Bất kỳ catalog locus trong đó 50% hoặc nhiều cá nhân có một
trận đấu bắt bẻ vào danh mục sẽ được loại bỏ. Đối với mỗi mục trong các locus, chúng tôi hy vọng
mỗi cá nhân trong việc phân tích để có một duy nhất, phù hợp với locus. Khi nhiều loci từ một
cá nhân phù hợp với một mục locus đơn, chúng tôi đề cập đến các locus như xấu hổ. Những loci được
thường bao gồm các trình tự lặp đi lặp lại. Bằng cách đưa ra một giới hạn tỷ lệ mắc cỡ
cá nhân cho mỗi locus catalog, chương trình sẽ loại bỏ các loci vượt quá giới hạn này.
6c.3- làm lại danh mục (Cstacks, Sstacks)
> cd / usr / local / share / ngăn xếp
> cstacks -b 1 -n 3 -p 15
o / path / to / stacks_denovo_corr
-s / path / to / stacks_denovo_corr / parent1
-s / path / to / stacks_denovo_corr / parent2
-s / path / to / stacks_denovo_corr / progeny1
-s / path / to / stacks_denovo_corr / progeny2
...... nhập tất cả các mẫu ......
• b! ID lô -.!
• s! -!!!!!!.! TSV tập tin mà từ đó để radtags tải
• o! -!!! Đường dẫn đầu ra để viết kết quả!.!
• n! -!!!!!!!!!! Số sai lệch cho phép giữa các mẫu thẻ khi tạo
. catalog
• P! - Cho phép thực hiện song song với num_threads!!!!
> cd / usr / local / share / ngăn xếp
> sstacks -b 1-p 15
-c ./path/to/stacks_denovo_corr/batch_1
o / path / to / stacks_denovo_corr
-s / path / to / stacks_denovo_corr / parent1
-s / path / to / stacks_denovo_corr / parent2
-s / path / to / stacks_denovo_corr / progeny1
-s / path / to / stacks_denovo_corr / progeny2
...... nhập tất cả các mẫu. .....
• b! -!!!!.! ID MySQL này hàng loạt
• c! -!!!! TSV tập tin mà từ đó để RADFTags catalog tải!!!!.!
• s! -!!!! TSV tập tin mà từ đó để tải RADFTags mẫu!!!.!
• o! -!!! Đường dẫn đầu ra để viết kết quả!.
• P! - Cho phép song song! thực hiện! với! num_threads
6c.4- tải và Lập chỉ mục cơ sở dữ liệu

đang được dịch, vui lòng đợi..

Kết quả (Việt) 3:[Sao chép]

Sao chép!

đang được dịch, vui lòng đợi..

Các ngôn ngữ khác

Hỗ trợ công cụ dịch thuật: Albania, Amharic, Anh, Armenia, Azerbaijan, Ba Lan, Ba Tư, Bantu, Basque, Belarus, Bengal, Bosnia, Bulgaria, Bồ Đào Nha, Catalan, Cebuano, Chichewa, Corsi, Creole (Haiti), Croatia, Do Thái, Estonia, Filipino, Frisia, Gael Scotland, Galicia, George, Gujarat, Hausa, Hawaii, Hindi, Hmong, Hungary, Hy Lạp, Hà Lan, Hà Lan (Nam Phi), Hàn, Iceland, Igbo, Ireland, Java, Kannada, Kazakh, Khmer, Kinyarwanda, Klingon, Kurd, Kyrgyz, Latinh, Latvia, Litva, Luxembourg, Lào, Macedonia, Malagasy, Malayalam, Malta, Maori, Marathi, Myanmar, Mã Lai, Mông Cổ, Na Uy, Nepal, Nga, Nhật, Odia (Oriya), Pashto, Pháp, Phát hiện ngôn ngữ, Phần Lan, Punjab, Quốc tế ngữ, Rumani, Samoa, Serbia, Sesotho, Shona, Sindhi, Sinhala, Slovak, Slovenia, Somali, Sunda, Swahili, Séc, Tajik, Tamil, Tatar, Telugu, Thái, Thổ Nhĩ Kỳ, Thụy Điển, Tiếng Indonesia, Tiếng Ý, Trung, Trung (Phồn thể), Turkmen, Tây Ban Nha, Ukraina, Urdu, Uyghur, Uzbek, Việt, Xứ Wales, Yiddish, Yoruba, Zulu, Đan Mạch, Đức, Ả Rập, dịch ngôn ngữ.