GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:27:48 Sequence gi568815597f:87232016_87440204 : 208189 bp : 40.67% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8991 9058 68 2 2 88 106 65 0.920 8.91 1.02 Intr + 11006 11134 129 2 0 126 44 20 0.489 0.29 1.03 Intr + 13690 13903 214 1 1 65 52 107 0.450 2.60 1.04 Intr + 14412 14458 47 2 2 48 92 48 0.425 -2.51 1.05 Intr + 14738 15032 295 2 1 70 55 143 0.170 5.29 1.06 Intr + 24371 24494 124 1 1 47 84 126 0.214 7.34 1.07 Intr + 33687 33719 33 1 0 74 107 35 0.036 1.28 1.08 Intr + 43376 43435 60 0 0 69 77 56 0.131 0.39 1.09 Term + 51261 51391 131 1 2 77 49 131 0.437 5.46 1.10 PlyA + 52846 52851 6 1.05 2.00 Prom + 59465 59504 40 -6.15 2.01 Init + 60641 60683 43 1 1 57 65 66 0.454 1.84 2.02 Term + 62443 62630 188 2 2 18 53 230 0.872 9.17 2.03 PlyA + 63433 63438 6 1.05 3.04 PlyA - 64294 64289 6 1.05 3.03 Term - 68517 68280 238 2 1 -4 40 271 0.204 7.76 3.02 Intr - 73365 73275 91 1 1 67 116 25 0.365 1.33 3.01 Init - 77580 77505 76 0 1 99 62 71 0.552 6.90 3.00 Prom - 83301 83262 40 -8.65 4.00 Prom + 84780 84819 40 -5.15 4.01 Init + 96007 96057 51 0 0 67 115 21 0.665 3.91 4.02 Intr + 97131 97229 99 2 0 94 110 29 0.839 5.09 4.03 Intr + 99998 100236 239 1 2 111 82 217 0.905 18.89 4.04 Intr + 102470 102545 76 2 1 61 72 92 0.979 3.50 4.05 Intr + 103156 103288 133 0 1 82 39 136 0.786 7.40 4.06 Intr + 104258 104363 106 0 1 101 74 10 0.808 -0.75 4.07 Intr + 107521 107617 97 1 1 97 62 98 0.867 7.19 4.08 Intr + 108032 108187 156 1 0 99 63 113 0.976 9.19 4.09 Intr + 118111 118229 119 0 2 66 43 98 0.014 1.44 4.10 Intr + 121246 121297 52 1 1 88 93 37 0.007 2.19 4.11 Intr + 123717 123861 145 2 1 48 80 59 0.002 0.03 4.12 Intr + 129457 129671 215 2 2 68 103 99 0.008 6.91 4.13 Intr + 132387 132488 102 2 0 111 103 17 0.007 4.95 4.14 Term + 132729 132743 15 2 0 79 48 -6 0.003 -8.24 4.15 PlyA + 133029 133034 6 1.05 5.02 PlyA - 133359 133354 6 1.05 5.01 Sngl - 135010 134438 573 1 0 52 55 254 0.914 14.41 5.00 Prom - 147856 147817 40 -4.45 6.03 PlyA - 147903 147898 6 1.05 6.02 Term - 153543 153304 240 0 0 49 49 219 0.701 9.14 6.01 Init - 156367 156350 18 1 0 74 85 16 0.287 -0.31 6.00 Prom - 156558 156519 40 -2.55 7.00 Prom + 159522 159561 40 -6.15 7.01 Init + 165559 165651 93 0 0 51 89 72 0.380 4.03 7.02 Term + 173510 173575 66 1 0 101 40 123 0.715 5.76 7.03 PlyA + 173710 173715 6 1.05 8.04 PlyA - 173897 173892 6 1.05 8.03 Term - 178078 177963 116 2 2 29 42 115 0.873 -1.25 8.02 Intr - 178902 178777 126 1 0 109 103 76 0.315 10.93 8.01 Init - 193809 193800 10 0 1 103 119 3 0.060 5.66 8.00 Prom - 194460 194421 40 -7.15 9.03 PlyA - 194576 194571 6 1.05 9.02 Term - 195402 195185 218 1 2 47 54 188 0.391 7.62 9.01 Init - 199350 199200 151 0 1 69 44 89 0.224 2.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 185325 185260 66 0 0 129 39 105 0.895 6.66 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:87232016_87440204|GENSCAN_predicted_peptide_1|366_aa MEAWGSRDRVALPSGFGCVGFGKIGRESCPAVRHCFLPLTGYTGSHFADKTSIKFCEIVA SIGVERWHHVNEFWPIDVARSNVYHFQAWPTKSPYATLHGLSSYIHWLNGEDSVDLEYSG PRGEKECRSLNEQRVCRVTAQAFRILTLKRVDGVILLPHGDHSGELTVQQVSYFLVPPKE HPGISEYFPYKDGDSSKPTQACFQPNHDFALIRPPRVGCGGGDSTLQGFFSEIQRRTRAS FLLVLWQTGSKSSHHAVISPSETGRPGGEELRLSSGQLRLSSQPKASINYQPCRDMGTEK VEGATEIEHQAIVLHELVGLYLEGMDTHRKQANQNLSLGFSELEPREDCLSPSDGSSPCM PDSGAF >gi568815597f:87232016_87440204|GENSCAN_predicted_CDS_1|1101_bp atggaggcctggggcagcagggaccgagtggcactgccatctggctttggttgcgtgggc tttggaaaaattggaagggaatcctgccctgccgttaggcactgctttctccctttgact ggttatactggaagtcattttgcagataagaccagcattaagttttgtgaaatagtggcg tccattggtgttgaaaggtggcatcatgtcaatgagttctggccaatagatgttgccaga agcaatgtgtaccatttccaggcctggcccacaaaatcaccctatgcaaccctccatggt ctctcctcctatattcactggttgaatggagaggactctgtagatttggagtatagtgga cccagaggagagaaagagtgtagatctctaaatgagcaaagagtttgtagagtaactgct caggctttcagaatcctcacgctcaagagggtcgatggggtcattttgctcccccatggt gaccactctggcgagctgactgttcagcaagtcagttactttctcgttcctcctaaagag catccaggaatttctgagtattttccatacaaggatggagactccagcaaaccaactcag gcctgctttcagcctaatcatgactttgctctgatacggccaccacgtgttgggtgtggg ggtggggacagtactctgcaaggtttcttctcagagattcagagaaggacaagggcctct tttcttttggttttgtggcagacaggctctaagtctagccatcatgctgtgataagccca agtgaaacagggagaccaggtggagaagaactgaggctctccagtggacagctcagatta agctcccaaccaaaagccagcatcaactatcagccatgtagagacatggggactgagaag gtggaaggagcaactgagattgaacatcaagctattgtcctccatgagctggttgggctt tatctggaagggatggacacccatcgaaaacaggccaatcagaatctttccctggggttt tctgaattggaaccaagagaagattgcctctctccctctgatggcagcagcccctgcatg ccggattcaggggctttctga >gi568815597f:87232016_87440204|GENSCAN_predicted_peptide_2|76_aa MAVMVSQNLRETAQECMLLATQRNNIFPRRTGHSAPHRPGPRAEEEDTPRHLSEDNWTRK QEPAGRTPSLRTGMNE >gi568815597f:87232016_87440204|GENSCAN_predicted_CDS_2|231_bp atggcggttatggtgtcacagaatctgcgagaaacagcacaagaatgcatgctgctggca acgcagaggaataatatcttcccaagacgaaccggacattctgctcctcaccgtccagga cccagagctgaagaggaggacacacctagacacctttctgaggacaactggacacggaag caggagccggcaggcaggaccccctctctgaggacaggaatgaatgaatga >gi568815597f:87232016_87440204|GENSCAN_predicted_peptide_3|134_aa MVSAAQASPEQLNAQQQAGHSLAPPEKGKLESAIETSVFPNQKQASLPLRKRQGLRNLKV SLVCNGPSWWAFREAGRVSVSPDSVAAPDSEPGAHYQSNPPGFIDDTPGGSAAPIAPGAE QGKPVDCAKAWPRN >gi568815597f:87232016_87440204|GENSCAN_predicted_CDS_3|405_bp atggtctcagcagcgcaggcctctccagagcagctgaatgcacagcagcaggcaggacac agtttggcccccccagaaaaagggaagctagagtctgccattgaaacctcagtgtttccc aatcagaaacaagcaagcttgcctcttagaaagagacagggactaaggaacttgaaagtc agcctcgtttgcaatggaccaagctggtgggctttcagagaagctggacgtgtctctgtt tccccggattcagttgcagcgccagattcagaaccaggagcccattaccagtcaaatcct ccaggcttcattgatgataccccaggaggttctgcagctccaattgcccctggcgctgag caggggaagccagtggactgtgctaaggcttggccccggaattag >gi568815597f:87232016_87440204|GENSCAN_predicted_peptide_4|534_aa MQDLSSQTGRDPFLRKMLRQPRRPFPGSRAASLAFHRRRLSQYCNIGEKQTMVNPGSSSQ PPPVTAGSLSWKRCAGCGGKIADRFLLYAMDSYWHSRCLKCSCCQAQLGDIGTSCYTKSG MILCRNDYISPLVGGAFNAAGTAAEEWSNLLARRQLHPSAPSSFGNSVSLPRTVFPPPPP RARRLARKVQREEEVRRPADRLLIFTCRYKCIYLSATLSETYLSLPKSSLENTARLFGNS GACSACGQSIPASELVMRAQGNVYHLKCFTCSTCRNRLVPGDRFHYINGSLFCEHDRPTA LINGHLNSLQSNPLLPDQKPLTCRGIVIKAQNPRIAIERWHLGHFGKITCTDDVLYPVRN LLTTAVFANAFDISLKRITSHVRGEPSSLMKALGKAFLFCAHGTLFLKVAIWRVTIWKRP SLSSALFTIAKTWNQPKCPSMVDWIKKMWYMYTVEFYAAIKRNEIMSFAGIWMKLEAIIL SKLMQEQKTKHRMFSHVCQVCPNEGRFLKTEIECSIPCVHPKLPCSVVYVPRTP >gi568815597f:87232016_87440204|GENSCAN_predicted_CDS_4|1605_bp atgcaggacttatcttcccaaacaggaagagacccatttctgagaaagatgctccgccag cccaggcgccccttccctggaagccgagcggcttcgctcgcatttcaccgccgccgcctc tcgcaatattgcaatataggggaaaagcagaccatggtgaatccgggcagcagctcgcag ccgcccccggtgacggccggctccctctcctggaagcggtgcgcaggctgcgggggcaag attgcggaccgctttctgctctatgccatggacagctattggcacagccggtgcctcaag tgctcctgctgccaggcgcagctgggcgacatcggcacgtcctgttacaccaaaagtggc atgatcctttgcagaaatgactacattagtccacttgttggaggagctttcaatgccgca gggaccgctgcagaagagtggagtaatctgctcgcccgccgccagctccacccaagcgcc ccaagtagcttcggaaactccgtttctcttcctcgaaccgtctttcctccacccccacct cgggcccgaagactggcgcggaaagtgcagcgagaggaggaagttcggagaccggcggat aggttgctgatatttacttgcaggtacaaatgtatatacctttcagccacactctcagaa acgtacctctccttgcccaaaagcagtttagaaaacacagccaggttatttggaaatagc ggtgcttgcagcgcttgcggacagtcgattcctgcgagtgaactcgtcatgagggcgcaa ggcaatgtgtatcatcttaagtgttttacatgctctacctgccggaatcgcctggtcccg ggagatcggtttcactacatcaatggcagtttattttgtgaacatgatagacctacagct ctcatcaatggccatttgaattcacttcagagcaatccactactgccagaccagaagcct ctaacatgtcggggtattgttattaaggcccagaatcccagaattgctatcgaacgctgg catttggggcattttggtaaaattacctgcactgatgatgtgctttatcctgttaggaat ttactaacaacagctgtatttgccaatgcatttgatatatcattaaagagaatcacttcc catgtacgtggagaaccaagtagtctcatgaaggctttgggaaaagcctttttgttctgt gctcatggaaccttatttttaaaagtcgctatctggcgtgtgactatctggaaaaggccg tcactgtcatcagcactattcacaattgcaaagacatggaaccaacccaaatgcccgtca atggtagactggataaagaaaatgtggtacatgtatactgtggaattctatgcagccata aaaaggaatgagatcatgtcctttgcaggaatatggatgaagctggaagccattatcctc agcaaactaatgcaggaacagaaaaccaaacaccgcatgttctcacatgtatgtcaggtc tgcccaaatgagggtcggtttctcaagacagaaattgaatgctctataccttgtgttcac cccaaactgccttgtagtgttgtgtatgtgccccgcaccccctga >gi568815597f:87232016_87440204|GENSCAN_predicted_peptide_5|190_aa MDIYAKIINKILANRIQQHINKLIRHDQVGFIPGMQGWFNIHKSINVIHHINRTNDKNHM IISIDAEKALDKIKHPIMLKTLNKLGIDGTYLKIIRAIYDKPIANIILNGQKLEAFPLKT DTRQGCPLSPLPFNIVLEVLARAIRQEKEIKGIQIGREEVKLSLFADDMIVYLENSAQNL LKLVSNFSKV >gi568815597f:87232016_87440204|GENSCAN_predicted_CDS_5|573_bp atggacatctatgcgaagatcatcaataaaatattggcaaaccgaatccagcagcacatt aacaagcttatccgccacgatcaagtcggcttcatccctgggatgcaaggctggttcaac atacacaaatcaataaatgtaatccatcacataaacagaaccaatgacaaaaaccacatg attatctcaatagatgcagaaaaggccttggataaaattaaacaccccatcatgctaaaa actctcaataaactaggtattgatggaacatatctcaaaataataagagccatttatgac aaacccatagccaatatcatactgaatgggcaaaagctggaagcattccctttgaaaact gacacaagacaaggatgccctctctcaccactcccattcaacatagtattggaagttctg gccagggcaatcaggcaagaaaaagaaataaagggcattcaaataggaagagaggaagtc aaattgtccctgtttgcagatgacatgattgtatatttagaaaactcagcccaaaatctc cttaagctggtaagcaacttcagcaaagtctga >gi568815597f:87232016_87440204|GENSCAN_predicted_peptide_6|85_aa MALGSKCKRTHCSSTKRYTCLGGDAVREYGELGNGHAELCSTTSRAMERIFLQQLKGMIK RQHKSYGSCPSIMLSEDNDDEGNDE >gi568815597f:87232016_87440204|GENSCAN_predicted_CDS_6|258_bp atggcccttggaagcaagtgcaagagaacacactgctcaagtacaaaacgatacacctgc ctaggtggagatgctgttagggagtacggtgagcttggcaatgggcatgctgaactctgc tccaccacaagcagagccatggagaggatttttctgcagcagttgaaaggaatgattaag aggcagcataagtcatatggatcctgtcctagcattatgctttctgaagataatgatgat gaaggaaatgatgaatga >gi568815597f:87232016_87440204|GENSCAN_predicted_peptide_7|52_aa MNLNAACRGIDLLSTLKFDEFKEQNEYMMQRLTQCEDDEDEDLYDDLLPLNE >gi568815597f:87232016_87440204|GENSCAN_predicted_CDS_7|159_bp atgaatctgaatgctgcatgcagagggatagatttactgtcaacccttaaattcgatgag tttaaagagcaaaatgagtatatgatgcagagactgactcaatgtgaagatgatgaggat gaagacctttatgatgatctacttccacttaatgaatag >gi568815597f:87232016_87440204|GENSCAN_predicted_peptide_8|83_aa MEAGLSIGYLFSVPLEHMETIILLQYTLNTDNMPDDGTRMAGHNPGVYSGNLQENCSEMP QCEAEHECRVLTIAAFDELSLYP >gi568815597f:87232016_87440204|GENSCAN_predicted_CDS_8|252_bp atggaggctgggctatccatagggtatttgttctctgtaccactagaacacatggagacc atcatcctcctgcaatatacactgaacactgacaacatgccagatgacggaaccaggatg gcaggacacaatccaggggtatatagcggtaacctccaagaaaactgctcagaaatgccc cagtgtgaggctgagcatgagtgcagagtgctcacaattgctgcatttgatgagttaagt ttatacccctga >gi568815597f:87232016_87440204|GENSCAN_predicted_peptide_9|122_aa MVCPEFVPSDVQMCPKFLPSGEFMVSQTSGMKPQTLAVSATAHKGSVDPKRFWLQGLHLD LGKVEEHSENWTAESDELASYPNTLCLWAFLSRYLMYYWRSKCNKAKMSSFSVKARPQLQ CD >gi568815597f:87232016_87440204|GENSCAN_predicted_CDS_9|369_bp atggtgtgtccagagtttgttccttcagatgttcagatgtgtccaaagtttcttccttct ggtgagttcatggtctcacagacttcaggaatgaagccgcagacccttgcagtgagtgct acagctcataaaggtagtgtggacccaaagagattctggctccaaggactccatttggac ttaggaaaagtagaagagcactctgaaaactggacagcggaaagtgatgaactggcatcg taccccaacacactctgcttgtgggccttcctctccagatacctgatgtattactggaga agcaaatgcaacaaggcaaagatgtcttcattttctgtcaaagcacggcctcaactacag tgtgattga