GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:56:18 Sequence gi568815586f:12808250_13012536 : 204287 bp : 44.63% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 609 655 47 0 2 115 55 59 0.194 3.35 1.02 Intr + 5093 5205 113 0 2 39 85 106 0.223 5.50 1.03 Intr + 5882 5975 94 1 1 29 100 54 0.961 0.24 1.04 Intr + 12959 13147 189 0 0 101 103 81 0.981 10.46 1.05 Intr + 13716 13834 119 1 2 71 83 128 0.999 10.78 1.06 Intr + 14412 14483 72 2 0 89 62 89 0.984 6.00 1.07 Intr + 14954 15070 117 1 0 87 93 17 0.869 2.76 1.08 Intr + 15621 15767 147 2 0 59 110 5 0.510 0.23 1.09 Intr + 16291 16428 138 0 0 61 113 98 0.992 10.26 1.10 Intr + 17751 17821 71 2 2 105 98 33 0.995 3.98 1.11 Intr + 18997 19126 130 1 1 69 68 150 0.969 11.80 1.12 Term + 21174 21305 132 2 0 126 37 184 0.991 15.19 1.13 PlyA + 21705 21710 6 1.05 2.00 Prom + 25939 25978 40 -2.96 2.01 Init + 27475 27659 185 0 2 99 75 23 0.246 0.70 2.02 Term + 31608 31701 94 0 1 77 54 89 0.805 1.70 2.03 PlyA + 32108 32113 6 1.05 3.00 Prom + 41932 41971 40 -2.66 3.01 Sngl + 42616 43455 840 0 0 86 49 518 0.697 43.45 3.02 PlyA + 44107 44112 6 1.05 4.00 Prom + 49859 49898 40 -6.16 4.01 Sngl + 67250 67861 612 1 0 96 52 956 0.890 87.00 4.02 PlyA + 67866 67871 6 1.05 5.08 PlyA - 71496 71491 6 1.05 5.07 Term - 77590 77534 57 1 0 87 44 67 0.068 -0.11 5.06 Intr - 82467 82380 88 2 1 118 18 41 0.171 0.07 5.05 Intr - 83467 83249 219 0 0 84 72 110 0.639 6.42 5.04 Intr - 84639 84557 83 0 2 104 64 68 0.906 4.44 5.03 Intr - 92222 92187 36 2 0 121 108 -21 0.688 1.66 5.02 Intr - 93129 93011 119 1 2 99 107 -41 0.350 -0.92 5.01 Init - 95961 95808 154 0 1 58 63 86 0.474 3.24 5.00 Prom - 98409 98370 40 -4.16 6.00 Prom + 98437 98476 40 -11.14 6.01 Init + 100001 100922 922 1 1 75 87 990 0.962 91.74 6.02 Intr + 103835 103893 59 0 2 133 111 66 0.999 12.00 6.03 Intr + 105274 105299 26 0 2 58 84 60 0.252 -0.58 6.04 Intr + 119455 119554 100 1 1 97 103 19 0.131 4.41 6.05 Term + 127118 127216 99 1 0 45 37 122 0.475 0.93 6.06 PlyA + 127274 127279 6 1.05 7.05 PlyA - 128153 128148 6 1.05 7.04 Term - 132600 132526 75 0 0 105 35 27 0.173 -3.06 7.03 Intr - 134079 134012 68 1 2 109 99 41 0.621 5.92 7.02 Intr - 135687 135637 51 1 0 141 19 33 0.254 0.58 7.01 Init - 142135 141241 895 1 1 86 73 786 0.487 71.40 7.00 Prom - 143205 143166 40 -4.26 8.05 PlyA - 148941 148936 6 1.05 8.04 Term - 167230 167059 172 0 1 88 48 297 0.991 23.00 8.03 Intr - 179083 178903 181 2 1 137 75 126 0.882 15.13 8.02 Intr - 181166 181028 139 2 1 131 96 236 0.984 28.74 8.01 Init - 186817 186758 60 1 0 84 61 18 0.258 0.15 8.00 Prom - 190868 190829 40 -3.56 9.00 Prom + 192847 192886 40 -7.96 9.01 Init + 193253 193318 66 1 0 82 59 64 0.211 4.06 9.02 Intr + 193581 193786 206 2 2 25 82 436 0.819 34.70 9.03 Intr + 194030 194138 109 2 1 62 29 98 0.674 1.59 9.04 Term + 194288 194581 294 1 0 75 42 165 0.798 5.91 9.05 PlyA + 194606 194611 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 55039 54944 96 0 0 95 74 76 0.856 6.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:12808250_13012536|GENSCAN_predicted_peptide_1|456_aa XKVCVAAFEEESCFRLYPLPETSHKMAAPEEHDSPTEASQPIVEEEETKTFKDLGVTDVL CEACDQLGWTKPTKIQIEAIPLALQGRDIIGLAETGSGKTGAFALPILNALLETPQRLFA LVLTPTRELAFQISEQFEALGSSIGVQSATPGRLIDHLENTKGFNLRALKYLVMDEADRI LNMDFETEVDKILKVIPRDRKTFLFSATMTKKVQKLQRAALKNPVKCAVSSKYQTVEKLQ QYYIFIPSKFKDTYLVYILNELAGNSFMIFCSTCNNTQRTALLLRNLGFTAIPLHGQMSQ SKRLGSLNKFKAKARSILLATDVASRGLDIPHVDVVVNFDIPTHSKDYIHRVGRTARAGR SGKAITFVTQYDVELFQRIEHLIGKKLPGFPTQDDEVMMLTERVAEAQRFARMELREHGE KKKRSREDAGDNDDTEGAIGVRNKVAGGKMKKRKGR >gi568815586f:12808250_13012536|GENSCAN_predicted_CDS_1|1371_bp nnaaaagtctgtgtggctgcttttgaagaagaaagctgctttcgattatacccacttccg gagacctcacacaagatggcggcacccgaggaacacgattctccgaccgaagcgtcccag ccgattgtggaagaggaggaaactaaaacatttaaagacctgggtgtgacagatgtgttg tgtgaagcttgtgaccagttgggatggacaaaacccaccaagatccagattgaagctatt cctttggccttacaaggtcgtgatatcattgggcttgcagaaactggctctggaaagaca ggcgcctttgctttgcccattctaaacgcactgctggagaccccgcagcgtttgtttgcc ctagttcttaccccgactcgggagctggcctttcagatctcagagcagtttgaagccctg gggtcctctattggagtgcagagtgcaactcctggtcgactgattgaccacttggaaaat acgaaaggtttcaacttgagagctctcaaatacttggtcatggatgaagccgaccgaata ctgaatatggattttgagacagaggttgacaagatcctcaaagtgattcctcgagatcgg aaaacattcctcttctctgccaccatgaccaagaaggttcaaaaacttcagcgagcagct ctgaagaatcctgtgaaatgtgccgtttcctctaaataccagacagttgaaaaattacag caatattatatttttattccctctaaattcaaggatacctacctggtttatattctaaat gaattggctggaaactcctttatgatattctgcagcacctgtaataatacccagagaaca gctttgctactgcgaaatcttggcttcactgccatccccctccatggacaaatgagtcag agtaagcgcctaggatcccttaataagtttaaggccaaggcccgttccattcttctagca actgacgttgccagccgaggtttggacatacctcatgtagatgtggttgtcaactttgac attcctacccattccaaggattacatccatcgagtaggtcgaacagctagagctgggcgc tccggaaaggctattacttttgtcacacagtatgatgtggaactcttccagcgcatagaa cacttaattgggaagaaactaccaggttttccaacacaggatgatgaggttatgatgctg acagaacgcgtcgctgaagcccaaaggtttgcccgaatggagttaagggagcatggagaa aagaagaaacgctcgcgagaggatgctggagataatgatgacacagagggtgctattggt gtcaggaacaaggtggctggaggaaaaatgaagaagcggaaaggccgttaa >gi568815586f:12808250_13012536|GENSCAN_predicted_peptide_2|92_aa MQSLGGGPQGLLRVLELLFARSHPSPYTIQTQRMSPVTVTLQSSASAVKEAGRSRRNWER NSIYVWVNSELCVITKGSLEGDERALQGQVSD >gi568815586f:12808250_13012536|GENSCAN_predicted_CDS_2|279_bp atgcagtctctaggaggtgggccccaaggtctgctcagagtcttggagcttttgtttgct cgtagccatccatccccttataccatccaaacacaaaggatgtctcctgtcactgtcacc cttcaatcatctgccagtgccgtcaaagaggccggaaggagcaggaggaattgggaaagg aacagcatctatgtctgggtgaactcagaattatgtgtgatcaccaagggcagccttgaa ggggacgagagggccctacaaggacaagtctctgactga >gi568815586f:12808250_13012536|GENSCAN_predicted_peptide_3|279_aa MVESKGGARHVLHGGKQERSRGELPLESHQVLEAFSCFKMKLNISFPVTGCQKLIEVDDE CKLRTFYEKLMATEVAADTLGEEWKGYVVRISGGNNKQGFPMKQGVLTHGRVHLLLSKGH SCYRPRRTGERKRKSVRGCIVDANLSILNLIIVKKKKKVKKDIPGLTDTMVPCRLGPKKA SRICKLSNLSEEDDVRQYVVRKPSNKGGKKPRTKAPKIQHLVTPHFLQHKGQHIALKKPC TKKNKEEAAEYAKLLGKGMKEAKEKRQEQIAKRHRLSSL >gi568815586f:12808250_13012536|GENSCAN_predicted_CDS_3|840_bp atggtggaaagcaaaggaggagcaaggcatgtcttacatggtggcaagcaagagagaagt agaggggaactccctttagaaagccatcaggtcttggaggcattcagctgcttcaagatg aagctgaacatctccttcccggtcactggctgccagaaactcattgaagtggatgatgaa tgcaaacttcgtactttttatgagaagcttatggccacagaagttgctgctgacactctg ggtgaagaatggaaaggttatgtggtccgaatcagtggtgggaacaacaaacaaggtttt cccatgaagcagggtgtcttgacccatggccgtgtccacctgctactgagtaaggggcat tcctgttacagaccaaggagaactggagaaagaaagagaaaatcagttcgtggttgcata gtggatgccaatctgagcattctcaacttgattattgttaaaaaaaaaaaaaaagtaaag aaggatattcctggactgactgatactatggtgccttgtcgcctggggcccaaaaaagct agcagaatctgcaaactttccaatctctctgaagaagatgatgtccgccagtatgttgta agaaagccctcaaacaaaggaggtaagaaacctaggaccaaagcacccaagattcagcat cttgttactccacatttcctgcagcacaaagggcagcatattgctctgaagaagccgtgt actaagaaaaataaggaagaggctgcagaatatgctaaacttttaggcaagggaatgaag gaggctaaagagaagcgccaggaacaaattgcgaagagacacagactttcctctctgtga >gi568815586f:12808250_13012536|GENSCAN_predicted_peptide_4|203_aa MAEVQVLVLDGRGHLLGHLAAIVAKQVLLGRKVVVVCCEGINISGNFYRNKLKYLAFLRK RMNSNPSRGPYPLQAPSRIFWQTMRGMPPHKTKPGQAALDCLKVFDGIPPPYDKKKRMVV PAALKVVRLKPARKFAYLGRLAHEVGWKYQAVTATLEEKRKEKAKIHYRKKKQFMRLWKQ AEKNVEKKIDKYTEVLKTHGLLV >gi568815586f:12808250_13012536|GENSCAN_predicted_CDS_4|612_bp atggcggaggtacaggtcctggtgcttgatggtcgaggccatctcctgggccacctggcg gccatcgtggctaaacaggtactgctgggccggaaggtggtggttgtatgctgtgaaggc atcaacatttctggcaatttctacagaaacaagttgaagtacctggctttcctccgcaag cggatgaacagcaacccttcccgaggcccctaccccctccaggcccccagccgcatcttc tggcagaccatgcgaggtatgccgccccacaagaccaagccaggccaggccgctctggac tgcctcaaggtgtttgacggcatcccaccgccctacgacaagaaaaagcggatggtggtt cctgctgccctcaaggttgtgcgtctgaagcctgcaagaaagtttgcctatctggggcgc ctggctcacgaggttggctggaagtaccaggcagtgacagccaccctggaggagaagagg aaagagaaagccaagatccactaccggaagaagaaacagttcatgaggctatggaaacag gccgagaagaacgtggagaagaaaattgacaaatacacagaagtcctcaagacccacgga ctcctggtctga >gi568815586f:12808250_13012536|GENSCAN_predicted_peptide_5|251_aa MRALPGYTVNVEEEDPISFKGGERKELRPCALHCVSPHPGRLITRVLSSRKGCQKLHFGS EIRSAWPCLNPAPILVAAGGSQESMNPEVLLGEGVSAARLCEQPIVSEVAEVFPFRNEPT AVADSAEPFNRRFKSGGPSALGSWRLTYPSAKALAVNLVLLPARKRAAGELLSAGGDLGQ LSRPRTLSSCYKGGPRRREGTSEGIKPIGPKLVFVYVWNDPGVAARLCRGSSQQQYTFEM YTFEIFAYASM >gi568815586f:12808250_13012536|GENSCAN_predicted_CDS_5|756_bp atgagggctctcccaggctacaccgtgaatgtggaagaggaagaccctatttccttcaag ggcggcgagcggaaggaactcaggccctgtgcacttcactgtgtctctccacaccccggc agattaatcaccagggtgttgtctagcaggaaaggctgccaaaaattgcactttgggtct gagattaggagcgcctggccatgtctgaaccccgctcctatcttagtggccgcaggtggc tcccaggaaagcatgaatcctgaggttttactgggagaaggggtttcagctgccaggctg tgtgaacaacccattgtgtctgaagtagctgaagtgttccccttcaggaatgaacccaca gcagtggctgattcggccgaacccttcaacagacgatttaaatccggaggccccagcgct ctgggctcctggcgcctcacttaccctagtgccaaggcgttggccgtgaacttggtgctg cttcccgcgcgcaagagggcagcaggcgagctcctcagtgctgggggagaccttggacag ctatcccgccctcgcactctgagcagttgttataaaggcggccctcgccggagggaggga acgagcgaggggatcaagccaatcggaccgaaactcgtctttgtttacgtgtggaacgat cctggagtggctgcccgcctgtgtcggggctcaagccagcaacaatacacctttgaaatg tacacctttgaaatttttgcatatgccagcatgtga >gi568815586f:12808250_13012536|GENSCAN_predicted_peptide_6|401_aa MATTVPDGCRNGLKSKYYRLCDKAEAWGIVLETVATAGVVTSVAFMLTLPILVCKVQDSN RRKMLPTQFLFLLGVLGIFGLTFAFIIGLDGSTGPTRFFLFGILFSICFSCLLAHAVSLT KLVRGRKPLSLLVILGLAVGFSLVQDVIAIEYIVLTMNRTNVNVFSELSAPRRNEDFVLL LTYVLFLMALTFLMSSFTFCGSFTGWKRHGAHIYLTMLLSIAIWVAWITLLMLPDFDRRW DDTILSSALAANGWVFLLAYVSPEFWLLTKQRNPMDYPVEDAFCKPQLVKKSYGVENRAY SQEEITQGFEETGDTLYAPYSTHFQLQIYRIYGCILCWQATAQVWTAYTKGRVRGEGTQD SGSMPAYKTTIKGSFQEKVDLELIVEEENKEEEGTKLADIE >gi568815586f:12808250_13012536|GENSCAN_predicted_CDS_6|1206_bp atggctacaacagtccctgatggttgccgcaatggcctgaaatccaagtactacagactt tgtgataaggctgaagcttggggcatcgtcctagaaacggtggccacagccggggttgtg acctcggtggccttcatgctcactctcccgatcctcgtctgcaaggtgcaggactccaac aggcgaaaaatgctgcctactcagtttctcttcctcctgggtgtgttgggcatctttggc ctcaccttcgccttcatcatcggactggacgggagcacagggcccacacgcttcttcctc tttgggatcctcttttccatctgcttctcctgcctgctggctcatgctgtcagtctgacc aagctcgtccgggggaggaagcccctttccctgttggtgattctgggtctggccgtgggc ttcagcctagtccaggatgttatcgctattgaatatattgtcctgaccatgaataggacc aacgtcaatgtcttttctgagctttccgctcctcgtcgcaatgaagactttgtcctcctg ctcacctacgtcctcttcttgatggcgctgaccttcctcatgtcctccttcaccttctgt ggttccttcacgggctggaagagacatggggcccacatctacctcacgatgctcctctcc attgccatctgggtggcctggatcaccctgctcatgcttcctgactttgaccgcaggtgg gatgacaccatcctcagctccgccttggctgccaatggctgggtgttcctgttggcttat gttagtcccgagttttggctgctcacaaagcaacgaaaccccatggattatcctgttgag gatgctttctgtaaacctcaactcgtgaagaagagctatggtgtggagaacagagcctac tctcaagaggaaatcactcaaggttttgaagagacaggggacacgctctatgccccctat tccacacattttcagctgcagatttaccgcatttacggctgcattctgtgctggcaggcc actgcgcaagtgtggacagcctacaccaagggaagagtcaggggagaaggaacacaagac tccggaagcatgccagcgtataaaaccaccatcaaaggaagtttccaggaaaaggtagat ctcgaattaattgtggaagaagaaaacaaagaagaagaaggcacgaagttggccgatata gaatag >gi568815586f:12808250_13012536|GENSCAN_predicted_peptide_7|362_aa MYKDCIESTGDYFLLCDAEGPWGIILESLAILGIVVTILLLLAFLFLMRKIQDCSQWNVL PTQLLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLFGVLFALCFSCLLAHASNLVKLVRG CVSFSWTTILCIAIGCSLLQIIIATEYVTLIMTRGMMFVNMTPCQLNVDFVVLLVYVLFL MALTFFVSKATFCGPCENWKQHGRLIFITVLFSIIIWVVWISMLLRGNPQFQRQPQWDDP VVCIALVTNAWVFLLLYIVPELCILYRSCRQECPLQGNACPVTAYQHSFQVENQELSRDK WKVLLNSDFLSHSGAARDSDGAEEDVALTSYGTPIQPQTVDPTQECFIPQAKLSPQQDAG GV >gi568815586f:12808250_13012536|GENSCAN_predicted_CDS_7|1089_bp atgtacaaggactgcatcgagtccactggagactattttcttctctgtgacgccgagggg ccatggggcatcattctggagtccctggccatacttggcatcgtggtcacaattctgcta ctcttagcatttctcttcctcatgcgaaagatccaagactgcagccagtggaatgtcctc cccacccagctcctcttcctcctgagtgtcctggggctcttcggactcgcttttgccttc atcatcgagctcaatcaacaaactgcccccgtacgctactttctctttggggttctcttt gctctctgtttctcatgcctcttagctcatgcctccaatctagtgaagctggttcggggt tgtgtctccttctcctggacgacaattctgtgcattgctattggttgcagtctgttgcaa atcattattgccactgagtatgtgactctcatcatgaccagaggtatgatgtttgtgaat atgacaccctgccagctcaatgtggactttgttgtactcctggtctatgtcctcttcctg atggccctcacattcttcgtctccaaagccaccttctgtggcccgtgtgagaactggaag cagcatggaaggctcatctttatcactgtgctcttctccatcatcatctgggtggtgtgg atctccatgctcctgagaggcaacccgcagttccagcgacagccccagtgggacgacccg gtcgtctgcattgctctggtcaccaacgcatgggttttcctgctgctgtacatcgtccct gagctctgcattctctacagatcgtgtagacaggagtgccctttacaaggcaatgcctgc cccgtcacagcctaccaacacagcttccaagtggagaaccaggagctctccagagataaa tggaaggtcttactcaactcggacttcctatcacacagtggtgcagcccgagacagtgat ggagctgaggaggatgtagcattaacttcatatggtactcccattcagccgcagactgtt gatcccacacaagagtgtttcatcccacaggctaaactaagcccccagcaagatgcagga ggagtataa >gi568815586f:12808250_13012536|GENSCAN_predicted_peptide_8|183_aa MASKETAESCRAGAVSVAGNEEVAYEERACEGGKFATVEVTDKPVDEALREAMPKVAKYA GGTNDKGIGMGMTVPISFAVFPNEDGSLQKKLKVWFRIPNQFQSDPPAPSDKSVKIEERE GITVYSMQFGGYAKEADYVAQATRLRAALEGTATYRGDIYFCTGYDPPMKPYGRRNEIWL LKT >gi568815586f:12808250_13012536|GENSCAN_predicted_CDS_8|552_bp atggcatcaaaggaaacagcagaaagctgcagagcaggtgctgtgagtgtggctggcaac gaagaagttgcctatgaagaaagggcctgtgaaggcggcaaatttgccacagtagaagtg acagataagcctgtggatgaggctctacgggaagcaatgcccaaggtcgcaaagtatgcg gggggcaccaatgacaagggaattgggatggggatgacagtccctatttcctttgctgtg ttccccaatgaagatggctctctgcagaagaaattaaaagtctggttccggattccaaac caatttcaaagcgacccaccagctcccagtgacaaaagcgttaagattgaggaacgggaa ggcatcactgtctattccatgcagtttggtggttatgccaaggaagcagactacgtagca caagccacccgtctgcgtgctgccctggagggcacagccacctaccggggggacatctac ttctgcacgggttatgaccctcccatgaagccctacggacggcgcaatgagatctggctg ttgaagacatga >gi568815586f:12808250_13012536|GENSCAN_predicted_peptide_9|224_aa MDVNSSGHPDLYGRLCSFLLPEKLRQPSNYLIVSMALANLSVAMAVMPFISVTDLIGGKW IFGHFFCNVFSMNVMCCTAWILTLYVISIDRFTRPPGKARPNTGYLASLEWSQTAVVTLN GTVKFQEPDPSVYGTACSCIPLWVERIFPWLGYANSLINPFIYAFFNWDLRTTYCSRLQC QYQNINQTLSAAGMHEALKLAERPERPEFVLQNSDYCRKKSHDS >gi568815586f:12808250_13012536|GENSCAN_predicted_CDS_9|675_bp atggatgttaacagcagcggccacccggacctctacgggcgcctctgctctttcctcctg ccggagaagctccgccagccctccaactacctgatcgtgtccatggcgctggccaacctc tcggtggccatggcggtcatgcccttcatcagtgtcaccgacctcatcgggggcaagtgg atctttggacactttttctgtaacgtcttctccatgaatgtcatgtgctgcacggcctgg atcttgaccttgtacgtgatcagcatcgacagatttacaaggccgccaggaaaagcgcgg cctaacacaggttacctggcttccctcgagtggagccagacagcagtagtcaccctgaat ggcacagtgaagttccaggagccagacccttctgtctatggcactgcctgcagctgcatc ccactgtgggtggagaggatatttccatggctgggctatgcaaactctctcattaaccct tttatttatgccttcttcaactgggacctgaggaccacctattgcagccggctccagtgc cagtaccagaatatcaaccagacactctcagctgcaggcatgcatgaagccctgaagctt gctgagaggccagagagacctgagtttgtcctacaaaactctgactactgtagaaaaaaa agtcatgattcatga