GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:55:50 Sequence gi568815586r:12840778_13050384 : 209607 bp : 44.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 9404 9443 40 -2.66 1.01 Sngl + 10088 10927 840 1 0 86 49 518 0.697 43.45 1.02 PlyA + 11579 11584 6 1.05 2.00 Prom + 17331 17370 40 -6.16 2.01 Sngl + 34722 35333 612 2 0 96 52 956 0.890 87.00 2.02 PlyA + 35338 35343 6 1.05 3.08 PlyA - 38968 38963 6 1.05 3.07 Term - 45062 45006 57 2 0 87 44 67 0.068 -0.11 3.06 Intr - 49939 49852 88 0 1 118 18 41 0.171 0.07 3.05 Intr - 50939 50721 219 1 0 84 72 110 0.639 6.42 3.04 Intr - 52111 52029 83 1 2 104 64 68 0.906 4.44 3.03 Intr - 59694 59659 36 0 0 121 108 -21 0.688 1.66 3.02 Intr - 60601 60483 119 2 2 99 107 -41 0.350 -0.92 3.01 Init - 63433 63280 154 1 1 58 63 86 0.474 3.24 3.00 Prom - 65881 65842 40 -4.16 4.00 Prom + 65909 65948 40 -11.14 4.01 Init + 67473 68394 922 2 1 75 87 990 0.962 91.74 4.02 Intr + 71307 71365 59 1 2 133 111 66 0.999 12.00 4.03 Intr + 72746 72771 26 1 2 58 84 60 0.252 -0.58 4.04 Intr + 86927 87026 100 2 1 97 103 19 0.131 4.41 4.05 Term + 94590 94688 99 2 0 45 37 122 0.475 0.93 4.06 PlyA + 94746 94751 6 1.05 5.05 PlyA - 95625 95620 6 1.05 5.04 Term - 100072 99998 75 1 0 105 35 27 0.173 -3.06 5.03 Intr - 101551 101484 68 2 2 109 99 41 0.621 5.92 5.02 Intr - 103159 103109 51 2 0 141 19 33 0.254 0.58 5.01 Init - 109607 108713 895 2 1 86 73 786 0.487 71.40 5.00 Prom - 110677 110638 40 -4.26 6.05 PlyA - 116413 116408 6 1.05 6.04 Term - 134702 134531 172 1 1 88 48 297 0.991 23.00 6.03 Intr - 146555 146375 181 0 1 137 75 126 0.882 15.13 6.02 Intr - 148638 148500 139 0 1 131 96 236 0.984 28.74 6.01 Init - 154289 154230 60 2 0 84 61 18 0.258 0.15 6.00 Prom - 158340 158301 40 -3.56 7.00 Prom + 160319 160358 40 -7.96 7.01 Init + 160725 160790 66 2 0 82 59 64 0.211 4.06 7.02 Intr + 161053 161258 206 0 2 25 82 436 0.819 34.70 7.03 Intr + 161502 161610 109 0 1 62 29 98 0.673 1.59 7.04 Term + 161760 162053 294 2 0 75 42 165 0.775 5.91 7.05 PlyA + 162078 162083 6 1.05 8.00 Prom + 175276 175315 40 -2.46 8.01 Init + 180554 180879 326 1 2 80 67 144 0.467 8.32 8.02 Intr + 196030 196088 59 1 2 117 70 33 0.456 2.93 8.03 Intr + 203553 203663 111 1 0 49 90 138 0.333 10.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 22511 22416 96 1 0 95 74 76 0.856 6.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:12840778_13050384|GENSCAN_predicted_peptide_1|279_aa MVESKGGARHVLHGGKQERSRGELPLESHQVLEAFSCFKMKLNISFPVTGCQKLIEVDDE CKLRTFYEKLMATEVAADTLGEEWKGYVVRISGGNNKQGFPMKQGVLTHGRVHLLLSKGH SCYRPRRTGERKRKSVRGCIVDANLSILNLIIVKKKKKVKKDIPGLTDTMVPCRLGPKKA SRICKLSNLSEEDDVRQYVVRKPSNKGGKKPRTKAPKIQHLVTPHFLQHKGQHIALKKPC TKKNKEEAAEYAKLLGKGMKEAKEKRQEQIAKRHRLSSL >gi568815586r:12840778_13050384|GENSCAN_predicted_CDS_1|840_bp atggtggaaagcaaaggaggagcaaggcatgtcttacatggtggcaagcaagagagaagt agaggggaactccctttagaaagccatcaggtcttggaggcattcagctgcttcaagatg aagctgaacatctccttcccggtcactggctgccagaaactcattgaagtggatgatgaa tgcaaacttcgtactttttatgagaagcttatggccacagaagttgctgctgacactctg ggtgaagaatggaaaggttatgtggtccgaatcagtggtgggaacaacaaacaaggtttt cccatgaagcagggtgtcttgacccatggccgtgtccacctgctactgagtaaggggcat tcctgttacagaccaaggagaactggagaaagaaagagaaaatcagttcgtggttgcata gtggatgccaatctgagcattctcaacttgattattgttaaaaaaaaaaaaaaagtaaag aaggatattcctggactgactgatactatggtgccttgtcgcctggggcccaaaaaagct agcagaatctgcaaactttccaatctctctgaagaagatgatgtccgccagtatgttgta agaaagccctcaaacaaaggaggtaagaaacctaggaccaaagcacccaagattcagcat cttgttactccacatttcctgcagcacaaagggcagcatattgctctgaagaagccgtgt actaagaaaaataaggaagaggctgcagaatatgctaaacttttaggcaagggaatgaag gaggctaaagagaagcgccaggaacaaattgcgaagagacacagactttcctctctgtga >gi568815586r:12840778_13050384|GENSCAN_predicted_peptide_2|203_aa MAEVQVLVLDGRGHLLGHLAAIVAKQVLLGRKVVVVCCEGINISGNFYRNKLKYLAFLRK RMNSNPSRGPYPLQAPSRIFWQTMRGMPPHKTKPGQAALDCLKVFDGIPPPYDKKKRMVV PAALKVVRLKPARKFAYLGRLAHEVGWKYQAVTATLEEKRKEKAKIHYRKKKQFMRLWKQ AEKNVEKKIDKYTEVLKTHGLLV >gi568815586r:12840778_13050384|GENSCAN_predicted_CDS_2|612_bp atggcggaggtacaggtcctggtgcttgatggtcgaggccatctcctgggccacctggcg gccatcgtggctaaacaggtactgctgggccggaaggtggtggttgtatgctgtgaaggc atcaacatttctggcaatttctacagaaacaagttgaagtacctggctttcctccgcaag cggatgaacagcaacccttcccgaggcccctaccccctccaggcccccagccgcatcttc tggcagaccatgcgaggtatgccgccccacaagaccaagccaggccaggccgctctggac tgcctcaaggtgtttgacggcatcccaccgccctacgacaagaaaaagcggatggtggtt cctgctgccctcaaggttgtgcgtctgaagcctgcaagaaagtttgcctatctggggcgc ctggctcacgaggttggctggaagtaccaggcagtgacagccaccctggaggagaagagg aaagagaaagccaagatccactaccggaagaagaaacagttcatgaggctatggaaacag gccgagaagaacgtggagaagaaaattgacaaatacacagaagtcctcaagacccacgga ctcctggtctga >gi568815586r:12840778_13050384|GENSCAN_predicted_peptide_3|251_aa MRALPGYTVNVEEEDPISFKGGERKELRPCALHCVSPHPGRLITRVLSSRKGCQKLHFGS EIRSAWPCLNPAPILVAAGGSQESMNPEVLLGEGVSAARLCEQPIVSEVAEVFPFRNEPT AVADSAEPFNRRFKSGGPSALGSWRLTYPSAKALAVNLVLLPARKRAAGELLSAGGDLGQ LSRPRTLSSCYKGGPRRREGTSEGIKPIGPKLVFVYVWNDPGVAARLCRGSSQQQYTFEM YTFEIFAYASM >gi568815586r:12840778_13050384|GENSCAN_predicted_CDS_3|756_bp atgagggctctcccaggctacaccgtgaatgtggaagaggaagaccctatttccttcaag ggcggcgagcggaaggaactcaggccctgtgcacttcactgtgtctctccacaccccggc agattaatcaccagggtgttgtctagcaggaaaggctgccaaaaattgcactttgggtct gagattaggagcgcctggccatgtctgaaccccgctcctatcttagtggccgcaggtggc tcccaggaaagcatgaatcctgaggttttactgggagaaggggtttcagctgccaggctg tgtgaacaacccattgtgtctgaagtagctgaagtgttccccttcaggaatgaacccaca gcagtggctgattcggccgaacccttcaacagacgatttaaatccggaggccccagcgct ctgggctcctggcgcctcacttaccctagtgccaaggcgttggccgtgaacttggtgctg cttcccgcgcgcaagagggcagcaggcgagctcctcagtgctgggggagaccttggacag ctatcccgccctcgcactctgagcagttgttataaaggcggccctcgccggagggaggga acgagcgaggggatcaagccaatcggaccgaaactcgtctttgtttacgtgtggaacgat cctggagtggctgcccgcctgtgtcggggctcaagccagcaacaatacacctttgaaatg tacacctttgaaatttttgcatatgccagcatgtga >gi568815586r:12840778_13050384|GENSCAN_predicted_peptide_4|401_aa MATTVPDGCRNGLKSKYYRLCDKAEAWGIVLETVATAGVVTSVAFMLTLPILVCKVQDSN RRKMLPTQFLFLLGVLGIFGLTFAFIIGLDGSTGPTRFFLFGILFSICFSCLLAHAVSLT KLVRGRKPLSLLVILGLAVGFSLVQDVIAIEYIVLTMNRTNVNVFSELSAPRRNEDFVLL LTYVLFLMALTFLMSSFTFCGSFTGWKRHGAHIYLTMLLSIAIWVAWITLLMLPDFDRRW DDTILSSALAANGWVFLLAYVSPEFWLLTKQRNPMDYPVEDAFCKPQLVKKSYGVENRAY SQEEITQGFEETGDTLYAPYSTHFQLQIYRIYGCILCWQATAQVWTAYTKGRVRGEGTQD SGSMPAYKTTIKGSFQEKVDLELIVEEENKEEEGTKLADIE >gi568815586r:12840778_13050384|GENSCAN_predicted_CDS_4|1206_bp atggctacaacagtccctgatggttgccgcaatggcctgaaatccaagtactacagactt tgtgataaggctgaagcttggggcatcgtcctagaaacggtggccacagccggggttgtg acctcggtggccttcatgctcactctcccgatcctcgtctgcaaggtgcaggactccaac aggcgaaaaatgctgcctactcagtttctcttcctcctgggtgtgttgggcatctttggc ctcaccttcgccttcatcatcggactggacgggagcacagggcccacacgcttcttcctc tttgggatcctcttttccatctgcttctcctgcctgctggctcatgctgtcagtctgacc aagctcgtccgggggaggaagcccctttccctgttggtgattctgggtctggccgtgggc ttcagcctagtccaggatgttatcgctattgaatatattgtcctgaccatgaataggacc aacgtcaatgtcttttctgagctttccgctcctcgtcgcaatgaagactttgtcctcctg ctcacctacgtcctcttcttgatggcgctgaccttcctcatgtcctccttcaccttctgt ggttccttcacgggctggaagagacatggggcccacatctacctcacgatgctcctctcc attgccatctgggtggcctggatcaccctgctcatgcttcctgactttgaccgcaggtgg gatgacaccatcctcagctccgccttggctgccaatggctgggtgttcctgttggcttat gttagtcccgagttttggctgctcacaaagcaacgaaaccccatggattatcctgttgag gatgctttctgtaaacctcaactcgtgaagaagagctatggtgtggagaacagagcctac tctcaagaggaaatcactcaaggttttgaagagacaggggacacgctctatgccccctat tccacacattttcagctgcagatttaccgcatttacggctgcattctgtgctggcaggcc actgcgcaagtgtggacagcctacaccaagggaagagtcaggggagaaggaacacaagac tccggaagcatgccagcgtataaaaccaccatcaaaggaagtttccaggaaaaggtagat ctcgaattaattgtggaagaagaaaacaaagaagaagaaggcacgaagttggccgatata gaatag >gi568815586r:12840778_13050384|GENSCAN_predicted_peptide_5|362_aa MYKDCIESTGDYFLLCDAEGPWGIILESLAILGIVVTILLLLAFLFLMRKIQDCSQWNVL PTQLLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLFGVLFALCFSCLLAHASNLVKLVRG CVSFSWTTILCIAIGCSLLQIIIATEYVTLIMTRGMMFVNMTPCQLNVDFVVLLVYVLFL MALTFFVSKATFCGPCENWKQHGRLIFITVLFSIIIWVVWISMLLRGNPQFQRQPQWDDP VVCIALVTNAWVFLLLYIVPELCILYRSCRQECPLQGNACPVTAYQHSFQVENQELSRDK WKVLLNSDFLSHSGAARDSDGAEEDVALTSYGTPIQPQTVDPTQECFIPQAKLSPQQDAG GV >gi568815586r:12840778_13050384|GENSCAN_predicted_CDS_5|1089_bp atgtacaaggactgcatcgagtccactggagactattttcttctctgtgacgccgagggg ccatggggcatcattctggagtccctggccatacttggcatcgtggtcacaattctgcta ctcttagcatttctcttcctcatgcgaaagatccaagactgcagccagtggaatgtcctc cccacccagctcctcttcctcctgagtgtcctggggctcttcggactcgcttttgccttc atcatcgagctcaatcaacaaactgcccccgtacgctactttctctttggggttctcttt gctctctgtttctcatgcctcttagctcatgcctccaatctagtgaagctggttcggggt tgtgtctccttctcctggacgacaattctgtgcattgctattggttgcagtctgttgcaa atcattattgccactgagtatgtgactctcatcatgaccagaggtatgatgtttgtgaat atgacaccctgccagctcaatgtggactttgttgtactcctggtctatgtcctcttcctg atggccctcacattcttcgtctccaaagccaccttctgtggcccgtgtgagaactggaag cagcatggaaggctcatctttatcactgtgctcttctccatcatcatctgggtggtgtgg atctccatgctcctgagaggcaacccgcagttccagcgacagccccagtgggacgacccg gtcgtctgcattgctctggtcaccaacgcatgggttttcctgctgctgtacatcgtccct gagctctgcattctctacagatcgtgtagacaggagtgccctttacaaggcaatgcctgc cccgtcacagcctaccaacacagcttccaagtggagaaccaggagctctccagagataaa tggaaggtcttactcaactcggacttcctatcacacagtggtgcagcccgagacagtgat ggagctgaggaggatgtagcattaacttcatatggtactcccattcagccgcagactgtt gatcccacacaagagtgtttcatcccacaggctaaactaagcccccagcaagatgcagga ggagtataa >gi568815586r:12840778_13050384|GENSCAN_predicted_peptide_6|183_aa MASKETAESCRAGAVSVAGNEEVAYEERACEGGKFATVEVTDKPVDEALREAMPKVAKYA GGTNDKGIGMGMTVPISFAVFPNEDGSLQKKLKVWFRIPNQFQSDPPAPSDKSVKIEERE GITVYSMQFGGYAKEADYVAQATRLRAALEGTATYRGDIYFCTGYDPPMKPYGRRNEIWL LKT >gi568815586r:12840778_13050384|GENSCAN_predicted_CDS_6|552_bp atggcatcaaaggaaacagcagaaagctgcagagcaggtgctgtgagtgtggctggcaac gaagaagttgcctatgaagaaagggcctgtgaaggcggcaaatttgccacagtagaagtg acagataagcctgtggatgaggctctacgggaagcaatgcccaaggtcgcaaagtatgcg gggggcaccaatgacaagggaattgggatggggatgacagtccctatttcctttgctgtg ttccccaatgaagatggctctctgcagaagaaattaaaagtctggttccggattccaaac caatttcaaagcgacccaccagctcccagtgacaaaagcgttaagattgaggaacgggaa ggcatcactgtctattccatgcagtttggtggttatgccaaggaagcagactacgtagca caagccacccgtctgcgtgctgccctggagggcacagccacctaccggggggacatctac ttctgcacgggttatgaccctcccatgaagccctacggacggcgcaatgagatctggctg ttgaagacatga >gi568815586r:12840778_13050384|GENSCAN_predicted_peptide_7|224_aa MDVNSSGHPDLYGRLCSFLLPEKLRQPSNYLIVSMALANLSVAMAVMPFISVTDLIGGKW IFGHFFCNVFSMNVMCCTAWILTLYVISIDRFTRPPGKARPNTGYLASLEWSQTAVVTLN GTVKFQEPDPSVYGTACSCIPLWVERIFPWLGYANSLINPFIYAFFNWDLRTTYCSRLQC QYQNINQTLSAAGMHEALKLAERPERPEFVLQNSDYCRKKSHDS >gi568815586r:12840778_13050384|GENSCAN_predicted_CDS_7|675_bp atggatgttaacagcagcggccacccggacctctacgggcgcctctgctctttcctcctg ccggagaagctccgccagccctccaactacctgatcgtgtccatggcgctggccaacctc tcggtggccatggcggtcatgcccttcatcagtgtcaccgacctcatcgggggcaagtgg atctttggacactttttctgtaacgtcttctccatgaatgtcatgtgctgcacggcctgg atcttgaccttgtacgtgatcagcatcgacagatttacaaggccgccaggaaaagcgcgg cctaacacaggttacctggcttccctcgagtggagccagacagcagtagtcaccctgaat ggcacagtgaagttccaggagccagacccttctgtctatggcactgcctgcagctgcatc ccactgtgggtggagaggatatttccatggctgggctatgcaaactctctcattaaccct tttatttatgccttcttcaactgggacctgaggaccacctattgcagccggctccagtgc cagtaccagaatatcaaccagacactctcagctgcaggcatgcatgaagccctgaagctt gctgagaggccagagagacctgagtttgtcctacaaaactctgactactgtagaaaaaaa agtcatgattcatga >gi568815586r:12840778_13050384|GENSCAN_predicted_peptide_8|166_aa MGFTRSIPDYFAVNRTTHTVMQLHIQLGKHIGIEDAYFRNVPDCCSLHYVSNDEFLNGLV LGHALGTVRAANKLHVAMAHFGTTIVPSFLCRLGGTDWRLLQLFEFWVSLYVNNFYPFFR CQLKYQFLTPRTAARNYTSRQNAGARTRTGASAMATVLSRALKLPX >gi568815586r:12840778_13050384|GENSCAN_predicted_CDS_8|498_bp atgggcttcacgaggtcaattcctgactactttgctgtgaatcgcacaacccacacagtg atgcagcttcacatacagcttgggaagcacataggcatcgaagatgcttatttcagaaat gtccctgactgttgcagcctccactatgtttcaaatgacgaatttcttaatggccttgtc cttgggcacgcattgggcacagttcgtgcagcaaataagctgcatgtggccatggcccat tttggcacgaccattgttccttcttttctttgtcgtcttggaggtacagactggagactc cttcagctctttgagttctgggtcagcctttacgtgaacaacttctacccattcttcagg tgtcagcttaagtatcagtttctcaccccacgcaccgccgctcggaactacacttcccgg cagaacgcgggcgcgcgcacgcgcaccggggcctcagccatggcgaccgtgctgtccagg gcgctcaagctgccggnn