GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:15:54 Sequence gi568815597r:211563429_211775479 : 212051 bp : 44.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 3921 3916 6 1.05 1.03 Term - 12472 11960 513 1 0 28 47 181 0.545 2.34 1.02 Intr - 14395 14295 101 2 2 96 43 35 0.326 -0.37 1.01 Init - 15184 14563 622 1 1 110 99 1171 0.859 113.91 1.00 Prom - 31269 31230 40 -5.06 2.03 PlyA - 36106 36101 6 1.05 2.02 Term - 42912 42680 233 1 2 65 41 134 0.707 3.04 2.01 Init - 46075 46045 31 1 1 54 113 46 0.494 3.61 2.00 Prom - 49283 49244 40 -2.86 3.00 Prom + 54862 54901 40 -5.46 3.01 Init + 55228 55290 63 0 0 87 57 26 0.426 0.55 3.02 Term + 57875 58177 303 1 0 68 48 198 0.508 8.97 3.03 PlyA + 59049 59054 6 1.05 4.00 Prom + 62456 62495 40 -2.26 4.01 Init + 79898 80057 160 1 1 55 89 78 0.338 4.62 4.02 Intr + 84269 84301 33 0 0 84 94 19 0.365 0.29 4.03 Intr + 84381 84600 220 1 1 33 52 139 0.298 2.16 4.04 Intr + 87227 87387 161 2 2 43 97 63 0.269 2.33 4.05 Term + 87877 88004 128 2 2 73 38 47 0.131 -3.36 4.06 PlyA + 92785 92790 6 1.05 5.03 PlyA - 93048 93043 6 1.05 5.02 Term - 97626 97040 587 1 2 -29 48 694 0.973 48.38 5.01 Init - 97824 97635 190 0 1 56 1 212 0.972 8.47 5.00 Prom - 98394 98355 40 -12.01 6.10 PlyA - 99371 99366 6 1.05 6.09 Term - 100224 99998 227 1 2 72 42 173 0.513 8.04 6.08 Intr - 103803 103678 126 1 0 75 80 133 0.951 11.85 6.07 Intr - 105904 105685 220 1 1 87 94 147 0.999 13.07 6.06 Intr - 106979 106853 127 1 1 91 98 106 0.999 12.58 6.05 Intr - 107856 107774 83 0 2 99 101 39 0.999 4.74 6.04 Intr - 110295 110055 241 2 1 46 95 190 0.682 13.05 6.03 Intr - 111085 110868 218 1 2 56 121 242 0.999 21.60 6.02 Intr - 112135 111956 180 1 0 69 105 227 0.967 22.56 6.01 Init - 118710 118567 144 0 0 100 102 57 0.589 8.50 6.00 Prom - 132556 132517 40 -5.86 7.11 PlyA - 132819 132814 6 1.05 7.10 Term - 133231 133139 93 1 0 117 54 62 0.815 3.53 7.09 Intr - 143854 143737 118 0 1 63 37 83 0.098 1.17 7.08 Intr - 149946 149889 58 1 1 126 103 18 0.123 5.04 7.07 Intr - 156687 156557 131 2 2 47 78 18 0.004 -2.96 7.06 Intr - 168039 168001 39 2 0 108 53 65 0.037 2.34 7.05 Intr - 174277 173993 285 0 0 84 76 91 0.243 3.96 7.04 Intr - 185393 185337 57 1 0 73 96 61 0.295 3.40 7.03 Intr - 186622 186511 112 2 1 100 32 35 0.340 -1.46 7.02 Intr - 187639 187533 107 0 2 65 86 95 0.892 6.86 7.01 Init - 207531 207524 8 0 2 102 91 12 0.168 2.55 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:211563429_211775479|GENSCAN_predicted_peptide_1|411_aa MGCWGRNRGRLLCMLALTFMFMVLEVVVSRVTSSLAMLSDSFHMLSDVLALVVALVAERF ARRTHATQKNTFGWIRAEVMGALVNAIFLTGLCFAILLEAIERFIEPHEMQQPLVVLGVG VAGLLVNVLGLCLFHHHSGFSQDSGHGHSHGGHGHGHGLPKGPRVKSTRPGSSDINVAPG EQGPDQEETNTLVANTSNSNGLKLDPAGNQNPLSRVQGAGDASGIFVGGGCGSDPKKKGQ EESALILLQTVPKQIDIRNLIKELRNVEGVEEVHELHVWQLAGSRIIATAHIKCEDPTSY MEVAKTIKDVFHNHGIHATTIQPEFASVGSKSSVVPCELACRTQCALKQCCGTLPQAPSG KDAEKTPAVSISCLELSNNLEKKPRRTKAENIPAVVIEIKNMPNKQPESSL >gi568815597r:211563429_211775479|GENSCAN_predicted_CDS_1|1236_bp atggggtgttggggtcggaaccggggccggctgctgtgcatgctggcgctgaccttcatg ttcatggtgctggaggtggtggtgagccgggtgacctcgtcgctggcgatgctctccgac tccttccacatgctgtcggacgtgctggcgctggtggtggcgctggtggccgagcgcttc gcccggcggacccacgccacccagaagaacacgttcggctggatccgagccgaggtaatg ggggctctggtgaacgccatcttcctgactggcctctgtttcgccatcctgctggaggcc atcgagcgcttcatcgagccgcacgagatgcagcagccgctggtggtccttggggtcggc gtggccgggctgctggtcaacgtgctggggctctgcctcttccaccatcacagcggcttc agccaggactccggccacggccactcgcacgggggtcacggccacggccacggcctcccc aaggggcctcgcgttaagagcacccgccccgggagcagcgacatcaacgtggccccgggc gagcagggtcccgaccaggaggagaccaacaccctggtggccaataccagcaactccaac gggctgaaattggaccccgcaggaaatcagaaccccctcagtcgggtgcaaggagccggc gacgcgtctggcatatttgtaggtgggggctgtgggtcagaccctaagaaaaagggtcag gaggaatctgctcttattcttctacaaactgttcctaaacaaattgatatcagaaatttg ataaaagaacttcgaaatgttgaaggagttgaggaagttcatgaattacatgtttggcaa cttgctggaagcagaatcattgccactgctcacataaaatgtgaagatccaacatcatac atggaggtggctaaaaccattaaagacgtttttcataatcacggaattcacgctactacc attcagcctgaatttgctagtgtaggctctaaatcaagtgtagttccgtgtgaacttgcc tgcagaacccagtgtgctttgaagcaatgttgtgggacactaccacaagccccttctgga aaggatgcagaaaagaccccagcagttagcatttcttgtttagaacttagtaacaatcta gagaagaagcccaggaggactaaagctgaaaacatccctgctgttgtgatagagattaaa aacatgccaaacaaacaacctgaatcatctttgtga >gi568815597r:211563429_211775479|GENSCAN_predicted_peptide_2|87_aa MEKHEEDLGQAQKDSPEKCHLLIPFAYQEPPLWTQEQGLCPLKHVTAEDGRLHLNMTRGL LGRKNGDGSCRGNEQLPPHMPYKACMG >gi568815597r:211563429_211775479|GENSCAN_predicted_CDS_2|264_bp atggaaaagcatgaagaggacctggggcaagcccaaaaagattccccggagaaatgccat ttactaatacccttcgcctatcaggaaccacccctgtggacccaggaacagggtctgtgt cccttgaagcacgtgacagcagaggacgggaggctccacctgaacatgaccaggggtctc ctggggaggaaaaatggagatggaagctgcagaggaaacgaacagctgcccccacacatg ccgtacaaggcctgcatgggatga >gi568815597r:211563429_211775479|GENSCAN_predicted_peptide_3|121_aa MTPRCGLVVFSSDGRENKIREGHSPSLGSPMSGPIRPLFLPDCVLAKSCHREPVGGGSDS FFMHPGGSQFRFWIPGHDFPELSIYTSSPALLHKSMAKRGLMTSQRRASPPPGKQMGEPM L >gi568815597r:211563429_211775479|GENSCAN_predicted_CDS_3|366_bp atgactccaagatgtggcctagtagtattttcatctgacggacgagagaacaagattaga gagggccacagccctagcctcggatcccccatgtctgggcccatccgcccattgttcctg cctgactgtgttcttgccaagagctgccacagggagccagtcggaggaggttccgatagc ttcttcatgcatcctggaggaagccagttccgcttctggattcccggccacgacttccca gaactcagcatctacacctccagcccagccctgctgcacaagagcatggctaagagagga ctgatgacttctcagagaagggcctccccacctcctgggaagcaaatgggagaacctatg ctctga >gi568815597r:211563429_211775479|GENSCAN_predicted_peptide_4|233_aa MNAGCPVASPTRPPCWKAFLTISLINKAKVKAAAHGESRESLWEVHCPAMPLNDEGVEIK GTVPEGSCPCSPKTVSGQTQLGVCCLAWCSWLGVCCLDSSRFKGSQTFLGSKLPGLIKLW AMAVPRKPQNPIKGVPIRNSGSPHGIAQGSGGMGPQLLFAQLSALGDCLVHELGWPSLSL AILGTSAVDILVSWAWRGALIPFSFSLDILNIFATRQWATDELHQYPQSLLEM >gi568815597r:211563429_211775479|GENSCAN_predicted_CDS_4|702_bp atgaatgccggctgtcctgtggcctccccgacacgtcccccctgctggaaggctttcctc accatctccctcataaacaaagccaaggtcaaggctgcagcccatggtgagagcagggag agcttgtgggaagttcactgccccgccatgcccctgaatgatgaaggagtagaaattaag ggcacagtgccagaaggaagctgcccgtgctctccaaagacagtgtctggtcagacccag cttggtgtctgctgcctggcttggtgctcctggcttggtgtctgctgcctggacagttcc agattcaagggcagccaaacatttctgggcagcaagctccctggcctgatcaagctctgg gccatggctgtgccccgcaaaccccagaacccaataaaaggagtccccattaggaactct ggaagtcctcatggcattgctcaagggagtggtgggatgggtccccagctgctctttgca cagctctctgctctaggtgactgtctggtccatgagctgggctggcccagtctgtctctg gcaatcctggggacatctgctgttgacatccttgtttcctgggcctggaggggagctcta atccctttcagttttagccttgatatccttaacatctttgccactagacagtgggccacg gacgagctgcatcagtatccccagagcttattagaaatgtag >gi568815597r:211563429_211775479|GENSCAN_predicted_peptide_5|258_aa MNGDQKSDAYAQEKQDFIQHFSRIVRVLTEDEMGHPETGDAIAQLKEVLEYNAIGGKLSP GFDAFRELVEPRKQDADSFQRALMVGWCVELLQAFFLVVDDITDSSLTHWGQICWYQKLG VGLDAVNDAMLLEACIYCLLKLCCWEQPYYLNLIELFLQSSYQTEIGQTLDPITAPPGQC GSSQIQGKELQIVKYKTVFYSFYLPIAAAMHMAGIDGEKAHTNAKKILLEMGEFFKIQDD HFDLFGDSTVTDKVGTAI >gi568815597r:211563429_211775479|GENSCAN_predicted_CDS_5|777_bp atgaatggagatcaaaaatcagatgcttatgcccaagaaaaacaggatttcattcagcac ttctcccggattgttagggtgctgactgaggatgagatggggcacccagagacaggagat gctatcgcccagctcaaggaggtcctggagtacaatgccatcggaggcaagttatcaccg gggtttgacgcattccgggagctggtggagccaaggaaacaggatgctgatagtttccag cgggccctgatggtgggttggtgtgtggaactgctgcaagctttcttcctcgtggtagat gacatcacggattcatccctcacccactggggacagatctgctggtatcagaagctgggc gtgggtttggatgctgtcaatgatgctatgcttctggaagcatgtatctactgcctgctg aagctctgttgttgggagcagccctattacctgaacctgattgagctcttcctgcagagt tcctatcagactgagattgggcagaccctggaccccatcacagcacccccagggcaatgt ggatcttcgcagattcaaggaaaagaattacaaattgtcaagtacaagacagttttctac tctttctaccttcctatagctgcagccatgcatatggcaggcattgatggtgagaaggca cacaccaatgccaagaagatcctgctggagatgggagagttctttaagattcaggatgat cactttgacctctttggggactccactgtgactgacaaagttggcactgccatctag >gi568815597r:211563429_211775479|GENSCAN_predicted_peptide_6|521_aa MAPLHSSLGDRVSKQTNKHTKKTELEEPGNLELKLTAPTITVLPPMEKVPGVPGPWSSAL GGATCVRQRDSGDWPAMPSRAEDYEVLYTIGTGSYGRCQKIRRKSDGKILVWKELDYGSM TEAEKQMLVSEVNLLRELKHPNIVRYYDRIIDRTNTTLYIVMEYCEGGDLASVITKGTKE RQYLDEEFVLRVMTQLTLALKECHRRSDGGHTVLHRDLKPANVFLDGKQNVKLGDFGLAR ILNHDTSFAKTFVGTPYYMSPEQMNRMSYNEKSDIWSLGCLLYELCALMPPFTAFSQKEL AGKIREGKFRRIPYRYSDELNEIITRMLNLKDYHRPSVEEILENPLIADLVADEQRRNLE RRGRQLGEPEKSQDSSPVLSELKLKEIQLQERERALKAREERLEQKEQELCVRERLAEDK LARAENLLKNYSLLKERKFLSLASNPELLNLPSSVIKKKVHFSGESKENIMRSENSESQL TSKSKCKDLKKRLHAAQLRAQALSDIEKNYQLKSRQILGMR >gi568815597r:211563429_211775479|GENSCAN_predicted_CDS_6|1566_bp atggcacccctgcactccagcctgggtgacagagtgagcaaacaaaccaacaaacacaca aagaagacagaactagaagagcctgggaacctggagctcaagctcactgctcctaccatc actgtgctgcccccgatggaaaaggtccctggagttcctggtccctggagctccgcactt ggcggcgcaacctgcgtgaggcagcgcgactctggcgactggccggccatgccttcccgg gctgaggactatgaagtgttgtacaccattggcacaggctcctacggccgctgccagaag atccggaggaagagtgatggcaagatattagtttggaaagaacttgactatggctccatg acagaagctgagaaacagatgcttgtttctgaagtgaatttgcttcgtgaactgaaacat ccaaacatcgttcgttactatgatcggattattgaccggaccaatacaacactgtacatt gtaatggaatattgtgaaggaggggatctggctagtgtaattacaaagggaaccaaggaa aggcaatacttagatgaagagtttgttcttcgagtgatgactcagttgactctggccctg aaggaatgccacagacgaagtgatggtggtcataccgtattgcatcgggatctgaaacca gccaatgttttcctggatggcaagcaaaacgtcaagcttggagactttgggctagctaga atattaaaccatgacacgagttttgcaaaaacatttgttggcacaccttattacatgtct cctgaacaaatgaatcgcatgtcctacaatgagaaatcagatatctggtcattgggctgc ttgctgtatgagttatgtgcattaatgcctccatttacagcttttagccagaaagaactc gctgggaaaatcagagaaggcaaattcaggcgaattccataccgttactctgatgaattg aatgaaattattacgaggatgttaaacttaaaggattaccatcgaccttctgttgaagaa attcttgagaaccctttaatagcagatttggttgcagacgagcaaagaagaaatcttgag agaagagggcgacaattaggagagccagaaaaatcgcaggattccagccctgtattgagt gagctgaaactgaaggaaattcagttacaggagcgagagcgagctctcaaagcaagagaa gaaagattggagcagaaagaacaggagctttgtgttcgtgagagactagcagaggacaaa ctggctagagcagaaaatctgttgaagaactacagcttgctaaaggaacggaagttcctg tctctggcaagtaatccagaacttcttaatcttccatcctcagtaattaagaagaaagtt catttcagtggggaaagtaaagagaacatcatgaggagtgagaattctgagagtcagctc acatctaagtccaagtgcaaggacctgaagaaaaggcttcacgctgcccagctgcgggct caagccctgtcagatattgagaaaaattaccaactgaaaagcagacagatcctgggcatg cgctag >gi568815597r:211563429_211775479|GENSCAN_predicted_peptide_7|335_aa MLRIFPIKDVPLETDDLTTWLYQRFVEKEDLLSHFYETGAFPPSKGHKEAVSREMTLSNL WIFLIQSFAFLSGYMWTCADSSLLNHYIKELPLSGGLSPSSKLPGRTSYVVCVTQYKMKM RLPVQRALRISSRQQKSSKTQAPHEGCPVMRTTKEEMRNGKEPGPWGWSMSPCCPSCQGT AQDVLREEEMSGDLKSKIKVSARDFSQCISKEGKETQIRKKEVKLFILRLYEENLIDSTN KLLELISPGILATVCLCEHDYSRYLINILPEGGIDPDPKRGFLDFTQERIQGESAVQSES KFIKKGWLRHEGSDSPGDTQNLTNMEESGKTLGPP >gi568815597r:211563429_211775479|GENSCAN_predicted_CDS_7|1008_bp atgctcaggatctttccaattaaagatgtacccctggagactgatgaccttaccacttgg ctctatcagcggtttgttgaaaaagaagacctcttatcacatttttatgaaacaggagct tttccaccttccaagggccataaggaagctgtttccagggagatgaccctcagcaacttg tggatatttctcatacagtcttttgcatttttgtcaggctatatgtggacctgtgcagac agtagtctgttgaatcattacatcaaggagctgcccctgtcaggaggcctctccccaagc agcaaactaccaggcaggaccagctacgttgtctgtgtgacccagtacaaaatgaaaatg cgactccctgttcaaagagcattacgaatttcaagtcggcaacagaagagcagtaaaacg caggcacctcatgaaggctgccctgtcatgagaaccacgaaggaggaaatgaggaatggg aaggagccaggcccctgggggtggtccatgtccccttgttgtccctcctgccaaggcact gcacaagatgttcttagagaggaggagatgtcaggagacctgaagtccaaaatcaaggtg tcagcaagagacttcagccaatgcataagtaaagaaggaaaagaaacacagattagaaag aaagaagttaaactgtttattctcagattatatgaagaaaatctgattgactctacaaat aagctactagagctaataagccctggcatccttgctactgtctgcctctgtgaacatgac tactctaggtacctcataaacatcttaccagaagggggtatcgatccagaccccaagaga gggttcttggatttcacgcaagaaagaattcagggcgagtccgcagtgcaaagtgaaagc aagtttattaagaaaggatggctgaggcatgagggctctgacagccctggagacacacag aacctcaccaatatggaagagtctgggaaaaccttggggccaccctga