GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:04:01 Sequence gi568815591f:6287211_6502443 : 215233 bp : 47.59% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 269 308 40 -2.46 1.01 Init + 393 525 133 2 1 78 47 57 0.418 0.90 1.02 Intr + 3221 3405 185 0 2 112 78 -4 0.041 0.51 1.03 Intr + 27215 27275 61 1 1 113 103 16 0.417 3.91 1.04 Intr + 30511 30668 158 2 2 47 56 92 0.351 1.63 1.05 Term + 33715 33750 36 0 0 74 49 36 0.137 -4.36 1.06 PlyA + 34435 34440 6 1.05 2.02 PlyA - 34469 34464 6 1.05 2.01 Sngl - 43944 43165 780 0 0 75 45 162 0.608 4.69 2.00 Prom - 49170 49131 40 -3.76 3.04 PlyA - 49769 49764 6 1.05 3.03 Term - 51468 51367 102 0 0 66 43 102 0.969 1.78 3.02 Intr - 51849 51785 65 1 2 105 79 38 0.931 3.04 3.01 Init - 61548 61359 190 0 1 43 80 509 0.878 42.77 3.00 Prom - 62012 61973 40 -8.86 4.00 Prom + 63367 63406 40 -3.36 4.01 Init + 87126 87296 171 2 0 115 76 56 0.967 5.31 4.02 Intr + 87487 87560 74 0 2 59 100 95 0.813 6.00 4.03 Intr + 97128 97241 114 0 0 98 82 35 0.911 3.46 4.04 Intr + 100002 100073 72 0 0 120 75 32 0.917 3.62 4.05 Intr + 104714 104831 118 2 1 84 70 73 0.982 5.57 4.06 Intr + 112916 112978 63 1 0 114 100 -26 0.504 0.11 4.07 Term + 114658 115236 579 0 0 70 36 275 0.586 14.99 4.08 PlyA + 115293 115298 6 -0.45 5.03 PlyA - 115397 115392 6 -0.45 5.02 Term - 115906 115800 107 2 2 61 49 93 0.651 1.27 5.01 Init - 120417 120060 358 0 1 33 29 254 0.669 11.27 5.00 Prom - 121334 121295 40 -7.86 6.23 PlyA - 121939 121934 6 1.05 6.22 Term - 122825 122627 199 1 1 94 47 224 0.887 15.77 6.21 Intr - 123170 122920 251 2 2 55 101 232 0.957 17.44 6.20 Intr - 125673 125601 73 2 1 93 121 102 0.996 13.41 6.19 Intr - 125824 125756 69 0 0 34 115 80 0.920 3.60 6.18 Intr - 129544 129417 128 1 2 127 83 119 0.954 14.78 6.17 Intr - 129711 129631 81 0 0 121 39 105 0.998 8.73 6.16 Intr - 134628 134517 112 2 1 91 86 161 0.963 16.58 6.15 Intr - 137106 136991 116 0 2 54 66 48 0.280 -1.55 6.14 Intr - 137625 137542 84 0 0 119 97 52 0.816 9.22 6.13 Intr - 138904 138778 127 0 1 91 70 154 0.897 14.58 6.12 Intr - 143397 143270 128 0 2 108 68 125 0.996 11.98 6.11 Intr - 145749 145627 123 0 0 71 99 117 0.989 11.88 6.10 Intr - 147810 147552 259 2 1 113 63 81 0.724 5.57 6.09 Intr - 149323 149152 172 2 1 67 61 185 0.959 12.70 6.08 Intr - 154850 154722 129 0 0 64 45 76 0.582 1.57 6.07 Intr - 158894 158743 152 1 2 122 83 120 0.996 14.71 6.06 Intr - 160774 160538 237 0 0 50 79 346 0.115 26.63 6.05 Intr - 174029 173846 184 0 1 53 75 49 0.061 -1.05 6.04 Intr - 179113 178861 253 1 1 66 95 293 0.726 24.81 6.03 Intr - 182544 182386 159 0 0 99 88 117 0.775 12.98 6.02 Intr - 187074 186974 101 1 2 143 86 44 0.906 9.53 6.01 Init - 196847 196757 91 2 1 107 105 218 0.905 25.25 6.00 Prom - 200631 200592 40 -5.46 7.06 PlyA - 201620 201615 6 1.05 7.05 Term - 203892 203508 385 2 1 57 42 176 0.016 4.26 7.04 Intr - 210635 210568 68 2 2 109 21 66 0.062 -0.30 7.03 Intr - 211052 210854 199 1 1 -26 83 287 0.414 16.15 7.02 Intr - 214689 214527 163 1 1 101 -11 150 0.355 5.43 7.01 Init - 214932 214779 154 0 1 39 91 141 0.593 9.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 203891 203508 384 2 0 76 42 178 0.856 8.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:6287211_6502443|GENSCAN_predicted_peptide_1|190_aa MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQGQKTKHRMFSLIVSCLTHPVNTKVEMGQ DVHRLEGSPSGDSGRWILAVPAPQRLGQLLVASRKPTDRTARPHTQPLAAIVLLTVSPEL SLLGTSLWKSTLRFQMEENRLKMKALKFYCEAGRGVPFPAHLQGTSQAWQENSHFLPMTE NENDCFSGLF >gi568815591f:6287211_6502443|GENSCAN_predicted_CDS_1|573_bp atggaatactatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aaactggaaaccatcattctcagcaaactatcgcaaggacaaaaaaccaaacaccgcatg ttctcactcatagtgtcatgtctgacacatcctgttaacactaaagttgaaatgggacag gatgtccacagacttgaaggctctccaagtggtgactcggggaggtggattcttgctgta cctgccccacagaggctgggacagctcctggttgcttccagaaaaccgacagacagaact gccaggccacatactcagcccctggcagccatcgttctacttactgtctctccggagctg tctcttctaggaacctcattgtggaaaagcacgctgcgcttccaaatggaagaaaatcgc cttaaaatgaaggcgttgaagttctattgtgaagcaggacgaggagtacccttcccagct catctccaggggacctcacaggcctggcaggaaaacagccacttcctgcccatgactgaa aatgaaaacgactgcttttcaggactgttctga >gi568815591f:6287211_6502443|GENSCAN_predicted_peptide_2|259_aa MRDRRGPLGTCLAQVQQAGGGDSDKLSCSLKKRMPEGPWPADAPSWMNKPVVDGNSQSEA LSLEMRKDPSGAGLWLHSGGPVLPYVRESVRRNPASAATPSTAVGLFPAPTECFARVSCS GVEALGRRDWLGGGPRATDGHRGQCPKGEPRVSRLPRHQKVPEMGSFQDDPPSAFPKGLG SELEPACLHSILSATLHVYPEVLLSEETKRIFLDRLKPMFSKQTIEFKKMLKSTSDGLQI TLGLLALQPFELANTLCHS >gi568815591f:6287211_6502443|GENSCAN_predicted_CDS_2|780_bp atgagggacagaagagggcccctcggcacctgcctggcacaagtgcagcaggccggagga ggtgactcggacaaactatcatgcagccttaagaaaagaatgccggagggcccttggcct gcagatgcaccctcctggatgaataagcctgtggttgatggaaattcacaaagtgaggca ttatcactggaaatgagaaaggatccgagcggggctggcctctggcttcacagtggcggc ccagtgcttccatatgtgagagaatcagtaagaagaaatccagcctcagcagccactccg agcacagccgtgggtttgttccctgctccaacagagtgttttgctcgggtgtcctgcagt ggtgttgaagctctggggcggcgagactggctgggaggagggcccagggccactgacggc cacagaggacagtgccccaaaggagagcctcgggtgtcacgactgccacgccatcaaaaa gtgccggaaatgggaagttttcaggatgacccaccaagtgcttttcccaagggtctgggc tctgagttggaacccgcttgcctgcactccatcctgtctgcaacgctgcacgtgtatccc gaagtgctcctgagtgaggagacaaaacgcattttccttgaccgtttaaagcccatgttt tcaaagcaaacaatagaattcaagaaaatgcttaaaagcacctcagatggtctgcagata acactggggttactggctctgcaaccttttgaattagcaaatacattatgccatagttaa >gi568815591f:6287211_6502443|GENSCAN_predicted_peptide_3|118_aa MAAALSGLAVRLSRSAAARSYGVFCKGLTRTLLIFFDLAWRLRINFPYLYIVASMMLNVR LQVGCVEIRGFVSATSSLIIWAMSTVSWAGICEEGDRSCCGEDRVYLDSLVSPGTAPL >gi568815591f:6287211_6502443|GENSCAN_predicted_CDS_3|357_bp atggcggcggctctgtcgggcctggctgtccggctctcgcgctcggccgccgcccgctcc tatggggtcttctgcaaggggctgactcgcacgctgctcatcttcttcgacctggcctgg cggctgcgtatcaacttcccctacctctacatcgtggcttccatgatgctcaacgtccgc ctgcaggtgggctgtgtggaaatccgggggtttgtttctgctacctccagtctgatcatt tgggccatgagcacagtcagctgggcagggatctgcgaggaaggtgaccgctcctgttgt ggtgaggatcgggtatacctggacagcctcgtttcccctggcacggcgcccctgtga >gi568815591f:6287211_6502443|GENSCAN_predicted_peptide_4|396_aa MAAATPRGRSRVGPSASEHSRSPEKLRERRGRERRGRERPRPEPGSGGAGAGEGRPDRPA AAAAAQRAALMQAIKCVVVGDGLPATLQLEASSKIQCFLGNLKPVTFQVGSSPHVLNKKR AVGKTCLLISYTTNAFPGEYIPTVFDNYSANVMVDGKPVNLGLWDTAGQEDYDRLRPLSY PQTDVFLICFSLVSPASFENVRAKWYPEVRHHCPNTPIILVGTKLDLRDDKDTIEKLKEK KLTPITYPQGLAMAKEIGMESCVFPPPCTSFIVVTETGVQSGKGGCVSPTQGLVYSWGTS WQGPVGLNVSVGRWKQGWEPAEGARAPGAASRWWCDQKRVGSSVHCRVVVFPVGAVKYLE CSALTQRGLKTVFDEAIRAVLCPPPVKKRKRKCLLL >gi568815591f:6287211_6502443|GENSCAN_predicted_CDS_4|1191_bp atggcggctgccacgccccgcggccggagccgagtgggcccgagcgcttccgagcattcc cgaagtccagagaaactccgggagcggcgcgggcgggagcggcgcgggcgggagcggccc cgcccggaacctgggagcggcggcgccggcgcgggggaggggaggccggatcgccctgcc gccgccgccgcggcccagcgagcggccctgatgcaggccatcaagtgtgtggtggtggga gacgggcttcctgccacactgcagttggaggcttcttctaaaatccagtgctttctgggc aatttaaaacctgtgactttccaagtaggatctagtccccatgtccttaacaagaagaga gctgtaggtaaaacttgcctactgatcagttacacaaccaatgcatttcctggagaatat atccctactgtctttgacaattattctgccaatgttatggtagatggaaaaccggtgaat ctgggcttatgggatacagctggacaagaagattatgacagattacgccccctatcctat ccgcaaacagatgtgttcttaatttgcttttcccttgtgagtcctgcatcatttgaaaat gtccgtgcaaagtggtatcctgaggtgcggcaccactgtcccaacactcccatcatccta gtgggaactaaacttgatcttagggatgataaagacacgatcgagaaactgaaggagaag aagctgactcccatcacctatccgcagggtctagccatggctaaggagattggtatggaa tcctgtgtttttcctcctccttgtacctcttttattgtagtgacagagactggagtccag tctgggaaaggagggtgtgtgtctcccactcagggcctggtgtactcttggggaaccagc tggcaaggccctgtgggtcttaacgtcagcgttggaaggtggaagcagggctgggagccg gcagaaggcgcccgggccccaggagctgcctcccgctggtggtgtgatcagaagagagtg gggtcgagtgtacattgccgtgtggtcgtgtttcctgtaggtgctgtaaaatacctggag tgctcggcgctcacacagcgaggcctcaagacagtgtttgacgaagcgatccgagcagtc ctctgcccgcctcccgtgaagaagaggaagagaaaatgcctgctgttgtaa >gi568815591f:6287211_6502443|GENSCAN_predicted_peptide_5|154_aa MLWLGTQRKRCDSELTLKAHSGESGKELLKMKSIRQNQKLKDNKPGKEKGYEDTPGLTSE FESHEFPRRKEAIKRTARWIKETLGCGREKDNLTRKKYIPTEMAKKNHGRTALRILKGLA KQNSSLEEIAVLFKISAKAGQKTRYEITKTDPKS >gi568815591f:6287211_6502443|GENSCAN_predicted_CDS_5|465_bp atgctatggcttgggactcagagaaagcgctgtgactcagaactgaccctgaaggcccac agtggggagagtggaaaagagctcctgaagatgaaaagcatcagacaaaatcaaaagctc aaagataacaaaccaggaaaagaaaaaggttacgaggacactccaggtctgacatctgag tttgaaagtcacgagtttccaagacggaaagaggccatcaagcgcacagcacggtggata aaagaaaccctaggatgtggaagagaaaaggacaatctcacaagaaaaaaatacatccca acagaaatggctaagaaaaatcatggaagaactgccctcagaattctgaagggactagct aaacagaattcttcattagaggaaatagcagttctgttcaaaatctccgcaaaagctggt cagaaaactcgctatgaaatcacaaagactgatccaaagagctga >gi568815591f:6287211_6502443|GENSCAN_predicted_peptide_6|1075_aa MNIFRLTGDLSHLAAIVILLLKIWKTRSCAGISGKSQLLFALVFTTRYLDLFTSFISLYN TSMKVIYLACSYATVYLIYLKFKATYDGNHDTFRVEFLVVPVGGLSFLVNHDFSPLEILW TFSIYLESVAILPQLFMISKTGEAETITTHYLFFLGLYRALYLVNWIWRFYFEGFFDLIA VVAGVVQTILYCDFFYLYITKGGHLSKGACEGVWDWASSSPALETKALYATPYEALFHSY FHFPNKGYQVINIFIEVFVCFSIVSMRRSPPSLRLRLSADNLVAASGGCWFVLGERRAGS LLSASYGTFAMPGMVLFGRRWAIASDDLVFPGFFELVVRVLWWIGILTLYLMHRGKLDCA GGALLSSYLIVLMILLAVVICTVSAIMCVSMRGRIQKPFIVALLAEAVSPLTAQGKLTYR AGESWEKHKEMEACLGTICNPGPRKSMSKLLYIRLALFFPEMVWASLGAAWVADGVQCDR TVVNGIIATVVVSWIIIAATVVSIIIVFDPLGGKMAPYSSAGPSHLDSHDSSQLLNGLKT AATSVWETRIKLLCCCIGKDDHTRVAFSSTAELFSTYFSDTDLVPSDIAAGLALLHQQQD NIRNNQEPAQVVCHAPGSSQEADLDAELENCHHYMQFAAAAYGWPLYIYRNPLTGLCRIG GDCCRSRTTDYDLVGGDQLNCHFGSILHTTGLQYRDFIHVSFHDKVYELPFLVALDHRKE SVVVAVRGTMSLQALVACRSPGEVRTADIHPQPMAPLLSSSVHTPGPLGDARLTAASATP FLQDVLTDLSAESEVLDVECEVQDRLAHKGISQAARYVYQRLINDGILSQAFSIAPEYRL VIVGHSLGGGAAALLATMLRAAYPQVRCYAFSPPRGLWSKALQEYSQSFIVSLVLGKDVI PRLSVTNLEDLKRRILRVVAHCNKPKYKILLHGLWYELFGGNPNNLPTELDGGDQEVLTQ PLLGEQSLLTRWSPAYSFSSDSPLDSSPKYPPLYPPGRIIHLQEEGASGRFGCCSAAHYS AKWSHEAEFSKILIGPKMLTDHMPDILMRALDSVVSDRAACVSCPAQGVSSVDVA >gi568815591f:6287211_6502443|GENSCAN_predicted_CDS_6|3228_bp atgaacattttccggctgactggggacctgtcccacctggcggccatcgtcatcctgctg ctgaagatctggaagacgcgctcctgcgccggtatttctgggaaaagccagcttctgttt gcactggtcttcacaactcgttacctggatctttttacttcatttatttcattgtataac acatctatgaaggttatctaccttgcctgctcctatgccacagtgtacctgatctacctg aaatttaaggcaacctacgatggaaatcatgataccttccgagtggagtttctggtggtc cctgtgggaggcctctcatttttagttaatcacgatttctctcctcttgagatcctctgg accttctccatctacctggagtccgtggctatccttccgcagctatttatgatcagcaag actggggaggccgagaccatcaccacccactacctgttcttcctgggcctctatcgtgct ttgtatcttgtcaactggatctggcgcttctactttgagggcttctttgacctcattgct gtggtggccggcgtagtccagaccatcctatactgtgacttcttctacttgtacattaca aaaggaggacatcttagcaagggtgcctgcgagggagtgtgggactgggcctcatcctcg ccggcgttggaaaccaaggccttgtatgccacgccttatgaagcactgtttcacagttac tttcacttcccgaataaaggttaccaggtaattaatatttttattgaggtgtttgtgtgt ttttcaatcgtgagcatgcgcaggtccccgccctcgctgcgtttgcgcttgagcgccgat aatttggtggcggcgtccggagggtgctggtttgttctcggtgaacggcgcgcggggtct ctcctgagtgcgagctacgggaccttcgccatgccggggatggtactcttcggccggcgc tgggccatcgccagcgacgacttggtcttcccagggttcttcgagctggtcgtgcgagtg ctgtggtggattggcattctgacgttgtatctcatgcacagaggaaagctggactgtgct ggtggagccttgctcagcagttacttgatcgtcctcatgattctcctggcagttgtcata tgtactgtgtcagccatcatgtgtgtcagcatgagaggaaggatccaaaagccatttatc gtcgctctcctggctgaagctgtgtctccactcacagcgcagggcaagctgacatacagg gcaggggagagctgggagaagcacaaggagatggaggcttgtctcggaacgatttgtaac cctggaccgcggaagtctatgtctaagctgctttacatccgcctggcgctgttttttcca gagatggtctgggcctctctgggggctgcctgggtggcagatggtgttcagtgcgacagg acagttgtaaacggcatcatcgcaaccgtcgtggtcagttggatcatcatcgctgccaca gtggtttccattatcattgtctttgaccctcttggggggaaaatggctccatattcctct gccggccccagccacctggatagtcatgattcaagccagttacttaatggcctcaagaca gcagctacaagcgtgtgggaaaccagaatcaagctcttgtgctgttgcattgggaaagac gaccatactcgggttgctttttcgagtacggcagagcttttctcaacctacttttcagac acagatctggtgcccagcgacattgcggcgggcctcgccctgcttcatcagcaacaggac aatatcaggaacaaccaagagcctgcccaggtggtctgccatgccccagggagctcccag gaagctgatctggatgcagaattagaaaactgccatcattacatgcagtttgcagcagcg gcctatgggtggcccctctacatctacagaaaccccctcacggggctgtgcaggattggt ggtgactgctgcagaagcagaaccacagactatgacttggtcggaggcgatcagctcaac tgtcacttcggctccatcctgcacaccacagggctgcagtacagggacttcatccacgtc agcttccatgacaaggtttacgagctgccgtttttagtggctctggatcacaggaaagag tctgttgtggtcgctgtgagggggaccatgtctctgcaggccctcgtggcctgccgttct ccaggtgaggtcaggactgccgacatccatccccagcccatggcccctcttctgtcttcc tcagtccatactcctgggccccttggagatgccaggctgacggccgcttctgcaacgccc tttttacaggatgtccttacggacctgtcagcggagagtgaggtgctggacgtggagtgt gaggtgcaggaccgcctggcacacaagggtatttctcaagctgccagatacgtttaccaa cgactcatcaacgacgggattttgagccaagccttcagcattgctcctgagtaccggctg gtcatagtgggccacagcctcgggggcggggcggccgccctgctggccaccatgctcaga gccgcctacccgcaggtcaggtgctacgccttctccccaccccgggggctgtggagcaaa gctctgcaggaatattctcagagcttcatcgtgtcactcgtcctggggaaggatgtgatt cccaggctcagtgtgaccaacttggaagatctgaagagaagaatcttgcgagtggtcgcg cactgcaataaacccaagtacaagatcttgctgcacggtttgtggtacgaactgtttgga ggaaaccccaacaacttgcccacggagctggacgggggcgaccaggaagtcctgacacag cctcttctgggggagcagagcctactgacgcgctggtccccggcctacagcttctccagc gactccccactggactcttctcccaagtacccccctctctaccctcccggcaggatcatc cacctgcaggaggagggcgcctcggggcggtttggctgctgctctgctgctcactatagc gccaagtggtcacacgaagcggaattcagcaaaatactcataggtccgaagatgctcacc gaccacatgccagacatcctgatgcgggccttggacagcgtggtctccgacagagcggcc tgcgtctcctgtccagcacaaggggtctccagtgtggacgtggcctga >gi568815591f:6287211_6502443|GENSCAN_predicted_peptide_7|322_aa MYIGGHLQLNSTKTVDGKSTFLHILAKSLSQHFPELLGFAQDLPTVPLAAKVNQRALTSD LADLHGTISEIQDACQSISPSSEDKFAMVMSASSEQPVPAPGWILGGLAVSLTEACQSFL ETAQPALRALDGLQREAMEELGKALAFFGEDSKATTSEAFFGIFAEFMSKFERALSDLQA GEGLRSSGMVSPLAWMASAKKGGEKKGRSAISEAVTPEYTSAWVEWASRSAPRAHREIQK FAMKEMGTPNLHIDVRLNKALWAKGIRNVPYHIHMKLPRKLNEDEDSPDKLYALVPTYTC YHFHKSIDRQCGRELTTDGSIH >gi568815591f:6287211_6502443|GENSCAN_predicted_CDS_7|969_bp atgtatattgggggtcacttgcagctgaactccaccaagacagtggatgggaagtccacc ttcctgcacatccttgccaaatcgctgagccagcacttccctgaactcctgggctttgct caggacctgcccaccgtgcccctggctgccaaagtgaaccaacgggccctgaccagtgac ctggctgacctccatggcactatcagcgagatacaggatgcctgccagagcatttccccc tctagcgaggacaagtttgcaatggtcatgtcggcatcctctgagcagcctgttcctgca ccaggctggatcctgggcgggctggctgtttccttaactgaggcctgtcagtccttcctg gagacggcccagccagcacttcgggcgctcgacgggctgcagcgcgaggccatggaggag ctgggcaaggcgctggccttctttggggaggattccaaggccaccacctctgaggctttc ttcggcatctttgcagagttcatgagcaaattcgagcgagcgctgagtgacctgcaggcc ggggagggcctgcgcagctccgggatggtttcacccctggcctggatggcttctgcaaag aagggtggcgagaagaaaggtcgttccgccattagtgaggcggttaccccagaatacacc agcgcatgggtggagtgggcttcaagaagtgcccctcgggcacacagagagatccagaaa ttcgccatgaaggagatggggactccaaatttgcacattgatgtgaggctcaacaaagct ctctgggccaaaggaataaggaatgtcccataccatatccatatgaagttgcccagaaaa cttaatgaggatgaagattcaccagacaagctctatgctttggttcctacatatacctgt taccactttcacaaatctatagacaggcaatgtggaagagagctaaccactgatggttca atacattaa