GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:25:43 Sequence gi568815576r:31339883_31588718 : 248836 bp : 45.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1666 1661 6 -0.45 1.02 Term - 2840 2663 178 1 1 146 42 63 0.607 4.66 1.01 Init - 5720 4450 1271 2 2 111 94 1499 0.956 144.58 1.00 Prom - 24508 24469 40 -0.96 2.02 PlyA - 26542 26537 6 1.05 2.01 Sngl - 45709 45344 366 1 0 76 41 169 0.976 7.10 2.00 Prom - 48612 48573 40 -5.96 3.00 Prom + 50119 50158 40 -4.96 3.01 Init + 59802 59843 42 2 0 98 80 33 0.484 3.92 3.02 Intr + 60738 60861 124 2 1 99 76 82 0.969 8.26 3.03 Intr + 63147 63322 176 1 2 82 109 105 0.998 11.66 3.04 Intr + 71130 71199 70 2 1 95 109 47 0.944 6.25 3.05 Intr + 80374 80543 170 2 2 73 93 169 0.999 15.57 3.06 Intr + 83398 83528 131 0 2 107 84 205 0.992 21.49 3.07 Intr + 86733 86900 168 0 0 95 101 115 0.999 12.56 3.08 Intr + 87178 87300 123 1 0 70 92 91 0.939 7.40 3.09 Term + 93990 94089 100 0 1 102 47 50 0.800 -0.10 3.10 PlyA + 95870 95875 6 1.05 4.13 PlyA - 96831 96826 6 1.05 4.12 Term - 100239 99998 242 1 2 110 53 217 0.829 16.59 4.11 Intr - 100986 100822 165 1 0 120 96 10 0.669 4.93 4.10 Intr - 103212 102921 292 0 1 90 42 234 0.541 15.71 4.09 Intr - 104808 104724 85 2 1 71 61 64 0.853 1.82 4.08 Intr - 107686 107544 143 1 2 124 99 66 0.705 10.45 4.07 Intr - 114494 114262 233 0 2 73 94 195 0.717 16.09 4.06 Intr - 115433 115254 180 0 0 55 87 76 0.840 4.04 4.05 Intr - 116105 115970 136 2 1 80 98 -10 0.517 -0.66 4.04 Intr - 118768 118593 176 2 2 106 109 128 0.967 16.36 4.03 Intr - 123251 123050 202 2 1 75 86 178 0.999 15.16 4.02 Intr - 124085 123799 287 0 2 108 9 293 0.999 20.46 4.01 Init - 130699 130630 70 1 1 72 77 8 0.249 -2.69 4.00 Prom - 134142 134103 40 -0.66 5.00 Prom + 134720 134759 40 -4.56 5.01 Init + 148753 148901 149 0 2 79 98 52 0.248 4.96 5.02 Intr + 149529 149877 349 0 1 33 -12 250 0.028 4.86 5.03 Intr + 156756 156921 166 2 1 72 38 62 0.020 -0.87 5.04 Intr + 157092 157235 144 1 0 42 90 60 0.647 1.85 5.05 Term + 157895 158052 158 0 2 66 41 96 0.906 0.80 5.06 PlyA + 160420 160425 6 1.05 6.00 Prom + 167505 167544 40 -0.96 6.01 Init + 172110 172160 51 2 0 58 103 0 0.045 -0.34 6.02 Intr + 181353 181519 167 2 2 73 53 102 0.051 3.96 6.03 Intr + 188808 188981 174 0 0 54 99 149 0.960 11.65 6.04 Intr + 189616 189711 96 1 0 57 87 44 0.483 0.32 6.05 Intr + 191176 191247 72 1 0 109 67 16 0.215 0.12 6.06 Intr + 206979 207089 111 0 0 81 71 85 0.885 5.59 6.07 Intr + 210372 210520 149 0 2 80 35 234 0.779 17.18 6.08 Intr + 217060 217177 118 2 1 49 113 93 0.734 7.42 6.09 Intr + 219703 220041 339 1 0 16 4 318 0.025 10.79 6.10 Intr + 221408 221510 103 2 1 89 109 19 0.054 4.28 6.11 Intr + 233176 233332 157 0 1 126 101 81 0.849 12.78 6.12 Intr + 238500 238570 71 1 2 49 99 67 0.369 2.80 6.13 Intr + 243993 244090 98 2 2 73 91 63 0.491 3.91 6.14 Intr + 245186 245252 67 2 1 84 80 46 0.194 2.31 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 181428 181317 112 2 1 42 43 175 0.924 6.43 S.002 Init + 206604 206611 8 2 2 114 91 0 0.992 3.40 S.003 Init + 232194 232247 54 2 0 89 94 -1 0.938 1.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:31339883_31588718|GENSCAN_predicted_peptide_1|482_aa MERVNDASCGPSGCYTYQVSRHSTEMLHNLNQQRKNGGRFCDVLLRVGDESFPAHRAVLA ACSEYFESVFSAQLGDGGAADGGPADVGGATAAPGGGAGGSRELEMHTISSKVFGDILDF AYTSRIVVRLESFPELMTAAKFLLMRSVIEICQEVIKQSNVQILVPPARADIMLFRPPGT SDLGFPLDMTNGAALAANSNGIAGSMQPEEEAARAAGAAIAGQASLPVLPGVDRLPMVAG PLSPQLLTSPFPSVASSAPPLTGKRGRGRPRKANLLDSMFGSPGGLREAGILPCGLCGKV FTDANRLRQHEAQHGVTSLQLGYIDLPPPRLGENGLPISEDPDGPRKRSRTRKQVACEIC GKIFRDVYHLNRHKLSHSGEKPYSCPVCGLRFKRKDRMSYHVRSHDGSVGKPYICQSCGK GFSRPPSPTADVPPSWLLGPGSHPVFAFTLLLGFPLLVFAGSLDGGPYHTDVVLGDPALR WA >gi568815576r:31339883_31588718|GENSCAN_predicted_CDS_1|1449_bp atggagcgggtgaacgacgcttcgtgcggcccgtctggctgctacacataccaggtgagc agacacagcacggagatgctgcacaacctgaaccagcagcgcaaaaacggcgggcgcttc tgcgacgtgctcttgcgggtaggcgacgagagcttcccagcgcaccgcgccgtgctggcc gcctgcagcgagtactttgagtcggtgttcagcgcccagttgggcgacggcggagctgcg gacgggggtccggctgatgtagggggcgcgacggcagcaccaggcggcggggccgggggc agccgggagctggagatgcacactatcagctccaaggtatttggggacattctggacttc gcctacacttcccgcatcgtggtgcgcttggagagctttcccgaactcatgacggccgcc aagttcctgctgatgaggtcggttatcgagatctgccaggaagtcatcaaacagtccaac gtacagatcctggtaccccctgcccgcgccgatataatgctctttcgcccccctgggacc tcggacttgggcttccctttggacatgaccaacggggcagccttggcagccaacagcaat ggcatcgccggcagcatgcagccagaggaggaggcagctcgggcggctggtgcagccatt gcaggccaagcctctttgcctgtgttacctggggtggaccgcttgcccatggtggctgga cccctatccccccaactgctgacttccccattccccagtgtggcatccagtgcccctccc ctgactggcaagcgaggccggggccgcccaaggaaggccaacctgctggactcaatgttt gggtccccagggggcctgagggaggcaggcatccttccatgcggtctatgtggtaaggtg ttcactgatgccaaccggctccggcagcacgaggcccagcacggtgtcaccagcctccag ctgggctacatcgaccttcctcctccgaggctgggtgagaatgggctacccatctctgaa gaccccgacggcccccgaaagaggagccggaccaggaagcaggtggcttgtgagatctgc ggcaagatcttccgtgatgtgtatcatcttaaccggcacaagctgtcccactctggggag aagccctactcctgccctgtgtgtgggttgcggttcaagagaaaagaccgcatgtcctac catgtgcggtcccatgatgggtccgtgggcaagccttacatctgccagagctgtgggaaa ggcttctccaggcccccttcccccacagctgacgttcctccttcctggttgcttggtcct ggttcccaccctgtctttgccttcactctgttgcttggctttcctctcctggtgtttgcc ggctccttggatggtgggccgtatcacacagatgttgtgttgggagacccggcactgcgc tgggcctga >gi568815576r:31339883_31588718|GENSCAN_predicted_peptide_2|121_aa MDTFLDTYTFPKLKQEEIDSLNRTIMSSKMKSVINSLLTKKSPGPDEVTAEFHQMYKDGI VLFLLKLFQKIEAEGLLPNSFCEPGIILTLKPGRDTPKIENFKPISLMNIDAEILNKILA N >gi568815576r:31339883_31588718|GENSCAN_predicted_CDS_2|366_bp atggatacattcctggacacatacaccttcccaaaactgaaacaagaagaaatagattcc ctgaacagaacaataatgagctccaaaatgaaatcagtaataaatagcctactaacaaag aaaagcccaggaccagatgaagtcacagccgaattccaccagatgtacaaagacgggata gtactatttcttctgaaactatttcaaaaaattgaggcagagggactcctccccaactca ttctgtgagcccggcatcatcctgacactaaaacctggcagagacacacccaaaatagaa aacttcaagccaatatccttgatgaacatcgatgcagaaatcctcaacaaaatacttgca aattga >gi568815576r:31339883_31588718|GENSCAN_predicted_peptide_3|367_aa MSSTLAKIAEIEAEMARTQKNKATAHHLGLLKARLAKLRRELITPKGGGGGGPGEGFDVA KTGDARIGFVGFPSVGKSTLLSNLAGVYSEVAAYEFTTLTTVPGVIRYKGAKIQLLDLPG IIEGAKDGKGRGRQVIAVARTCNLILIVLDVLKPLGHKKIIENELEGFGIRLNSKPPNIG FKKKDKGGINLTATCPQSELDAETVKSILAEYKIHNADVTLRSDATADDLIDVVEGNRVY IPCIYVLNKIDQISIEELDIIYKVPHCVPISAHHRWNFDDLLEKIWDYLKLVRIYTKPKG QLPDYTSPVVLPYSRTTVEDFCMKIHKNLIKEFKYALVWGLSVKHNPQKVGKDHTLEDED VIQIVKK >gi568815576r:31339883_31588718|GENSCAN_predicted_CDS_3|1104_bp atgagcagcaccttagctaagatcgcggagatagaagcagagatggctcggactcaaaag aacaaggccacagcacaccacttagggctgcttaaggctcgtcttgctaagcttcgtcga gaactcattactccaaagggtggtggtggtggaggtccaggagaaggttttgatgtggcc aagacaggtgatgctcgaattggatttgttggttttccatctgtggggaagtcaacactg cttagtaacctggcaggggtatattctgaggtggcagcctatgaattcactactctgacc actgtgcctggtgtcatcagatacaaaggtgccaagatccagctcctggatctcccaggt atcattgaaggtgccaaggatgggaaaggtagaggtcgtcaagtcattgcagtggcccga acctgtaacttgatcttgattgttctggatgtcctgaaacctttgggacataagaagata attgaaaatgagctggaaggctttggcattcgcttgaacagcaaaccccccaacattggc tttaagaagaaggacaagggaggcattaatctcacagccacttgcccccagagtgagctg gatgctgaaactgtgaagagcattctggctgaatacaagattcataatgccgatgtgact ctacgtagtgatgctacagctgatgacctcattgatgtggtggaaggaaacagagtttat atcccctgtatctatgtgttaaataagattgaccaaatctccattgaggaattggatatc atctataaggtgcctcactgtgtacccatctctgcccatcaccgctggaattttgatgac ctattggaaaagatctgggactatctgaaactagtgagaatttacaccaaacccaaaggc cagttaccagattacacatccccagtggtgcttccttactccaggaccacagtggaggat ttctgcatgaagattcacaaaaatcttatcaaagaatttaaatatgctctggtctggggt ctctctgtgaaacacaatcctcagaaagtgggtaaagaccatacgttggaggatgaggat gtcattcaaattgtgaagaagtga >gi568815576r:31339883_31588718|GENSCAN_predicted_peptide_4|736_aa MGQVQWLTPVIPALWEARAGRSQDPRERVKEDDLDVVLSPQRRSFGGGCHVTAAVSSRRS GSPLEKDSDGLRLLGGRRIGSGRIISARTFEKDHRLSDKDLRDLRDRDRERDFKDKRFRR EFGDSKRVFGERRRNDSYTEEEPEWFSAGPTSQSETIELTGFDDKILEEDHKGRKRTRRR TASVKEGIVECNGGVAEEDEVEVILAQEPAADQEVPRDAVLPEQSPGDFDFNEFFNLDKV PCLASMIEDVLGEGSVSASRFSRWFSNPSRSGSRSSSLGSTPHEELERLAGLEQAILSPG QNSGNYFAPIPLEDHAENKVDILEMLQKAKVDLKPLLSSLSANKEKLKESSHSGVVLSVE EVEAGLKGLKVDQQVKNSTPFMAEHLEETLSAVTNNRQLKKDGDMTAFNKLVSTMKASGT LPSQPKVSQMSQLELQQAALEGLALPHDLAVQAANFYQPGFGKPQVDRTRDGFRNRQQRV TKSPAPVHRGNSSSPAPAASITSMLSPSFTPTSVIRKMYESKEKSKEEPASGKAALGDSK EDTQKASEGTAELLRALLKKVCDGENMGLSADQPDILYKALGKTTSFSGLWFCVSEKVLC DSVLPPGMDLSHLQGISGPILGQPFYPLPAASHPLLNPRPGTPLHLAMVQQQLQRSVLHP PGSGSHAAAVSVQTTPQNVPSRSGLPHMHSQLEHRPSQRSSSPVGLAKWFGSDVLQQPLP SMPAKVISVDELEYRQ >gi568815576r:31339883_31588718|GENSCAN_predicted_CDS_4|2211_bp atgggccaggtgcagtggctcacacctgtaatcccagcactttgggaggccagggcgggc agatcacaagatccacgagagcgtgtgaaagaagatgacttagatgttgttctcagccct cagagacggagctttggagggggctgccacgtgacagccgctgttagctcccggcgctca ggaagtccattagagaaagatagtgatgggcttcgtctgcttggtggacgtaggattggc agtgggaggataatctctgcccggacctttgagaaggatcaccgtcttagcgataaggac ctgcgggacttgagagacagagaccgagagagggacttcaaggacaagcgtttcaggaga gagtttggagatagtaagcgtgtctttggtgagcgtagaagaaatgattcttacacagaa gaagaaccagagtggttctctgctggacccacaagtcagtctgaaaccatcgaactgact ggctttgatgataagatactagaagaagatcacaaagggagaaaaagaacaaggcgacgg acagcctctgtgaaggaaggtatagtagagtgcaatggaggagtggccgaagaggatgaa gtggaggtcatccttgcacaggagcctgcggctgatcaggaagtgccaagggatgctgtc ttgcctgagcagtccccaggagactttgactttaatgagttctttaaccttgataaggtg ccatgcttggcttcgatgatagaagatgttttgggagaagggtcagtctctgccagtcgg ttcagtaggtggttctctaacccgagcagatcaggaagccgatccagcagtcttgggtca acaccacatgaagagctagagagacttgcaggtctggagcaagccatcctctctcctgga cagaactcggggaattactttgctcctataccattggaagaccatgctgaaaataaagtg gatattttagaaatgctacagaaagccaaagtggatttgaaacctcttctttccagcctt tctgcaaataaagaaaaacttaaagaaagctcacattcaggggttgtgctttcagtggag gaggtagaagcaggtctgaagggcttgaaggttgaccagcaagtgaagaattcaactccc ttcatggcagaacacctagaagagaccttgagtgccgtaaccaacaatcgacaactgaag aaagacggagacatgactgcgttcaacaagctagtgagcacaatgaaggcaagtgggact ttgccttctcagcccaaagtcagccagatgagccagctggagttgcaacaggcagcttta gaagggctggccttgccacatgaccttgctgtacaggcagcaaacttctaccagcctggt tttggcaaaccacaggtggacagaaccagagatggattcagaaacaggcaacagcgagtg accaagtcaccagcacccgtgcatcgagggaattcctcttcccctgcccctgctgcctcc atcacaagcatgctttctccttcctttacccctacctcagtgattcgtaagatgtacgag agcaaagagaaaagcaaggaggagccagcatctggaaaagcagctcttggtgacagtaaa gaggatactcagaaggccagtgaaggtactgcagagctgttaagggcactgctcaagaaa gtatgtgatggagaaaacatgggcctgagcgctgaccagcctgatattctatacaaagcc ctgggcaagaccactagcttctctggactatggttctgtgtcagtgagaaggttctgtgt gacagtgtgcttcctcctgggatggacttgagtcatttacagggaatatctggccccatc ctgggtcagcccttttaccctttacctgctgctagtcaccctctcttaaaccctcgtcct ggaacacctctgcatctggcaatggtgcaacagcagctacagcgctcagttctgcatcct ccaggctctggttcccatgcagcagctgtcagcgttcagacaacccctcagaacgtgccc agccggtcaggcctgccccacatgcactcccagctggagcatcgccccagccagaggagc agctcccctgtgggccttgccaaatggtttggctcagatgtgctacagcaacccctgccc tccatgcccgccaaagttatcagtgtagatgaattggaataccgacagtga >gi568815576r:31339883_31588718|GENSCAN_predicted_peptide_5|321_aa MGAFGGRRLLQVKESISTFCFTHTSPIHGSLVYNALHLEDKSAQNCHREQEEGGRISRPG RFKLELAQARSRPPRPGGSVPASQPAGYPPAALRASRPCPPPAGAPAPALTHATPGAMRP RPFYTREHRARGAPDPAATAADPYSVPPPLGGPLPRPPLPSPLPAPVRGPALTSPPSSGL GLSGSEPGKDLSRVPPSPVKSVRASASSPESAGDVVYLRVCACADPNQGFPPPGSVAQER KGWYMGDLTPTEPAGSNAVNKPGSLPRVGVWFTEYFKPTIETYCSEKRIPFKILLVINKA PGHQGTLMKMYKEINVTLHVC >gi568815576r:31339883_31588718|GENSCAN_predicted_CDS_5|966_bp atgggggcatttggaggcaggaggcttcttcaggtcaaggaaagcatctccactttctgt ttcacccatacttctcctatccatggctccttggtctacaatgctctgcacctggaagat aaatcggcacaaaattgtcaccgagaacaggaggagggaggccgaatctcccggcccggg cgcttcaagctggagctagcgcaggcccgcagccgtcctccccggcccggcggctcggtc cccgccagccagcccgccggctacccgccggccgccttgcgggcctcgcgcccctgccca ccccctgcgggcgccccggcgcccgcgctcacgcacgcgaccccgggagcaatgcgcccg cgccccttctacacccgggagcaccgggcgcgcggtgccccggacccagccgccaccgcc gccgacccttactcggtgcctccgccgctcggcggccctttgcctcggccgccgctcccc tccccgctgcccgcaccggtacgaggcccggccctaacgtccccaccctcttctgggctc ggcctttccggcagtgagcccggcaaggacctcagtcgggttcctccgagcccggtgaaa tctgtccgagcctccgcttcctcccctgaaagtgcaggtgatgttgtctacctgcgggtc tgtgcgtgcgcggatccaaatcaggggtttcctcctcccggctccgtggcccaggagcgg aagggctggtatatgggcgaccttactcccactgagcccgcagggagtaacgcagtaaac aaacctggaagccttccccgcgtgggagtgtggtttactgaatattttaagcctactatc gagacctactgttcagaaaaaaggattcctttcaagatattactggtcattaacaaggca cctggtcaccaaggaactctgatgaagatgtacaaagagattaatgttactttacatgtc tgctaa >gi568815576r:31339883_31588718|GENSCAN_predicted_peptide_6|591_aa MYRLRYHFETNNVTITQRLANVWNDTITSNCSLDERVQFFFFTNSRLQMMRGNSFTFKSF GAFDVTSSISAVRYFKDGAVKKPYSAKTLSNKKSSASFGIRRELPSTSHLVQYRGTHTCT RQGRLRELRISLGESETLSQNNNNRFIWCLPMPGVTLDAEHIKCVARKFLYLWIRMTFGR VFPSKARFYYEQRLLRKVFEEWKEEWWVFQHEWKLCVRADCHYRYYLYNLMFQTWKTYVR QQQEMRNKYIRAEVHGEKKKSMSEEDAHIYFAEDAKQKMRQAWKSWLIYVVVRRTKLQMQ TTALEFRQRIILRMWVDSTLIVLRKADVDLTKWAGELTEDEMERVMTIMQNPCQYKIPDW FLNRRKDVKDGKYSQVLASGLDKKLRADVERLKKIQAHRGPHHFWGLRVRGQHTKTTGHH GCTMGGVWWSTWRQRLGQVRVSRALHASALKHRALSLQVQAWSQWREQLLYVQKEKQKVV SAVKHHQHWQKRRFLKAWLEYLQVRRVKRQQNDMLLCAEEAAQFEMAEEHHRHSQLLLHR FWNLWRSQIEQKKERELLPLLHAAWDHYRIALLCKCIELWLQYTQKRRYKQ >gi568815576r:31339883_31588718|GENSCAN_predicted_CDS_6|1773_bp atgtatcgtttaagatatcattttgaaacaaacaatgtaaccataacacagagacttgcg aatgtgtggaatgacaccatcaccagcaattgtagccttgatgagagagtccaattcttc ttcttcactaatagcaggttgcaaatgatgaggggtaatagctttacctttaaatctttt ggtgcctttgatgtcacttcaagtatctctgcagtgaggtattttaaggatggtgcagtt aagaaaccttattctgcaaagacactgtccaacaagaagtcttctgcatcctttgggatc cggagggagttacctagtaccagtcatctagtgcagtatcgtggcacacatacttgtacc cgacagggccggttaagagaactgcgcatcagcctgggcgagagtgagactctgtctcag aacaacaacaacagatttatctggtgcttacccatgccaggtgttacgctagatgctgag catataaaatgcgtggccagaaagttcttatatttatggattcgaatgacttttggaaga gtatttccctctaaagccagattttactatgagcagcgattactacggaaggtcttcgaa gaatggaaagaggagtggtgggttttccagcacgagtggaaactctgtgttcgagctgac tgtcactacaggtattacctgtacaacctgatgttccagacgtggaagacctatgtgcgt cagcagcaggagatgaggaacaagtacattagagccgaggttcatggtgagaaaaagaag tccatgtctgaagaagatgcccacatttactttgcagaagatgcaaagcaaaagatgcga caggcctggaagtcctggttgatctacgtggttgttcgtaggaccaaacttcagatgcag actacagctctggagtttaggcaacggattatcttacggatgtgggttgacagtacactc atagtgttgaggaaagctgacgttgacctcaccaagtgggcaggagaactcactgaggat gagatggaacgtgtgatgaccattatgcagaatccatgccagtacaagatcccagactgg ttcttgaacagacggaaggatgtaaaggatggaaaatatagccaggtcctagccagtggt ctggacaagaagctccgtgcagatgtggagcgactgaagaagattcaggcccatagagga ccgcaccacttctggggccttcgtgtccgaggccagcacaccaagaccactggccaccat ggctgtaccatgggcggggtgtggtggagcacgtggaggcagcgactaggacaggtccgt gtgagccgtgccctccatgcctctgctttgaagcacagggccctgagcctccaggtgcag gcttggtcacagtggcgggaacagctcctgtatgtccagaaggagaaacaaaaggttgtc tctgcagtgaaacatcatcagcactggcaaaaacggagatttctaaaggcctggcttgaa tacctgcaagtccgcagagtgaagagacagcagaatgacatgctgctgtgtgcagaagaa gctgcccagtttgagatggcagaagagcaccacaggcacagccagctgctgctgcacagg ttctggaacctctggcggtctcagattgagcagaaaaaggaaagagagctgctcccctta ctgcatgctgcctgggaccactacagaatagcactgctgtgcaaatgtatcgaattgtgg ctacagtatactcagaagaggcggtacaagcag