GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:29:52 Sequence gi568815577f:34270479_34470847 : 200369 bp : 44.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 252 311 60 2 0 68 40 84 0.549 2.85 1.02 Intr + 12801 12997 197 2 2 47 66 145 0.410 6.41 1.03 Intr + 15109 15497 389 1 2 36 95 144 0.282 4.33 1.04 Intr + 17532 17631 100 1 1 84 58 24 0.115 -1.83 1.05 Intr + 23203 23278 76 1 1 104 97 5 0.077 2.52 1.06 Intr + 23651 23825 175 1 1 79 69 49 0.100 1.61 1.07 Intr + 43314 43437 124 1 1 67 82 85 0.335 5.44 1.08 Term + 48448 48631 184 1 1 56 32 123 0.588 0.52 1.09 PlyA + 50952 50957 6 1.05 2.00 Prom + 53186 53225 40 -2.56 2.01 Init + 55785 55851 67 2 1 63 100 81 0.852 8.03 2.02 Intr + 67732 67992 261 2 0 66 49 130 0.431 4.36 2.03 Intr + 68827 68917 91 2 1 70 63 105 0.686 5.25 2.04 Intr + 83826 83907 82 0 1 49 73 74 0.011 1.64 2.05 Term + 91605 91673 69 2 0 93 49 47 0.055 -0.76 2.06 PlyA + 92389 92394 6 1.05 3.00 Prom + 98200 98239 40 -2.66 3.01 Sngl + 100001 100372 372 1 0 71 44 431 0.952 33.13 3.02 PlyA + 100520 100525 6 1.05 4.09 PlyA - 100919 100914 6 1.05 4.08 Term - 105077 104878 200 0 2 78 52 164 0.923 9.36 4.07 Intr - 105390 105262 129 1 0 74 75 119 0.940 9.87 4.06 Intr - 112296 112212 85 0 1 71 65 56 0.027 0.99 4.05 Intr - 118187 118073 115 1 1 123 26 -20 0.064 -4.25 4.04 Intr - 121234 121108 127 2 1 59 113 96 0.829 8.94 4.03 Intr - 121648 121550 99 2 0 128 28 50 0.439 3.08 4.02 Intr - 123478 123366 113 0 2 65 59 79 0.824 2.72 4.01 Init - 130594 130242 353 1 2 110 34 130 0.760 6.44 4.00 Prom - 133055 133016 40 -4.16 5.04 PlyA - 133285 133280 6 1.05 5.03 Term - 137253 137158 96 0 0 105 37 42 0.385 -1.23 5.02 Intr - 140830 140642 189 1 0 3 89 135 0.412 4.88 5.01 Init - 141091 140930 162 1 0 74 37 103 0.273 3.53 5.00 Prom - 142105 142066 40 -6.56 6.00 Prom + 142243 142282 40 -4.26 6.01 Init + 143572 143634 63 0 0 91 80 26 0.881 3.25 6.02 Intr + 146532 146565 34 2 1 93 50 106 0.121 5.10 6.03 Term + 147110 147249 140 0 2 -16 39 155 0.129 -1.67 6.04 PlyA + 147536 147541 6 -0.45 7.04 PlyA - 147548 147543 6 -0.45 7.03 Term - 149152 148740 413 2 2 73 54 365 0.952 27.00 7.02 Intr - 155851 155671 181 1 1 88 36 73 0.218 1.54 7.01 Init - 165489 165349 141 0 0 71 2 136 0.010 3.43 7.00 Prom - 172374 172335 40 -6.16 8.02 PlyA - 173425 173420 6 1.05 8.01 Sngl - 179156 178767 390 2 0 70 46 588 0.816 48.82 8.00 Prom - 184240 184201 40 -3.96 9.00 Prom + 186775 186814 40 -1.76 9.01 Init + 189171 189324 154 2 1 75 18 88 0.586 -1.33 9.02 Term + 190027 190160 134 2 2 104 46 93 0.597 4.95 9.03 PlyA + 195501 195506 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 108208 108154 55 1 1 35 119 62 0.825 5.45 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:34270479_34470847|GENSCAN_predicted_peptide_1|434_aa MAVEELATDGTLMKQDSEEVVLALTDFPVKSNWPGLKHSCEDFEAAFESELLVHYCNEDM HRDPNQSSRVIMLWYLLENAVVRQPRDSQFKCYVNDSRASWCALKKTLIMCSHRAEIAEN QTQNLIFQLTELQRKLSSQPHRVKVRALIKKEWDPVNWDRKVREDLDEAGDTEPLNSDYF ASQVADVSPSPVELTSLPPAEMASSNPVAMAFPLSGYSSPPGCFHGLVLTVCGFSRGTMQ AVSGSTILRLQASGQQRPNSSLNPQHLAQYAWHKIPRREKLIGPFRSGICSYTSQPSGGN FMQDKPGCQQPISVVPREKGIPMSKRCRVTKSAALDGALSVSEAPSMVSGKCTGVLIAVC LQAAFPPMCLQCSRDYKGINSKYEKYLMTGTNQSKWTPAALWAYVGEIPIPPTTITNQSP RDAFKGETEEALLN >gi568815577f:34270479_34470847|GENSCAN_predicted_CDS_1|1305_bp atggcagtggaggagctagcgacagatggaaccctgatgaagcaggactcagaggaggtg gttttggccctgactgactttccagtgaaatctaattggccgggtttaaaacacagctgt gaggattttgaagccgcctttgaaagtgaactgctggtgcactattgcaatgaggacatg cacagagatcccaaccagagctcaagagtaataatgctgtggtaccttttggaaaatgct gttgtcaggcagccccgggactcccagttcaagtgctacgtaaatgactcaagagcttct tggtgtgccctaaaaaagaccctgatcatgtgtagccacagagctgaaattgctgaaaat caaacacagaaccttatctttcagctgactgaattacaacgcaaattgagctcccagcct cacagggtaaaagtgagggcattgattaagaaagaatgggatcctgtaaattgggatagg aaggtgcgggaagaccttgatgaagctggggacactgagcccctaaattctgattatttt gcttcgcaagtggcagatgtctccccatctccagtggaattgacctccttacccccagca gaaatggcctcctcaaacccagtggccatggccttcccactctcagggtacagctcccct cctggctgctttcatggactggtgttgactgtctgtggcttttccagggggacaatgcaa gctgtcagtggatctaccattctgaggctgcaagcatcaggacagcagagaccaaattca tcattgaatcctcaacatctagcacaatatgcctggcataagattcctagaagagagaag ttgattggcccttttaggtcaggaatttgttcctataccagtcagccctcaggaggcaat ttcatgcaggataaacctggctgccagcagcccatttctgtggttcccagagaaaaagga atccccatgagcaaaagatgtagagttaccaagtcagctgcgcttgatggagctctgtct gtgtctgaagctcccagtatggtttctggcaaatgcacaggtgtgctcatcgcagtgtgc ctgcaggcggctttcccgcccatgtgtcttcagtgctctagagactataaaggcatcaat agcaaatacgaaaaatatttaatgactggtaccaaccagtcaaaatggactccagctgct ttatgggcatatgttggtgagatacctatcccacccacaaccatcaccaaccagagtcca agagatgctttcaaaggtgaaacagaagaagcattgctcaattaa >gi568815577f:34270479_34470847|GENSCAN_predicted_peptide_2|189_aa MGEQNKSLRMQNGSFTKEKTEGTEEGADRLMGEKCMNFSGSINASENQQRAKRAKPRDVT WLRTLRSLKAGIKPCSCLHFNTKRSSRLQVEAQEALAVLKVLSLKVSGPGHRQAVEGMNV HRAVDGTDVSAFQSQQQQTNSGISEASYQVACFIDKLKKPYMFGEFLRHGEHSYTTKVFL LAEFAYLVQ >gi568815577f:34270479_34470847|GENSCAN_predicted_CDS_2|570_bp atgggagagcagaataagtctctgagaatgcagaatggctcctttacgaaagagaagacc gagggaactgaggagggggcagataggctgatgggggaaaagtgcatgaatttttctggt tccatcaatgcctctgagaatcagcaaagagcaaaaagagcaaagcccagagacgtgacc tggttgaggaccctacgctccttgaaggcaggaatcaagccttgttcatgtttgcatttc aacaccaagcgcagctccaggctccaagtcgaggcccaggaagcgctggctgtcctgaag gtgctcagtctgaaggtgtcgggcccaggccacagacaggcagttgagggcatgaatgtt catcgagcagtggacggcacagatgtttctgcttttcagagccagcagcaacaaaccaac agcggcatttcagaagcatcctaccaagttgcttgtttcattgataaactaaagaaaccc tacatgtttggagagttcttgaggcatggtgaacacagttacaccaccaaagtgttcctc ctggctgagtttgcctatcttgttcagtga >gi568815577f:34270479_34470847|GENSCAN_predicted_peptide_3|123_aa MSTLSNFTQTLEDVFRRIFITYMDNWRQNTTAEQEALQAKVDAENFYYVILYLMVMIGMF SFIIVAILVSTVKSKRREHSNDPYHQYIVEDWQEKYKSQILNLEESKATIHENIGAAGFK MSP >gi568815577f:34270479_34470847|GENSCAN_predicted_CDS_3|372_bp atgtctactttatccaatttcacacagacgctggaagacgtcttccgaaggatttttatt acttatatggacaattggcgccagaacacaacagctgagcaagaggccctccaagccaaa gttgatgctgagaacttctactatgtcatcctgtacctcatggtgatgattggaatgttc tctttcatcatcgtggccatcctggtgagcactgtgaaatccaagagacgggaacactcc aatgacccctaccaccagtacattgtagaggactggcaggaaaagtacaagagccaaatc ttgaatctagaagaatcgaaggccaccatccatgagaacattggtgcggctgggttcaaa atgtccccctga >gi568815577f:34270479_34470847|GENSCAN_predicted_peptide_4|406_aa MPRFASPLLRNVIIRSQFDGIKRKQCLQYLKTLRTLQYDGFKTVYFGETNIPESLVTGED ISDGYFIQTPTWCIVHAAGSQGWVPWKYRVFLRDELCIKQEDSLFSEFCDVVRKAYGNHS LIKYWSTFCVPGWVRQGQEYQGKQAWTLKVQVWCGDSPPSILWLHSDTDEICMGELPHGV MVRVFPAVDCVKRLNLFFTPEVITMNITSSRVASHVGRCGCRSVWRMKSTRWSPLKLTFT WQNLEGNTKGMSRKCLIRVYPMFTAQHKDAVETHGKRQKCGLRRTQHKGSRGQEHSQDSP GTNTVGLNQSWAPPGPRASRNTLLNGLTAPDGQVQQGRPAAEAPAPGRAQTTTSTVLPGK TTGPEMHRYVLRRSGPVPPTRVHYSVRRRSGPGQKTWGALLTAQAQ >gi568815577f:34270479_34470847|GENSCAN_predicted_CDS_4|1221_bp atgcctcgctttgcaagccctcttttaagaaatgtcattatcagaagtcaatttgatggc atcaagaggaagcaatgcctccaatatctgaaaaccctgagaacactgcaatatgatgga tttaagaccgtatattttggggaaaccaatatcccagaaagtctcgtaactggggaagat attagtgatggatatttcatacaaaccccaacttggtgtattgtgcatgctgcgggtagt caaggatgggtgccttggaaatatcgggtgttcctaagagacgagctgtgtatcaaacaa gaagacagcctcttctctgagttctgtgatgtggtgaggaaggcctatggaaatcattca ctcatcaaatactggagcaccttctgtgtgcctggctgggtgcgtcagggccaggagtac caaggaaaacaagcctggaccctcaaggtccaggtctggtgtggagactctccaccgtcc atcctctggcttcactctgacacagatgagatctgcatgggggagctccctcatggtgtg atggtgcgtgtgtttccggcagtggactgtgtgaaacggctgaacctcttcttcacgcct gaggtcatcaccatgaacataacatcatcacgtgtggcatcccatgtgggacgctgtggc tgccgcagtgtctggcgcatgaagagcacacgctggagtccactcaagctgacatttaca tggcagaatctagaagggaataccaaagggatgagtaggaaatgtttgataagagtatac ccgatgtttactgctcagcataaagacgcagtggagacgcacggcaagaggcagaagtgt ggactgagaaggacgcagcacaagggcagcagaggtcaagaacattcacaggactctccg gggaccaacaccgtaggcttgaaccaatcctgggccccgcccgggccacgtgccagccgg aacacgctgctcaacgggttgacggcgcccgacggccaggtccagcaggggcgccccgca gcggaggctccagcgcctggccgcgcacaaaccacgacttctaccgtcctgccggggaaa actacaggtcccgaaatgcaccgctacgtgctcaggcgcagtgggccggtgccgccgacg agagtgcactactccgtgcgcaggcgcagtgggcccgggcagaagacctggggcgcgctg ctcactgcgcaggcgcagtga >gi568815577f:34270479_34470847|GENSCAN_predicted_peptide_5|148_aa MGSVPLQEETPDPPLSLHLVRTQQEVFYPRTRKQVLTGYRICQHLDLRLPASRTAARLIP SDSCSSGKRFQYRPSSTPTQKTVTEVFKERTEREQKGTEGEMKTERGGHVEMESSRTVRF QEKPEMGVCLNKPGAEEGRREFGDYPDS >gi568815577f:34270479_34470847|GENSCAN_predicted_CDS_5|447_bp atgggatcagtgcccttacaggaagagacaccagacccccctctctctctgcaccttgtg aggacacagcaagaagtgttctacccgcgaaccaggaagcaggtcctcaccgggtaccga atctgccaacaccttgatctcagacttccagcttccagaactgcggcaagactgattcct tcagactcctgcagcagcggaaagagattccagtaccgaccaagctcaaccccaactcag aaaacggtgactgaggtcttcaaagagagaacagaaagggaacaaaaagggactgaagga gaaatgaaaacagaacgtgggggccacgtggaaatggaaagttccagaacggtgaggttc caggagaagccagagatgggggtttgccttaacaaaccaggagctgaggaaggtcgaaga gaattcggtgattatcccgactcatga >gi568815577f:34270479_34470847|GENSCAN_predicted_peptide_6|78_aa MGAELSFGRILVRRFSNWGRQDDDNDDDDNDDASSPPPLMPTSTTTTYATTVSIITTTNN ITNSNTAININHHHHQNH >gi568815577f:34270479_34470847|GENSCAN_predicted_CDS_6|237_bp atgggagctgagctgtcttttgggagaatcctagtgagaaggttctccaactggggccgc caagatgatgacaatgatgatgacgacaatgatgatgcatcatcaccaccaccattaatg ccaacatcaaccaccaccacctacgccaccaccgttagcatcataaccaccaccaataac atcaccaacagcaacactgccatcaacataaaccatcaccaccaccaaaaccattag >gi568815577f:34270479_34470847|GENSCAN_predicted_peptide_7|244_aa MGKDFTSKTPKAMAAKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWDQGHKILLILHLL HHLHLTDEETEAHKDEGAGPGSCSKDVAGARALTPSPYHCHEPLIITGAKWTPHEASNQT QASTLLGLLLGDHTEGRNDTNSTRALKVPDGTSAAWYILTIIGIYAVIFVFRLASNILRK NDKSLEDVYYSNLTSELKMTGLQGKVAKCSTLSISNRAVLQPCQAHLGAKGGSSGPQTAT PETP >gi568815577f:34270479_34470847|GENSCAN_predicted_CDS_7|735_bp atgggcaaggacttcacgtctaaaacaccaaaagcaatggcagcaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggaccaaggacataagatcctgctcatccttcacctcctt caccaccttcacctcacagatgaggaaactgaggctcacaaagatgagggggctggccca gggtcatgcagcaaggatgtggctggggccagagccctgactcccagcccttatcactgc cacgagccccttattatcacgggggccaagtggactccccatgaagcctccaaccagacc caggccagcaccctcctggggctcctgctgggtgaccacacagaggggaggaatgacacc aactccaccagggctctgaaggtgccagacggaaccagcgctgcctggtatatactcacc atcatcggcatctacgcggtgattttcgtcttccggctggccagcaacatcctcagaaag aatgacaagtccttagaagatgtttattactcaaatctgacctctgaactcaaaatgaca gggctgcagggcaaggtcgccaagtgctccaccctgtctatcagcaacagagctgtgctg cagccctgccaggcccacctgggggcaaagggcggaagcagcgggccccaaaccgcaacc ccagagaccccctga >gi568815577f:34270479_34470847|GENSCAN_predicted_peptide_8|129_aa MILSNTTAVTPFLTKLWQETVQQGGNMSGLARRSPRSSDGKLEALYVLMVLGFFGFFTLG IMLSYIRSKKLEHSNDPFNVYIESDAWQEKDKAYVQARVLESYRSCYVVENHLAIEQPNT HLPETKPSP >gi568815577f:34270479_34470847|GENSCAN_predicted_CDS_8|390_bp atgatcctgtctaacaccacagcggtgacgccctttctgaccaagctgtggcaggagaca gttcagcagggtggcaacatgtcgggcctggcccgcaggtccccccgcagcagtgacggc aagctggaggccctctacgtcctcatggtactgggattcttcggcttcttcaccctgggc atcatgctgagctacatccgctccaagaagctggagcactcgaacgacccattcaacgtc tacatcgagtccgatgcctggcaagagaaggacaaggcctatgtccaggcccgggtcctg gagagctacaggtcgtgctatgtcgttgaaaaccatctggccatagaacaacccaacaca caccttcctgagacgaagccttccccatga >gi568815577f:34270479_34470847|GENSCAN_predicted_peptide_9|95_aa MGAGGASQSRQASRPTPPTRTAQPRVPRDHPRPAPAPPGPPHSAPQTNTNARHFFLGFDL GVLMGTRVIGGPPRGAPMCPVHQAEPALPYSVGAK >gi568815577f:34270479_34470847|GENSCAN_predicted_CDS_9|288_bp atgggggcgggcggggcctcccagagccgccaggcgtcccgccccactccgcccacacgc acggcccagcccagggttccccgggaccaccccagaccagccccggcccccccgggtcct ccacactctgcaccccagaccaacaccaacgcgcgtcacttcttcctgggcttcgacctc ggtgttctcatggggacgagggtgattggaggccctccaaggggtgcaccgatgtgtcct gtgcaccaggcagaaccagcattgccctacagtgtgggtgcaaaatga