GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:12:08 Sequence gi568815590f:27549700_27771348 : 221649 bp : 45.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 466 461 6 -0.45 1.01 Sngl - 1483 566 918 1 0 60 32 356 0.911 23.55 1.00 Prom - 1793 1754 40 -10.55 2.02 PlyA - 1894 1889 6 1.05 2.01 Sngl - 3317 2421 897 2 0 44 40 228 0.631 9.77 2.00 Prom - 3410 3371 40 -4.96 3.02 PlyA - 3579 3574 6 1.05 3.01 Sngl - 4463 3807 657 2 0 71 43 371 0.947 27.08 3.00 Prom - 13144 13105 40 -5.06 4.12 PlyA - 13694 13689 6 1.05 4.11 Term - 16069 15955 115 0 1 118 44 114 0.368 8.04 4.10 Intr - 17697 17619 79 1 1 12 70 86 0.063 -2.19 4.09 Intr - 34193 34093 101 1 2 63 64 78 0.279 2.65 4.08 Intr - 48936 48761 176 0 2 106 119 293 0.786 32.94 4.07 Intr - 50310 50081 230 1 2 70 101 314 0.986 28.49 4.06 Intr - 54696 54592 105 1 0 126 92 254 0.929 29.79 4.05 Intr - 55636 55225 412 1 1 60 78 591 0.998 49.06 4.04 Intr - 56825 56655 171 2 0 93 67 104 0.993 8.94 4.03 Intr - 59387 59239 149 0 2 103 55 206 0.992 18.75 4.02 Intr - 60901 60776 126 2 0 103 96 136 0.181 16.55 4.01 Init - 65186 65090 97 2 1 64 69 89 0.052 3.07 4.00 Prom - 68702 68663 40 -4.26 5.00 Prom + 68815 68854 40 -2.96 5.01 Init + 73776 73814 39 2 0 50 81 33 0.130 -1.00 5.02 Intr + 83777 83898 122 1 2 82 105 -2 0.460 0.19 5.03 Intr + 84402 84508 107 0 2 15 116 115 0.665 6.86 5.04 Intr + 100003 100101 99 2 0 37 86 156 0.967 10.38 5.05 Intr + 101809 101928 120 2 0 138 92 127 0.999 18.57 5.06 Intr + 107083 107181 99 2 0 110 106 79 0.974 11.98 5.07 Intr + 108797 109840 1044 0 0 91 94 1820 0.041 173.33 5.08 Term + 121201 121652 452 2 2 137 55 181 0.985 15.05 5.09 PlyA + 126093 126098 6 1.05 6.00 Prom + 149848 149887 40 -3.56 6.01 Init + 151057 151172 116 0 2 59 28 89 0.660 -0.32 6.02 Intr + 153295 153631 337 1 1 83 -3 177 0.114 3.62 6.03 Intr + 167567 167741 175 1 1 16 63 135 0.033 3.31 6.04 Intr + 169531 169610 80 2 2 52 103 40 0.159 1.17 6.05 Intr + 178800 178937 138 2 0 98 83 56 0.949 6.76 6.06 Term + 184603 184743 141 0 0 28 54 170 0.966 5.53 6.07 PlyA + 184846 184851 6 1.05 7.09 PlyA - 185380 185375 6 1.05 7.08 Term - 186546 186517 30 0 0 104 42 32 0.230 -1.95 7.07 Intr - 190818 190773 46 2 1 120 91 10 0.536 2.91 7.06 Intr - 198580 198378 203 1 2 72 77 274 0.956 22.78 7.05 Intr - 198899 198796 104 0 2 55 95 75 0.888 4.79 7.04 Intr - 202888 202813 76 1 1 84 86 59 0.702 4.39 7.03 Intr - 207071 207020 52 1 1 89 88 12 0.505 0.21 7.02 Intr - 212794 212720 75 0 0 58 119 58 0.378 4.53 7.01 Intr - 215552 215465 88 0 1 65 111 35 0.340 2.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 108797 109884 1088 0 2 91 54 1813 0.916 170.47 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:27549700_27771348|GENSCAN_predicted_peptide_1|305_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSRVGRINIV KMAILPKVIYRFNAIPIKLPMIFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEVNLGITIQDIGMGKD FMSKTPKAMATKAKIDKWNLIKLKSFYTAKETTIRVNMQPTKCEKIFATYSSDKGLISRI YNELL >gi568815590f:27549700_27771348|GENSCAN_predicted_CDS_1|918_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaa gaggatacaaacaaatggaagaacattccatgctcacgggtaggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca atgattttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcatcgccaagtcaatcctaagccaaaagaacaaagctgggggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataacgccgcatatctacaac tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaacc ataaaaaccctagaagtaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatggaatcta attaaactaaagagcttctacacagcaaaagaaactaccatcagagtgaacatgcaaccc acaaaatgcgagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcctataa >gi568815590f:27549700_27771348|GENSCAN_predicted_peptide_2|298_aa MGDFNTPLSTLDRSMRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHILESKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNCSTTWKLNNLLLNDYWV HNEMKAEIKMFFEINENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTNAKASRRQEVTKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNQIDAIKNDKGDVTTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYISQD >gi568815590f:27549700_27771348|GENSCAN_predicted_CDS_2|897_bp atgggagactttaacaccccactgtcaacattagacagatcaatgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatacttgaaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaactgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaatcaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacaaatgcaaaagctagcagaaggcaagaagtaactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggatgtcaccact gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattccttgacacatacatctcccaagactaa >gi568815590f:27549700_27771348|GENSCAN_predicted_peptide_3|218_aa MEDEMNEVKQEGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGAPESDEENGTKLENTLQD IIQENFPNLARQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPIISYPAKLSFISEREIKYFTDK QMLTDFVTTRPALKELLKEALNMERNNRYQPLQNHAKM >gi568815590f:27549700_27771348|GENSCAN_predicted_CDS_3|657_bp atggaagatgaaatgaatgaagtgaagcaagaagggaagtttagagagaaaagaataaaa agaaacgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacggctg attggtgcacctgaaagtgacgaggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacattcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgttaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaagggaagcccatcagactaacagcggatctctcggcagaaactctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccataatt tcatatccagccaaactaagcttcataagtgaaagagaaataaaatactttacagacaag cagatgctgacagatttcgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg ctgaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815590f:27549700_27771348|GENSCAN_predicted_peptide_4|586_aa MMRPPAPPARCCTGPHLPASRKLPLLSAAFFGQACKDSRIGGMMKTLLLFVGLLLTWESG QVLGDQTVSDNELQEMSNQGSKYVNKEIQNAVNGVKQIKTLIEKTNEERKTLLSNLEEAK KKKEDALNETRESETKLKELPGVCNETMMALWEECKPCLKQTCMKFYARVCRSGSGLVGR QLEEFLNQSSPFYFWMNGDRIDSLLENDRQQTHMLDVMQDHFSRASSIIDELFQDRFFTR EPQDTYHYLPFSLPHRRPHFFFPKSRIVRSLMPFSPYEPLNFHAMFQPFLEMIHEAQQAM DIHFHSPAFQHPPTEFIREGDDDRTVCREIRHNSTGCLRMKDQCDKCREILSVDCSTNNP SQAKLRRELDESLQVAERLTRKYNELLKSYQWKMLNTSSLLEQLNEQFNWVSRLANLTQG EDQYYLRVTTVASHTSDSDVPSGVTEVVVKLFDSDPITVTVPVEVSRKNPKFMETVAEKA LQEYRKKHRDSLLKLLSRRATWAELRGPGALLELLAVRRKVAGFCDEKREEEKGKEQRGC VCDAQEKAEVAVKLLRDEGGRALCNCQSTDMQQGPFLIVTVSQRRQ >gi568815590f:27549700_27771348|GENSCAN_predicted_CDS_4|1761_bp atgatgcgccccccggcgcccccagcccggtgctgcaccggcccccacctcccggcttcc agaaagctccccttgctttccgcggcattctttgggcaggcgtgcaaagactccagaatt ggaggcatgatgaagactctgctgctgtttgtggggctgctgctgacctgggagagtggg caggtcctgggggaccagacggtctcagacaatgagctccaggaaatgtccaatcaggga agtaagtacgtcaataaggaaattcaaaatgctgtcaacggggtgaaacagataaagact ctcatagaaaaaacaaacgaagagcgcaagacactgctcagcaacctagaagaagccaag aagaagaaagaggatgccctaaatgagaccagggaatcagagacaaagctgaaggagctc ccaggagtgtgcaatgagaccatgatggccctctgggaagagtgtaagccctgcctgaaa cagacctgcatgaagttctacgcacgcgtctgcagaagtggctcaggcctggttggccgc cagcttgaggagttcctgaaccagagctcgcccttctacttctggatgaatggtgaccgc atcgactccctgctggagaacgaccggcagcagacgcacatgctggatgtcatgcaggac cacttcagccgcgcgtccagcatcatagacgagctcttccaggacaggttcttcacccgg gagccccaggatacctaccactacctgcccttcagcctgccccaccggaggcctcacttc ttctttcccaagtcccgcatcgtccgcagcttgatgcccttctctccgtacgagcccctg aacttccacgccatgttccagcccttccttgagatgatacacgaggctcagcaggccatg gacatccacttccatagcccggccttccagcacccgccaacagaattcatacgagaaggc gacgatgaccggactgtgtgccgggagatccgccacaactccacgggctgcctgcggatg aaggaccagtgtgacaagtgccgggagatcttgtctgtggactgttccaccaacaacccc tcccaggctaagctgcggcgggagctcgacgaatccctccaggtcgctgagaggttgacc aggaaatacaacgagctgctaaagtcctaccagtggaagatgctcaacacctcctccttg ctggagcagctgaacgagcagtttaactgggtgtcccggctggcaaacctcacgcaaggc gaagaccagtactatctgcgggtcaccacggtggcttcccacacttctgactcggacgtt ccttccggtgtcactgaggtggtcgtgaagctctttgactctgatcccatcactgtgacg gtccctgtagaagtctccaggaagaaccctaaatttatggagaccgtggcggagaaagcg ctgcaggaataccgcaaaaagcaccgggacagtttgctgaagctgctaagccggagagcc acgtgggctgagctcagaggccctggagctctcttggagcttctggctgttcgccggaag gtggcaggattttgtgatgaaaagagggaggaggagaagggcaaggagcaacgagggtgt gtatgtgatgcccaagagaaagcagaggtggcagtgaagctcctaagagacgaaggtggg agggcactgtgcaactgtcagagcaccgacatgcagcagggtcccttcctcatcgtgact gtcagccagagaaggcagtga >gi568815590f:27549700_27771348|GENSCAN_predicted_peptide_5|693_aa MEEEKSKVEGLHLALDIIRWARKTCRAKIWGPHPCPSHLAVPIVKWTKIKELSRTAIRAL EDPPAARLHYSSSRLQRGPPEAPEEETMKVRSAGGDGDALCVTEEDLAGDDEDMPTFPCT QKGRPGPRCSRCQKNLSLHTSVRILYLFLALLLVAVAVLASLVFRKVDSLSEDISLTQSI YDKKLVLMQKNLQGLDPKALNNCSFCHEAGQLGPEIRKLQEELEGIQKLLLAQEVQLDQT LQAQEVLSTTSRQISQEMGSCSFSIHQVNQSLGLFLAQVRGWQATTAGLDLSLKDLTQEC YDVKAAVHQINFTVGQTSEWIHGIQRKTDEETLTLQKIVTDWQNYTRLFSGLRTTSTKTG EAVKNIQATLGASSQRISQNSESMHDLVLQVMGLQLQLDNISSFLDDHEENMHDLQYHTH YAQNRTVERFESLEGRMASHEIEIGTIFTNINATDNHVHSMLKYLDDVRLSCTLGFHTHA EELYYLNKSVSIMLGTTDLLRERFSLLSARLDLNVRNLSMIVEEMKAVDTQHGEILRNVT ILRGAPGPPGPRGFKGDMGVKGPVGGRGPKGDPGSLGPLGPQGPQGQPGEAGPVGERGPV GPRGFPGLKGSKGSFGTGGPRGQPGPKGDIGPPGPEGPPGSPGPSGPQGKPGIAGKTGSP GQRGAMGPKGEPGIQGPPGLPGPPGPPGSQSFY >gi568815590f:27549700_27771348|GENSCAN_predicted_CDS_5|2082_bp atggaagaggagaagtccaaggttgaggggctgcatctggccctcgacatcataagatgg gcgagaaaaacgtgcagggctaaaatctggggtccccacccctgccccagccacttggcg gtcccgatagtgaaatggactaagatcaaggagctctcccggacggcgatccgcgccctg gaggatccgccggccgcccggctccactacagctccagccgcctgcagcggggccctcct gaggccccagaggaagagaccatgaaagtgaggtcggccggcggcgatggagatgccttg tgcgttacagaagaggacctggcgggtgacgacgaggacatgccgaccttcccatgcacc cagaagggccggccagggccccgctgcagccgctgccagaagaacctatctttgcacaca tcggtgcggattctttacctcttcctggccctgctcctggtggccgtggctgtgttggcc tctctggttttcagaaaagtggactctctctccgaagacatctccttgacccagtctatt tatgacaagaagcttgtgttaatgcagaaaaatctccagggcctggatccgaaagccctg aacaactgctctttctgccatgaagctgggcagctggggccagagatccgaaaactgcag gaggagctggagggaattcagaagctgcttctggcccaggaggtgcagctggaccagacc ttacaggcccaggaggtgctctccaccaccagcagacaaatctcccaggagatgggcagt tgctccttctccatccaccaggttaaccagtctctggggctcttcctggcccaggtgaga ggctggcaggccaccacagctggcctggacctctctctgaaggacctcacccaggagtgc tacgatgtcaaggctgcagtgcaccagatcaacttcaccgtggggcagacttccgagtgg atccacgggatccagcggaagacagacgaggagaccctgaccctccagaagattgtcacc gactggcagaactacacacggctcttcagcggcctgcgcaccacctccaccaagactgga gaggcggtcaagaacatccaggccaccctgggggcctcctcacagcgcatcagccagaac tcagagagcatgcacgacctggtactccaggtcatgggcttgcagctgcagctggataac atctcgtccttcctggatgaccacgaagagaacatgcatgatcttcagtaccatacccac tacgcccagaaccgcactgtggagaggtttgagtctctggaaggacgcatggcttctcac gagattgaaattggcaccatcttcaccaacatcaatgccaccgacaaccacgtgcacagc atgctcaagtacctggatgacgtgcggctctcctgcacgctgggcttccacacccatgcc gaggagctctactacctgaacaagtctgtctccatcatgctgggcaccacagacctgctc cgggagcgcttcagcctgctcagtgcccggctggacctcaacgtccggaacctctccatg atcgtggaggagatgaaggcagtggacacacagcatggagaaatccttcgcaatgtcacc atcctacgaggtgcccccggccctccaggaccaagaggattcaaaggagatatgggcgtg aaagggcctgttggcggcagaggcccgaaaggagaccccggcagcttgggccccctggga ccccagggtcctcaggggcaacctggagaggccgggcctgtgggagaaaggggccctgtt ggccctcgagggttcccaggcctcaaaggctcaaagggcagctttggaactggagggccg agaggacagccaggcccaaaaggggacatagggcccccagggccagaagggcccccgggg tctccagggccctcagggcctcagggaaaaccgggaattgcagggaagacagggtcacca ggccagcggggggccatggggcctaagggtgaaccagggatccagggtccccctggtctc ccggggcctccaggtccaccaggaagccagagcttctactga >gi568815590f:27549700_27771348|GENSCAN_predicted_peptide_6|328_aa MAKISKTNNTKCCEDVEQRELSCINGETVKWYNPFGKVRPCQARSWAQRPAGTHMKGLPV ENWHGTQGLKAAHTTALAQGPEHRATETWPQGSGEKNSLSRTGEEPPIPAAWLGTSREKA QCPESFEHEAAAPAPGRGPGNQGFVLGAALGLDTDTDVGIAIDVGIAIDVDADINKHEND TNEQHNTSGNLQQQSGRMEGVRHGAQTSQWTWMELEAIILSKLTQEQKTKYCMFSHRIQR LILPKPTFSSLHSHSGGAVAGEEEDHGAFWKRQGEVSDVNQESVETEDDIGKVTQTHSTD LIKGTAHKNSNNFEFSYPTFRKQSNGKK >gi568815590f:27549700_27771348|GENSCAN_predicted_CDS_6|987_bp atggctaaaatttcaaaaaccaacaatacaaaatgttgtgaggatgtggagcaacgagaa ctctcatgcattaatggtgaaactgtgaagtggtacaacccttttggaaaagttaggccc tgccaggcccgaagctgggcccagaggccagctggcacccacatgaaaggccttcctgtc gagaactggcatggaacccagggcctcaaggctgcccacaccacagccctggcccagggt ccagagcacagggccacggaaacttggccacaaggctctggggagaaaaacagcctgagc cgcacaggagaggagcctcccatccccgcagcctggctgggcacatcaagggagaaagcg cagtgcccagaatcctttgaacatgaagccgcagcacctgcacctggcagaggacccggg aaccagggattcgtgctgggggcagcacttgggctagatacagatacagatgtgggtata gctatagatgtgggtatagctatagatgttgatgctgatatcaataaacatgagaacgac accaacgaacaacataatacaagtggtaacctacagcagcagagtgggagaatggaagga gtcaggcatggggcccagacttctcaatggacatggatggagctggaggccattatcctt agcaaactaacacaggagcagaaaaccaaatactgcatgttctcacataggatccagcgg ctaatcctccctaaacccaccttctccagcctccactcccattctggaggggccgtcgct ggggaagaggaggaccatggggcattctggaagaggcagggtgaagtatcagatgtgaac caggagagtgttgaaactgaagacgacattggcaaagtgactcagacccacagcacagat ttgatcaagggcacagctcataaaaatagcaacaatttcgaattctcctatccaacgttt aggaaacaatccaatggcaagaaatga >gi568815590f:27549700_27771348|GENSCAN_predicted_peptide_7|224_aa XNSSAYTIYMGKDKYESKFSKAFDFEIFSSFCFFRFSYLIADEDLIKHGWPEDIWFHVDK LSSAHVYLRLHKGENIEDIPKEVLMDCAHLVKANSIQGCKMNNVNVVYTPWSNLKKTADM DVGQIGFHRQKDVKIVTVEKKVNEILNRLEKTKVERFPDLAAEKECRDREERNEKKAQIQ EMKKREKEEMKKKREMDELRSYSSLMKVENMSSNQDGNDSDEFM >gi568815590f:27549700_27771348|GENSCAN_predicted_CDS_7|675_bp nttaattcatctgcctacactatttacatgggaaaagataaatatgaaagtaagttttca aaagcatttgattttgaaattttcagcagtttttgtttctttcgtttctcatatctcatt gcagatgaagatctgatcaagcatggctggcctgaagatatctggtttcatgtggacaaa ctctcttcggctcatgtataccttcgattacataagggagagaatatagaagacatccca aaggaagtgctgatggactgtgcccaccttgtgaaggccaatagcattcaaggctgcaag atgaacaacgttaatgtggtatatacgccgtggtctaacctgaagaaaacagctgacatg gatgtggggcagataggctttcacaggcagaaggatgtaaaaattgtgacagtggagaag aaagtaaatgagatcctgaaccgattagaaaagaccaaagtcgagcggttcccagaccta gcagcagagaaagaatgcagagatcgtgaagagaggaatgagaaaaaagcccaaattcag gaaatgaaaaagagagaaaaagaagaaatgaagaagaagagggaaatggatgaacttagg agctattcatcactaatgaaagttgaaaatatgtcttcaaatcaggatggcaatgattca gatgaattcatgtaa