GENSCAN 1.0 Date run: 5-Nov-116 Time: 03:11:13 Sequence gi568815596r:151150909_151355744 : 204836 bp : 40.18% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 13649 13740 92 1 2 69 64 82 0.118 1.77 1.02 Term + 15498 15708 211 0 1 42 49 191 0.737 6.38 1.03 PlyA + 18055 18060 6 1.05 2.00 Prom + 23758 23797 40 -5.55 2.01 Init + 29197 29348 152 0 2 77 71 166 0.676 11.37 2.02 Term + 35203 35674 472 1 1 -38 48 347 0.574 11.52 2.03 PlyA + 35683 35688 6 1.05 3.00 Prom + 36447 36486 40 -5.95 3.01 Init + 41334 41406 73 2 1 62 81 21 0.533 0.08 3.02 Intr + 41572 41778 207 2 0 40 89 168 0.860 10.23 3.03 Term + 47745 47932 188 1 2 47 55 141 0.711 3.37 3.04 PlyA + 50036 50041 6 1.05 4.04 PlyA - 50735 50730 6 1.05 4.03 Term - 72244 72189 56 2 2 79 55 65 0.329 -0.96 4.02 Intr - 74612 74498 115 2 1 51 72 88 0.529 2.60 4.01 Init - 84981 84973 9 0 0 110 107 27 0.824 5.90 4.00 Prom - 85693 85654 40 -4.25 5.00 Prom + 87216 87255 40 -7.35 5.01 Init + 90242 90244 3 1 0 113 89 0 0.867 2.65 5.02 Term + 92346 92474 129 2 0 87 42 145 0.936 7.00 5.03 PlyA + 92597 92602 6 1.05 6.06 PlyA - 93081 93076 6 1.05 6.05 Term - 94958 94938 21 2 0 131 54 10 0.290 -0.77 6.04 Intr - 100756 100007 750 1 0 100 96 296 0.613 22.32 6.03 Intr - 101947 101847 101 2 2 46 116 35 0.794 1.01 6.02 Intr - 104835 104625 211 0 1 100 63 144 0.948 10.66 6.01 Init - 105255 105253 3 0 0 113 22 0 0.802 -4.05 6.00 Prom - 105592 105553 40 -6.25 7.00 Prom + 106796 106835 40 -8.75 7.01 Sngl + 110760 111137 378 2 0 115 44 301 0.890 24.31 7.02 PlyA + 114220 114225 6 1.05 8.08 PlyA - 115125 115120 6 1.05 8.07 Term - 119967 119785 183 0 0 78 38 196 0.819 10.16 8.06 Intr - 120824 120718 107 0 2 67 103 26 0.830 1.01 8.05 Intr - 124762 124576 187 1 1 58 63 193 0.980 12.14 8.04 Intr - 124956 124850 107 1 2 73 92 28 0.742 0.71 8.03 Intr - 128082 127920 163 0 1 35 86 94 0.767 2.53 8.02 Intr - 131135 131040 96 2 0 25 78 88 0.560 0.79 8.01 Init - 132040 131960 81 1 0 82 93 102 0.991 11.12 8.00 Prom - 137346 137307 40 -6.95 9.00 Prom + 142230 142269 40 -5.85 9.01 Init + 145315 145506 192 0 0 97 44 83 0.841 1.98 9.02 Intr + 146178 146439 262 2 1 -16 33 251 0.223 5.34 9.03 Intr + 152459 152518 60 0 0 82 96 38 0.583 1.79 9.04 Term + 154392 154558 167 1 2 -13 39 276 0.976 9.70 9.05 PlyA + 155348 155353 6 1.05 10.00 Prom + 155814 155853 40 -8.65 10.01 Init + 156786 156845 60 2 0 34 98 63 0.702 1.24 10.02 Intr + 159569 159593 25 1 1 136 31 25 0.438 -1.72 10.03 Intr + 160416 160561 146 1 2 28 81 150 0.730 7.38 10.04 Intr + 161934 162509 576 2 0 74 65 321 0.501 20.39 10.05 Intr + 164489 164761 273 1 0 64 81 85 0.662 2.21 10.06 Term + 165480 165692 213 2 0 65 42 141 0.817 3.45 10.07 PlyA + 165994 165999 6 1.05 11.00 Prom + 171194 171233 40 -5.85 11.01 Init + 171610 171626 17 0 2 50 116 9 0.011 -0.39 11.02 Intr + 194077 194224 148 1 1 64 60 105 0.281 4.62 11.03 Intr + 198968 199136 169 1 1 49 77 108 0.543 4.40 11.04 Term + 199195 199304 110 2 2 63 43 90 0.575 -0.31 11.05 PlyA + 200758 200763 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_1|100_aa DLDMGVLIPASGLHTVTPNTHAAIARAQAQSLYVHKGFLGEWILWRAVETSTKVNVLPLS QPQGLGAEDLVKKTDPENVWKSRCFDHQQATEIVGLSSGS >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_1|303_bp gatttggacatgggtgtcctgattccagcatctgggctccacactgtgacccccaacact catgctgcaatagcacgggcacaggctcagagtctgtatgtccataaaggctttctgggt gaatggattctttggagggctgtagagacttccaccaaagtcaacgtgctgccactgagt cagccacaaggacttggagcagaggacttagtgaaaaaaacagacccagagaatgtgtgg aaatctagatgctttgatcatcagcaagccacggaaattgtgggtttgagcagtggttct tga >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_2|207_aa MMWARLRRGRFCLATLLLPFLGSQTTRQQSNKDLLHFLRALYGSTRLPFNRTQTSTLLTP IPLRWFRTLRTPACAHHGATVQQLEGRWRLADSKGFDAYMKKLGVGISLRNMGAMAKPDC IITCDGKNLTIKTESTLKTTQFSCTLGEKFEETTAVGRKTQTVCSFTDGALVPHQEWDGK ENTITRKLKDAISGGLCHEQCHLYSDL >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_2|624_bp atgatgtgggcaaggctgaggagaggaaggttctgcttagccacactcctcctgcccttc ctgggctcccagacaactcgccagcagagtaataaagatcttctccactttctgcgagct ttatatggaagtactcgtctaccctttaatagaacacaaaccagtacgctgctcacgccg atcccgctccgctggttccgcacgctccgcacaccagcctgcgcgcaccatggggccacc gttcagcagctggaaggaagatggcgcctggcggacagcaaaggctttgatgcatacatg aagaaactaggagtgggaatatctttgcgcaatatgggcgcaatggccaaaccagactgt atcatcacttgtgatggcaaaaacctcaccataaaaactgagagcactttgaaaacaaca cagttttcttgtaccctgggagagaagtttgaagaaaccacagctgttggcagaaaaact cagactgtctgcagctttacagatggtgcattggttccgcatcaggagtgggatgggaag gaaaacacaataacaagaaaattgaaagatgcaatcagtggtggattgtgtcacgaacaa tgtcacctgtactcggatctatga >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_3|155_aa MTSLMINKYPIQVQYQGYFYGIKQVVKGSDSWSLVDEHSKEWRRHETVKQVRCVPVANLS ESSDKTHSYATSSRKQRYYLQIGSKEQKKHEIHGKCTFTQEHRLTEHDINEKDKQSLTPP PNFSNPVGKKRCTSSHREEIISSQHSKQGVAHADD >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_3|468_bp atgacttcccttatgattaataaatatcccattcaggtgcaatatcagggctatttctat ggcattaagcaagtggttaaggggtctgatagttggtccttggtggatgaacacagtaag gaatggagaaggcatgagactgtgaagcaggttcgctgtgtaccagttgccaacttgtct gagtctagtgacaaaacacattcatatgcaacaagttccaggaagcagaggtattactta cagataggcagcaaggaacaaaagaagcatgagatccatgggaagtgtaccttcacacag gaacacagacttacagaacatgatattaatgaaaaggacaagcagagtttaacgcctcct cccaacttctcaaatccagtgggaaagaaaaggtgtacttctagccacagagaggagatc atttcttctcaacattccaagcagggtgtggcccacgctgatgactga >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_4|59_aa MALREQECIWPYIAAANDLASVRKVNWRTKPAHDGKITEKQATLAADWMVPTHIVGGSS >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_4|180_bp atggcgctgagggaacaagaatgcatatggccttatattgctgctgcaaatgatcttgcc tccgtgagaaaagttaactggaggacaaagccagcacatgatgggaaaattacagagaaa caggccacactggcagccgattggatggtgcctactcacattgtgggtgggtcttcctga >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_5|43_aa MKVQPSAMATGLGQHDDKDCTKMTDNWRQFVKQDGFPWQTGQV >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_5|132_bp atgaaagtacagccatcagcgatggccacaggacttggtcagcatgatgacaaggactgc actaaaatgacagacaactggagacaatttgttaaacaggatgggttcccgtggcaaact gggcaagtttaa >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_6|361_aa MASVLNVKESKAPERTVVVAGLPVDLFSDQLLAVLVKSHFQDIKNEGGDVEDVIYPTRTK GVAYVIFKEKKVAENVIRQKKHWLARKTRHAELTVSLRVSHFGDKIFSSVNAILDLSVFG KEVTLETLVKDLKKKIPSLSFSPLKPNGRISVEGSFLAVKRLRESLLARACSLLEKDRNF TSEERKWNRQNPQRNLQRSNNSLASVRTLVPETARSGEMLVLDTDVFLYLKHKCGSYEST LKKFHILSQEKVDGEITTICLKSIQVGSQPNNAKHVKELIEEWSHALYLKLRKETFILEG KENREKRMIKRACEQLSSRYLEVLINLYRTHIDIIGSSSDTYLFKKGVMKLIGQKIQEII N >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_6|1086_bp atggcatcagttttgaatgtcaaggaatccaaagctcctgaaagaacggttgtagttgct ggtcttccagttgacctttttagtgatcaattattggccgtattagtgaagagccacttc caagacattaagaatgagggcggagatgttgaagatgtgatatatccgacaagaaccaag ggagttgcatatgtaatattcaaagaaaaaaaagttgcagagaatgtcatcagacaaaag aaacactggctagcaaggaagactagacatgctgaactcacagtctctctcagagtctct cattttggtgacaagatcttcagctctgtaaatgccatccttgatctttctgtttttgga aaagaagttactctagaaactctggtaaaagacctgaaaaaaaaaatcccgagtttaagc ttcagtcctttgaaacccaatggaagaatctccgtggaaggatcatttctggctgtcaag aggctcagagaatctttgctagcaagagcatgttctctcttagaaaaagacagaaatttt accagtgaggagagaaagtggaatagacaaaatccccagaggaatctacagagaagtaat aactctttggcatcagtcaggaccttagtacctgagactgctagaagtggagaaatgctt gtgcttgacacagatgtttttctttacctgaaacacaagtgtggatcttatgaaagcaca ctgaaaaaattccacattctgagtcaggagaaagtggatggtgaaatcaccacaatttgt ctaaaaagcattcaagttggttctcagccaaacaatgcaaaacatgtaaaagagctcatt gaggaatggtcacatgctctttacttaaagcttagaaaagagacatttattttggaagga aaggaaaatagagagaaaagaatgatcaaaagggcatgtgaacaattaagttcgagatac cttgaagtcctgattaacctttataggacacacattgacattataggatcttcttctgac acttacctgtttaaaaaaggggtcatgaaattaatagggcaaaagatccaggagataatc aactga >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_7|125_aa MATVQARTCTAKPKAKATCHGGGETRPQRPQKAHNQRNAGDGERSEQAGFGFVFCSSSLG GYEEKGGPSTRSGCSATSWPLSGSSDSGVLLPAPAQTPSVVLQLVEALWREAERIPRQEI WEFSG >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_7|378_bp atggcgacggtacaggcacggacctgcaccgcaaaacccaaagcaaaagcaacttgccat ggcggaggggagacgcgccctcagcggccgcagaaagcccacaaccagcggaacgcaggc gatggggagaggagcgagcaggcaggttttggtttcgttttttgttccagctcccttgga ggctacgaagaaaagggcggtccttccacccgatccggctgttctgcgacctcgtggcct ctgagtgggagctcggactcaggagtgctgttgccagcgcctgcccagacgccctccgta gttttgcaacttgtggaagcactctggagagaggccgagaggattcctcgacaggaaatt tgggaattctctgggtga >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_8|307_aa MEADKDDTQQILKEHSPDEFIKDEQNKGLIDEITKKNIQLKKEIQKLETELQEATKEFQI KEDIPETKMKFLSVETPENDSQLSNISCSFQVSSKVPYEIQKGQALITFEKEEVAQNVVS MSKHHVQIKDVNLEVTAKPVPLNSGVRFQVYVEVSKMKINVTEIPDTLREDQMRDKLELS FSKSRNGGGEVDRVDYDRQSGSAVITFVEIGVADKILKKKEYPLYINQTCHRVTVSPYTE IHLKKYQIFSGTSKRTVLLTGMEGIQMDEEIVEDLINIHFQRAKNGGGEVDVVKCSLGQP HIAYFEE >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_8|924_bp atggaagctgataaagatgacacacaacaaattcttaaggagcattcgccagatgaattt ataaaagatgaacaaaataagggactaattgatgaaattacaaagaaaaatattcaacta aagaaggagatccaaaagcttgaaacggagttacaagaggctaccaaagaattccagatt aaagaggatattcctgaaacaaagatgaaattcttatcagttgaaactcctgagaatgac agccagttgtcaaatatctcctgttcgtttcaagtgagctcgaaagttccttatgagata caaaaaggacaagcacttatcacctttgaaaaagaagaagttgctcaaaatgtggtaagc atgagtaaacatcatgtacagataaaagatgtaaatctggaggttacggccaagccagtt ccattaaattcaggagtcagattccaggtttatgtagaagtttctaaaatgaaaatcaat gttactgaaattcctgacacattgcgtgaagatcaaatgagagacaaactagagctgagc ttttcaaagtcccgaaatggaggcggagaggtggaccgcgtggactatgacagacagtcc gggagtgcagtcatcacgtttgtggagattggagtggctgacaagattttgaaaaagaaa gaataccctctttatataaatcaaacctgccatagagttactgtttctccatacacagaa atacacttgaaaaagtatcagatattttcaggaacatctaagaggacagtgcttctgaca ggaatggaaggcattcaaatggatgaagaaattgtggaggatttaattaacattcacttt caacgggcaaagaatggaggtggagaagtagatgtggtcaagtgttctctaggtcaacct cacatagcatactttgaagaatag >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_9|226_aa MPEPPPRLRGFLWGPSLPDERRPLLHAPSPMDHPRAEECGRTAPDWQAAPPAAPVQDPAG LLSLKVCSFTPEASETTNPPGGTNNSRCAALRAVTLTAKVCSFTPEPVRPRTHQKEETPN TSEHQKEQTQDMPPLRAVTLTARVRSFVLEVTNLQHGLKTKPPEGGSWGKEVRKEEDEKR GEEGRRRKKKKKEEEEEEEEEEEKGGGGGEGGGEGEGEGEGEAEDS >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_9|681_bp atgcctgagcctcccccgcgcctccgtgggttcctgtggggcccgagcctccccgacgag cgccgccccctgctccacgcacccagtcccatggaccacccaagggctgaggagtgcggg cgcacggcaccggactggcaggcagctccacctgcggcccccgtgcaggatccagctggg ctcctgagtctgaaggtctgcagcttcactcctgaagccagtgagaccacgaacccaccg ggaggaacgaacaactccagatgcgccgccttaagagctgtaacgctcaccgcaaaggtc tgcagcttcactcctgagccagtgagaccacgaacccaccagaaggaagaaactccaaac acatccgaacatcagaaggaacaaactcaggacatgccgcctttaagagctgtaacactc actgcgagggtccgcagcttcgttcttgaagtcactaatttacaacatggcctcaaaacc aagccaccggaaggaggtagttggggtaaggaggttagaaaagaggaagatgaaaagaga ggagaggaagggagaagaagaaagaagaagaaaaaagaagaagaggaagaagaggaggaa gaagaagaaaaaggaggaggaggaggggaagggggaggggaaggggaaggggaaggagaa ggggaagcagaagattcctga >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_10|430_aa MPNLAYAAPQYLPLPTFPLTAAPRQAKPEASGTIVDAELLVTLTVEGKSVPFLINMEATH STLPYFQGPVSLASKTVLYSLFVESPTITIVPGLDFNLAFHIILDTTPDPHDCISLIHLT FTPFLLISFFRVRYPDHTWLIDGSSIKPNHHSPAKAGYAIVSSTSIIEATALPPSTTSQQ VKLIALIRVLTLVKGLLINIYADPISCTTMLLYELKVFLTTQGSSIINASLIKTLLKAAL LPKTARIIHCKGHQKASDPITQDNAYAHKESLGRAARGTGKRPQGKGNLQLNCVTIPTEC KISWPELRGGSDSGVQTPKAGRHESPACFCSWETGSLQQVLSPARPLLETDLVLLGVGGK TIQMRRNQKNNSGNMTKQGSLTAPKNDTSSPAMNPNQEEIYDLPKKEFRKSVIKLIKEAP EKDEVQVKGI >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_10|1293_bp atgcccaacctggcttacgcagccccacagtacctgccactgcctacctttcccctcaca gctgctcctcgccaggccaagccagaagcctccgggaccatcgtggatgccgagcttctg gtaactcttacagtggagggtaagtccgtccccttcttaatcaacatggaggctactcac tccacattaccttattttcaagggcctgtttccctggcctccaaaactgttctatactca ctctttgttgagtctcccacaattaccattgttcctggcctggacttcaatctggccttc cacattattctggataccacacctgacccccatgactgtatctctctgatccacctgaca ttcactccatttctccttatttccttctttcgtgttcgttaccctgatcacacttggctt attgatggcagttccatcaagcctaatcaccactcaccagcaaaggcaggctatgctata gtatcttccacatctatcattgaggctactgctctgcccccctccactacctctcagcaa gtcaaactcattgccttaattcgggtcctcactcttgtaaagggactactcatcaatatt tatgctgaccccatatcctgcaccaccatgctgctttatgagctgaaagttttcctcact acacaagggtcctccatcattaatgcctctttaataaaaactcttctcaaggctgcttta cttccaaagacagctagaattattcactgcaaaggccatcaaaaggcatcagatcccatc actcaggacaatgcttatgctcataaggagtccttggggagggctgccagaggaactggg aaaagaccacagggaaaaggaaacctccagctgaactgtgtaacaattccaacagaatgc aaaatctcctggccagaactccggggagggagtgattctggtgtgcagactccaaaagca ggcagacacgaaagccctgcttgcttttgcagctgggagactggtagcctgcagcaagtt ctcagccctgctcgcccactgctggaaacagacttggtgctgttgggagtcgggggaaag actatccaaatgagaaggaaccagaaaaacaattctggtaatatgacaaagcaaggctct ttaacagccccaaaaaatgacactagctcaccagcaatgaatccaaaccaagaagaaatc tatgatttgcctaaaaaagaattcagaaaatcagtcattaaactaatcaaggaggcacca gagaaagatgaagtccaagttaagggaatttaa >gi568815596r:151150909_151355744|GENSCAN_predicted_peptide_11|147_aa MTTELSSYGYLGSENSALFNRVCTSYCEEGVESAALLGCDNSSSTGNTSFSSLLRLESSH LTAQNGKGQGNRVCDCAQEKENLCGNLPTISATILSLKVETTGHGSGLDEIEVKDLGDVL DFSPSLHPTCCTERRDDEWKKMQRSCP >gi568815596r:151150909_151355744|GENSCAN_predicted_CDS_11|444_bp atgacaactgaattaagttcatatgggtatttgggatccgagaattctgctttgttcaat agagtctgcaccagttactgtgaagaaggagtagagtctgctgccctcttgggatgtgac aatagctcatctactggaaataccagtttctcttcccttctgaggctagaatctagccac ttgactgcacaaaatggcaaagggcagggaaatagagtctgtgactgtgcccaggaaaaa gagaacctttgtggtaatcttccaacaatctctgctaccattttgtctttgaaggtagag accacaggacatggaagtggattggatgagattgaggtgaaagatctaggagatgtgctt gatttctctccttcccttcatcctacctgttgtactgaaagaagggatgatgagtggaag aagatgcaaagatcttgtccatag