GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:21:24 Sequence gi568815595f:148641036_148842112 : 201077 bp : 37.22% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9867 10107 241 2 1 46 80 182 0.918 11.28 1.02 Intr + 29798 30000 203 0 2 70 76 62 0.070 1.28 1.03 Intr + 40384 40459 76 0 1 50 69 101 0.160 2.57 1.04 Term + 42034 42281 248 2 2 64 48 102 0.151 -1.23 1.05 PlyA + 43091 43096 6 1.05 2.04 PlyA - 43247 43242 6 1.05 2.03 Term - 51557 51055 503 0 2 38 37 321 0.218 15.66 2.02 Intr - 56713 56613 101 0 2 29 44 101 0.125 -1.37 2.01 Init - 57118 56838 281 1 2 78 45 195 0.222 8.73 2.00 Prom - 64913 64874 40 -7.35 3.00 Prom + 66749 66788 40 -5.55 3.01 Init + 72672 72729 58 2 1 43 74 83 0.656 3.92 3.02 Intr + 96498 96563 66 1 0 74 78 56 0.457 1.16 3.03 Intr + 99954 101032 1079 1 2 92 75 380 0.570 26.05 3.04 Intr + 105949 106057 109 0 1 85 79 101 0.576 7.84 3.05 Term + 107402 107553 152 0 2 32 42 191 0.577 5.99 3.06 PlyA + 108844 108849 6 1.05 4.05 PlyA - 108965 108960 6 1.05 4.04 Term - 110923 110745 179 2 2 60 49 119 0.153 2.07 4.03 Intr - 115029 114969 61 0 1 93 76 34 0.727 0.09 4.02 Intr - 116372 116260 113 0 2 54 72 137 0.979 7.88 4.01 Init - 119084 119030 55 2 1 59 97 16 0.977 1.10 4.00 Prom - 123336 123297 40 -7.95 5.04 PlyA - 124920 124915 6 1.05 5.03 Term - 125581 125250 332 2 2 89 49 256 0.958 15.63 5.02 Intr - 142422 141769 654 1 0 56 80 260 0.310 12.91 5.01 Init - 144432 144222 211 0 1 81 0 220 0.231 11.49 5.00 Prom - 153083 153044 40 -4.65 6.00 Prom + 161327 161366 40 -3.65 6.01 Init + 168059 168131 73 1 1 61 97 18 0.247 1.28 6.02 Term + 169813 170048 236 2 2 6 48 235 0.722 6.80 6.03 PlyA + 173826 173831 6 1.05 7.00 Prom + 182543 182582 40 -5.35 7.01 Init + 186789 186859 71 2 2 64 110 79 0.998 8.35 7.02 Intr + 186967 187042 76 1 1 58 111 87 0.933 6.60 7.03 Intr + 193463 193587 125 1 2 55 108 78 0.905 4.96 7.04 Intr + 198540 198726 187 0 1 45 31 100 0.503 -1.23 7.05 Intr + 199481 199750 270 1 0 42 72 207 0.770 11.32 7.06 Intr + 200788 200889 102 0 0 83 78 32 0.313 1.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:148641036_148842112|GENSCAN_predicted_peptide_1|255_aa MWVTHLTSYGERVGQKKKNVWALKSDAVSANALTQASAQWAKAEADLLLQVQRAQKLGSY KEVNGNISQLDDKHIHSAPPGFGIKFSGKLLSSYLMLELKVYRARDQEGTKSIKCGRERI SCNSHGQTGTHKDRLELNQALTTSDFSDGDSSPAEETNIKQRIIAFSIYPYGKERHAYHL FMREEPSAWIRNHAAQSAEAGVCLGQHGHSQGGQVPKKCWPSGQSAIVTSIVGKDSQHSC VTNLFDSKEGSSPQN >gi568815595f:148641036_148842112|GENSCAN_predicted_CDS_1|768_bp atgtgggttacacatttgacttcctatggggaaagggttggtcagaagaagaaaaatgtt tgggctctgaagtctgatgcggtttcagcaaatgctctcacccaggcctctgcccagtgg gctaaagctgaggctgacttactactgcaagtccaacgtgcccagaaacttggatcatat aaggaagttaatgggaacatcagtcaactggatgataaacatatacattcagcaccacca gggtttggtataaaattttctggaaagctgttatcttcatatctgatgttggagcttaaa gtctacagggcaagagatcaagaagggacaaagagtataaaatgtggaagagaaaggata agctgtaactcacacggtcaaactggaacccacaaggacagactggaactcaatcaggct cttactacttctgacttcagtgatggagattccagtcctgcagaagagaccaatattaaa caaagaatcattgcattttccatctacccgtatggaaaagaaagacatgcctatcactta tttatgagagaggaaccttctgcttggatcaggaatcatgcagctcaatcagcagaggca ggagtgtgccttggccaacacgggcattctcaaggtggccaggtgccaaagaagtgttgg ccaagtggacagtctgctatcgtgacctctatagtgggtaaagacagccagcatagttgt gtgactaatttgtttgacagtaaagaagggtcctccccgcaaaactga >gi568815595f:148641036_148842112|GENSCAN_predicted_peptide_2|294_aa MPGGPRGLTRAPGAASRPAEGRLRWCGSPARRLGVPAVRPHSLGPLQPLPIDHRRGPAEL LAARRSRRPVTRCCLGSWLSGAGIIGEEAAPNLYKDDATVVPCQLSDQRSPGPEPRLARG REGERAGEIQTTIREYYKHLYANKLENLEEMYKFLDTYTLPRLNQEEIDSLNRRITGSEI VAIINSLPTKKSPGPDGFTAEFNQRYKEELVPFLLKLLQSIEKEGTLRNSFYEASIILIP KPGRDSTKKENFRPISLMSIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQG >gi568815595f:148641036_148842112|GENSCAN_predicted_CDS_2|885_bp atgcctggcggtccgcgcggactcacccgcgccccgggcgctgcgtcacgtcccgccgag ggccggctgcgctggtgcgggtctcccgcccgccgcctcggcgtcccggctgtgcgccct cactcgctgggtccgctccagccgctccccatcgatcaccgccgcggcccggcagagctg ctagcggcgcgccggtccagacgtcctgtcactcgctgctgcctggggtcctggctgtca ggcgctggaatcattggcgaggaggcagctccgaatttatacaaggatgatgccacagtc gtcccgtgtcaactatcagatcagcgttcacctggtcccgaaccccgcctagcgaggggt cgtgagggagagcgagctggggaaatacaaactaccatcagagaatactacaaacacctc tatgcaaataaactagaaaatctggaagaaatgtataaattcctcgacacatacaccctc ccaagactaaaccaggaagaaattgactctctgaatagacgaataacaggctctgaaatt gtggcaataatcaatagcttaccaacaaaaaagagtccaggaccagatggattcacagcc gaattcaaccagaggtacaaggaggaactggtaccattccttctgaaactactccaatca atagaaaaagagggaaccctccgtaactcattttatgaggccagcatcatcctgatacca aagccaggcagagactcaaccaaaaaagagaattttagaccaatatccttgatgagcatt gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctga >gi568815595f:148641036_148842112|GENSCAN_predicted_peptide_3|487_aa MSLWSYWPQLAAKKGFEATDLNCVEEENCDFIPLASWTPTHGVFDIVFATNSTQVIKMIL NSSTEDGIKRIQDDCPKAGRHNYIFVMIPTLYSIIFVVGIFGNSLVVIVIYFYMKLKTVA SVFLLNLALADLCFLLTLPLWAVYTAMEYRWPFGNYLCKIASASVSFNLYASVFLLTCLS IDRYLAIVHPMKSRLRRTMLVAKVTCIIIWLLAGLASLPAIIHRNVFFIENTNITVCAFH YESQNSTLPIGLGLTKNILGFLFPFLIILTSYTLIWKALKKAYEIQKNKPRNDDIFKIIM AIVLFFFFSWIPHQIFTFLDVLIQLGIIRDCRIADIVDTAMPITICIAYFNNCLNPLFYG FLGKKFKRYFLQLLKYIPPKAKSHSNLSTKMSTLSYRPSDNATVNWSRLEVDQSQFFMRI FQTGADEELAPVLAFLRHRKQLVLLNRDKPPRAHHREESIMSADATADLLRISQRHVTQE PTDADSS >gi568815595f:148641036_148842112|GENSCAN_predicted_CDS_3|1464_bp atgtccttgtggtcctattggccacaactggcagccaaaaagggctttgaagccacagac ttaaactgtgtagaggaggagaactgtgactttattccccttgcatcctggacacccaca catggtgtatttgatatagtgtttgcaacaaattcgacccaggtgatcaaaatgattctc aactcttctactgaagatggtattaaaagaatccaagatgattgtcccaaagctggaagg cataattacatatttgtcatgattcctactttatacagtatcatctttgtggtgggaata tttggaaacagcttggtggtgatagtcatttacttttatatgaagctgaagactgtggcc agtgtttttcttttgaatttagcactggctgacttatgctttttactgactttgccacta tgggctgtctacacagctatggaataccgctggccctttggcaattacctatgtaagatt gcttcagccagcgtcagtttcaacctgtacgctagtgtgtttctactcacgtgtctcagc attgatcgatacctggctattgttcacccaatgaagtcccgccttcgacgcacaatgctt gtagccaaagtcacctgcatcatcatttggctgctggcaggcttggccagtttgccagct ataatccatcgaaatgtatttttcattgagaacaccaatattacagtttgtgctttccat tatgagtcccaaaattcaaccctcccgatagggctgggcctgaccaaaaatatactgggt ttcctgtttccttttctgatcattcttacaagttatactcttatttggaaggccctaaag aaggcttatgaaattcagaagaacaaaccaagaaatgatgatatttttaagataattatg gcaattgtgcttttctttttcttttcctggattccccaccaaatattcacttttctggat gtattgattcaactaggcatcatacgtgactgtagaattgcagatattgtggacacggcc atgcctatcaccatttgtatagcttattttaacaattgcctgaatcctcttttttatggc tttctggggaaaaaatttaaaagatattttctccagcttctaaaatatattcccccaaaa gccaaatcccactcaaacctttcaacaaaaatgagcacgctttcctaccgcccctcagat aatgccacagttaattggtccaggcttgaagtggaccaatcacagtttttcatgaggatt tttcaaacaggcgctgatgaagagctggccccagtgttggcatttctgagacatagaaag cagctggtgttactcaacagagataagcctcctagagctcaccacagagaggagagcatc atgtctgcagatgccacagcagatctgctgagaatcagccagcgccatgtgacacaagaa cccacagatgcggattcgtcctga >gi568815595f:148641036_148842112|GENSCAN_predicted_peptide_4|135_aa MYGGYVCVEEGQQWEVVFVWPTNQQQENLLAACKKYTLLGSVLDLLAESAASQDSQCWYS PTSGKQETEEVQNKGRLLDTLTSPSFIELVSCSWVLRRTFREQPTKNQAEIKNDAYKRSL CDSMSLSTVQREKFP >gi568815595f:148641036_148842112|GENSCAN_predicted_CDS_4|408_bp atgtatggaggttatgtgtgtgtggaggaggggcagcagtgggaggtggtttttgtgtgg cccacgaatcagcaacaagagaatctcctggcagcttgtaagaaatacacactcttaggc tctgtcctagacctactagcggaatcagcagcctcacaagattcccagtgttggtactca cctacatcaggtaaacaggagacagaagaggtacaaaacaaggggagattgctggatacc cttacatcaccaagcttcattgaattggtgtcatgctcttgggttttaagaagaacattc cgagaacaaccaactaagaatcaggctgaaataaaaaatgatgcttacaaacgatccctc tgtgactctatgtctctatccacagtccaacgtgagaaatttccttag >gi568815595f:148641036_148842112|GENSCAN_predicted_peptide_5|398_aa MAQELRDTCTSFSSQFNQVEERVLVIEDQMNEMKREEKFGEKRVKRNEQSLQEIWNYVKR PNLRLIGVTEKILITIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLDRPITG SEIEAIINSLPTKKSPGPDRFTAEFYQRYNEELVAFLLKLFQSTEKEGILPNSFYEASII LIPKPGRDTTIKENFRLISLMNMDTKILNKILANRIQQHIKKLIHHDQVGFIPGVQDWFN ICKSINIIHHINRTKDKNHMIISIDAEKAFDKMQQPFMLKTLNKLGTDALITYIRITSVE KSLNDLKELKTMARELPDTCISFRSQFDQLEERVSVIEDQMNEMKREEKFREKRVKRNKQ SLQEIWSYVKRPNLRLIGVPESDGENGTKLENTLQDII >gi568815595f:148641036_148842112|GENSCAN_predicted_CDS_5|1197_bp atggcacaagaactacgtgacacatgcacaagcttcagtagccaattcaatcaagtggaa gaaagggtattagtgattgaagatcaaatgaatgaaatgaagcgagaagagaaatttgga gaaaaaagagtaaaaagaaacgaacaaagcctccaagaaatatggaactatgtgaaaaga ccaaatctacgtctgattggcgtaactgaaaaaatactaattaccatcagagaatactat aaacacctctatgcaaataaactagaaaatctggaagaaatggataaattcctcgacaca tacaccctcccaagactaaaccaggaagaagttgaatcgctggatagaccaataacaggt tctgaaattgaggcaataattaatagcctaccaaccaaaaaaagtccaggaccagataga ttcacagctgaattctaccagaggtacaacgaggagctggtagcattccttctgaaacta ttccaatcaacagaaaaagagggaatcctccctaactcattttatgaggccagcatcatc ctgataccaaagcctggcagagacacaacaataaaagagaattttagactaatatccctg atgaacatggacacaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccacgatcaagttggcttcatccctggggtgcaagactggttcaat atatgcaaatcaataaacataatccatcatataaacagaaccaaagacaaaaatcacatg attatctcaatagatgcagaaaaggcctttgacaaaatgcaacagcccttcatgctaaaa actctcaataaactaggtactgatgcacttatcacttatattagaataaccagtgtagag aagtccttaaatgacctgaaggagctgaaaaccatggcacgagaactacctgacacatgc ataagcttcaggagccaatttgatcaactggaagaaagggtatcagtgattgaagatcaa atgaatgaaatgaagcgagaagagaagtttagagaaaaaagagtaaaaagaaacaaacaa agcctccaagaaatatggagctatgtgaaaagaccaaatctacgtctgattggtgtacct gaaagtgatggggagaatggaaccaagttggaaaacactctgcaggatattatctag >gi568815595f:148641036_148842112|GENSCAN_predicted_peptide_6|102_aa MTKEKDWDSLGTATWERQPTKKWLRWGIAEKIPQNVKATLELGNRQRGWNSLEGSEEDRK MWKSLEPPRDLLNGFDKHADSDMNNKVQAEVVSDGDEELVRN >gi568815595f:148641036_148842112|GENSCAN_predicted_CDS_6|309_bp atgacaaaggaaaaggactgggactctctagggacagcaacttgggaaagacaaccaaca aaaaaatggttaaggtggggcattgccgaaaagataccccaaaatgtgaaagcgactttg gaactgggtaacaggcagagaggttggaacagtttggagggctcagaagaagataggaaa atgtggaagagtttggaacctcctagagacttgttgaatggctttgacaaacatgctgat agtgatatgaacaataaggttcaggctgaggtggtctcagatggagatgaggaacttgtt aggaactga >gi568815595f:148641036_148842112|GENSCAN_predicted_peptide_7|277_aa MLALLVLVTVALASAHHGGEHFEGEKVFRVNVEDENHINIIRELASTTQIDFWKPDSVTQ IKPHSTVDFRVKAEDTVTVENVLKQNELQYKKQIPVSINLLHFPKTDTSSYPSSPIEGVT RRKVQRMSLTYFIRMSLGQPEEKQTFDVCLKFSNLSGAAEGHQEEPEDNMRIYGECNVPS TKAVALGFYVCSLEKNTLMFIGTPTLVRCRVLISNLRNVVEAQFDSRVRATGHSYEKYNK WETVGKAGQNKPAIFMDCGFHAREWISPAFCQWFVRE >gi568815595f:148641036_148842112|GENSCAN_predicted_CDS_7|831_bp atgttggcactcttggttctggtgactgtggccctggcatctgctcatcatggtggtgag cactttgaaggcgagaaggtgttccgtgttaacgttgaagatgaaaatcacattaacata atccgcgagttggccagcacgacccagattgacttctggaagccagattctgtcacacaa atcaaacctcacagtacagttgacttccgtgttaaagcagaagatactgtcactgtggag aatgttctaaagcagaatgaactacaatacaagaaacagattccagttagtattaattta ttacactttccaaaaacagacacaagttcgtacccaagttcaccaattgaaggagtgacc aggaggaaggttcagaggatgtccctgacatacttcatcagaatgtccttggggcagcca gaggagaagcagacttttgatgtgtgtctgaaattctctaatctctctggagcagctgaa ggtcatcaggaagaaccagaggacaacatgagaatttatggagagtgcaatgtgccatct actaaagcagtggctctggggttttatgtgtgttcactggagaaaaatacacttatgttc ataggaacacccactttggttcgttgcagggtactgataagcaacctgagaaatgtggtg gaggctcagtttgatagccgggttcgtgcaacaggacacagttatgagaagtacaacaag tgggaaacggttggcaaagctggacaaaataagcctgccattttcatggactgtggtttc catgccagagagtggatttctcctgcattctgccagtggtttgtaagagag