GENSCAN 1.0 Date run: 8-Nov-116 Time: 16:35:50 Sequence gi568815596r:9390180_9655605 : 265426 bp : 43.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3303 3468 166 1 1 149 86 68 0.802 11.72 1.02 Intr + 9122 9252 131 2 2 66 38 85 0.466 1.64 1.03 Intr + 10563 10651 89 1 2 95 92 19 0.984 2.69 1.04 Intr + 11095 11217 123 0 0 90 92 239 0.999 25.28 1.05 Term + 13074 13148 75 2 0 110 38 78 0.991 2.84 1.06 PlyA + 13473 13478 6 1.05 2.08 PlyA - 14540 14535 6 1.05 2.07 Term - 16726 16655 72 1 0 60 41 34 0.362 -6.19 2.06 Intr - 17419 17270 150 1 0 85 85 72 0.711 6.96 2.05 Intr - 22226 22090 137 0 2 113 98 96 0.999 13.39 2.04 Intr - 24077 23999 79 2 1 89 88 31 0.570 2.32 2.03 Intr - 29921 29811 111 2 0 100 66 51 0.474 4.68 2.02 Intr - 33642 33569 74 1 2 98 60 49 0.115 2.23 2.01 Init - 34505 34499 7 2 1 76 64 0 0.102 -2.53 2.00 Prom - 40673 40634 40 -5.66 3.00 Prom + 41176 41215 40 -7.26 3.01 Init + 42351 42509 159 2 0 85 75 111 0.769 9.32 3.02 Intr + 43692 43781 90 2 0 87 87 43 0.896 4.29 3.03 Intr + 46032 46182 151 2 1 48 57 30 0.711 -4.36 3.04 Intr + 50312 50487 176 0 2 54 113 153 0.740 14.06 3.05 Intr + 51639 51797 159 2 0 101 86 41 0.976 5.38 3.06 Intr + 53336 53482 147 1 0 88 37 50 0.537 0.33 3.07 Intr + 58019 58171 153 1 0 41 93 148 0.952 10.87 3.08 Intr + 62734 62842 109 0 1 99 100 1 0.996 2.26 3.09 Intr + 65480 65578 99 0 0 92 89 78 0.988 8.38 3.10 Intr + 66754 66848 95 2 2 76 63 39 0.810 -0.12 3.11 Intr + 69352 69439 88 0 1 94 116 29 0.855 5.84 3.12 Intr + 77528 77597 70 0 1 129 100 5 0.967 4.04 3.13 Intr + 81164 81260 97 2 1 104 79 84 0.998 9.11 3.14 Term + 82737 82838 102 2 0 87 53 178 0.999 12.48 3.15 PlyA + 82895 82900 6 1.05 4.00 Prom + 83828 83867 40 -11.92 4.01 Init + 84388 84468 81 0 0 73 96 153 0.911 13.59 4.02 Intr + 86660 86691 32 1 2 83 92 13 0.720 -1.87 4.03 Intr + 88043 88191 149 2 2 76 93 55 0.974 4.68 4.04 Intr + 91107 91268 162 1 0 16 92 161 0.943 9.25 4.05 Intr + 94253 94371 119 0 2 105 96 137 0.997 16.38 4.06 Term + 95964 96062 99 2 0 69 47 90 0.915 1.13 4.07 PlyA + 98259 98264 6 1.05 5.17 PlyA - 98328 98323 6 1.05 5.16 Term - 100339 99998 342 1 0 84 28 350 0.877 23.11 5.15 Intr - 100972 100922 51 1 0 116 84 8 0.750 2.30 5.14 Intr - 102807 102719 89 1 2 56 103 10 0.864 -1.01 5.13 Intr - 103646 103568 79 2 1 106 74 74 0.947 6.92 5.12 Intr - 104588 104458 131 0 2 89 82 73 0.996 7.21 5.11 Intr - 107069 106935 135 0 0 104 89 127 0.960 14.94 5.10 Intr - 112097 111994 104 1 2 100 79 64 0.920 6.52 5.09 Intr - 115186 114987 200 1 2 72 94 225 0.974 19.65 5.08 Intr - 119952 119800 153 0 0 27 95 105 0.794 5.37 5.07 Intr - 127810 127722 89 2 2 69 95 20 0.572 0.49 5.06 Intr - 128068 127924 145 1 1 55 110 20 0.692 0.76 5.05 Intr - 131137 131024 114 1 0 1 92 122 0.866 4.54 5.04 Intr - 133159 133070 90 1 0 51 68 105 0.936 4.99 5.03 Intr - 136085 135932 154 1 1 42 83 115 0.486 6.47 5.02 Intr - 153106 152974 133 2 1 70 86 13 0.080 -1.00 5.01 Init - 165426 165330 97 0 1 94 28 135 0.277 6.61 5.00 Prom - 174144 174105 40 -4.96 6.10 PlyA - 175640 175635 6 1.05 6.09 Term - 195166 195107 60 1 0 92 35 56 0.600 -1.50 6.08 Intr - 197330 197235 96 2 0 32 79 135 0.989 7.21 6.07 Intr - 198149 197986 164 0 2 91 107 47 0.996 6.59 6.06 Intr - 201336 201213 124 0 1 82 111 104 0.814 12.26 6.05 Intr - 230582 230463 120 2 0 106 84 46 0.857 6.69 6.04 Intr - 240355 239980 376 0 1 100 105 724 0.281 70.32 6.03 Intr - 243465 243449 17 0 2 119 93 4 0.024 -1.66 6.02 Intr - 249056 248955 102 2 0 91 53 52 0.027 2.37 6.01 Intr - 259209 259004 206 1 2 100 64 59 0.045 3.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:9390180_9655605|GENSCAN_predicted_peptide_1|194_aa XPLTPTPPPPVAKTPSVMEALSQPSKPAPPGISQIRPPPLPPQPPSRLPQKKPAPGSRRS TGELKPLALVLSSGVDGSCVTPFADENADSPEPIYPGLPVDLSATEALGPLSNAMVLQPP APMPRKSQATKLKPKRVKALYNCVADNPDELTFSEGDVIIVDGEEDQEWWIGHIDGDPGR KGAFPVSFVHFIAD >gi568815596r:9390180_9655605|GENSCAN_predicted_CDS_1|585_bp natcccctgacccccacgccgcccccacccgttgccaagacgcccagcgtaatggaagcc ttgagccagccgagcaagcctgccccgcctgggatctcacagatcaggcccccacctctg cccccacagccgcccagccgcctcccgcagaagaagcctgcgccggggtccaggagatcc accggggagctgaagccactggcactggtccttagctctggagttgacggatcttgtgtc acaccttttgcagatgaaaatgctgactctccagaacccatttatccagggctgccagtg gatctctctgcaacggaagctctgggtcctctgtccaatgctatggtcctgcagccccct gcacccatgcctaggaagtcgcaggcaaccaagttgaagcctaagcgggtgaaagcgctc tataactgtgtggctgacaaccccgatgagctcaccttctccgagggggatgtgatcatc gtggacggggaggaggaccaggagtggtggattggccacattgatggagatcctggtcgc aaaggcgcattcccggtgtcatttgtgcactttatcgctgactga >gi568815596r:9390180_9655605|GENSCAN_predicted_peptide_2|209_aa MTGVGSAAGRSPQQESQTSRKCEGEAGVRSFSEKHHPGQNERKLSVQTAVCGTFEDSQIV FNSISVDSSLGGLSRSSTVASLDTDSTKSSGQSNNNSDTCAEFRIKYVGAIEKLKLSEGK GLEGPLDLINYIDVAQDVLHRHALYLIIRMVCYDDGLGAGKSLLALKTTDASNEEYSLWV YQCNSLEQAQAICKVLSTAFDSVLTSEKP >gi568815596r:9390180_9655605|GENSCAN_predicted_CDS_2|630_bp atgacaggggtcggatcagcagctggtcgctctcctcagcaggaatcgcagacatcccga aagtgtgagggcgaagcgggggtgcgtagtttctctgaaaagcatcacccaggtcagaat gaaaggaaactctctgtgcagaccgctgtatgtgggacctttgaagacagtcaaatagtg ttcaattctatatctgtggattctagccttgggggtctttcacgatccagcactgtggcc agcctcgacacagattccaccaaaagctcaggacaaagcaacaataattcagatacctgt gcagaatttcgaataaaatatgttggtgccattgagaaactgaaactctccgagggaaaa ggccttgaagggccattagacctgataaattatatagacgttgcccaggatgttttgcac aggcatgctctctacttaataatccggatggtgtgttacgatgacggtctgggggcggga aaaagcttactggctctgaagaccacagatgcaagcaatgaggaatacagcctgtgggtt tatcagtgcaacagcctggaacaagcacaagccatttgcaaggttttatccaccgctttt gactctgtattaacatctgagaaaccctga >gi568815596r:9390180_9655605|GENSCAN_predicted_peptide_3|564_aa MLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDF SRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPV FALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ NPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVE GTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGE QNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQR VSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLCYQLQKLTGDVEEL EIQEKPALKVFKNITVIQEPGMVVLEWLANPSNDMYADTVTTVILEVQSNPKIRKGAVQK VSKKLEMHVYSKRLEIMLQDIFGEDCVSVKDDSILSVTVDGKTANLNLETRTVECEEGSE DDESLREMVELAAQRLYEALTPVH >gi568815596r:9390180_9655605|GENSCAN_predicted_CDS_3|1695_bp atgctgtataccgagacagatttggaagaaagcatggacaaaattgaaactatcaacttt catgaagttaaggaagttgcgggaatcaagttttggtgttaccatgcaggtcacgtccta ggagccgccatgttcatgattgagatcgcaggcgtgaagcttttgtacactggtgatttc tcaagacaagaagataggcacttaatggcagctgaaattcctaatattaagcctgatatt cttatcattgaatctacttatgggacccatatccatgagaaacgtgaagagcgagaagca agattctgtaacactgtccacgatattgtaaacagaggaggcaggggtctcattcctgtc tttgctcttggaagggctcaggagctgctcttgattctagatgagtactggcagaatcac ccagaactacatgacattccaatatactatgcatcatctttggccaagaagtgtatggca gtgtaccagacatatgtaaatgccatgaatgacaaaatccgcaaacagatcaacatcaat aatccctttgttttcaaacacattagtaacctcaagagcatggatcattttgatgacatt ggtcccagtgttgtaatggcctccccaggcatgatgcaaagtggcttatccagagaatta tttgaaagctggtgtactgataagaggaatggtgtcattatagcgggatactgtgtagaa gggacacttgccaagcacatcatgtctgaacctgaagaaatcactactatgtctggacag aagttaccactgaaaatgtctgttgattacatttctttctcagctcacacggattaccag caaaccagtgaatttattcgtgctttgaaaccgcctcatgtgattttagtccatggagaa cagaatgaaatggccagattgaaagcagcactgattcgagaatatgaagataacgatgaa gttcacatagaggttcataatcctcggaatacagaagcagtgaccttaaacttcagagga gaaaaactagccaaggttatgggatttttagcagacaaaaaaccagaacaaggccagcgg gtctcaggaatacttgttaaaagaaactttaattatcacatactttctccttgcgacctg tccaattatactgacctggccatgagcacggtgaagcagacccaagccattccatatact ggtccctttaatttgctctgttaccagctgcagaaattgacaggtgatgtggaagaatta gaaattcaagaaaaacctgctctgaaagtgttcaaaaatattactgtaatacaagaacca ggcatggtggtattagaatggctggcaaacccttctaatgatatgtatgcagatacagta acaactgtgatattggaagttcagtcaaatcccaaaataagaaaaggtgcagtacagaag gtttctaaaaaattagaaatgcacgtttacagcaagaggttggagatcatgctccaggac atatttggagaagactgtgtaagtgtaaaggatgactctattcttagcgtcacagtggac gggaaaactgccaaccttaacttggagacacggactgtagaatgtgaagagggaagtgaa gacgatgaatccctccgagaaatggtggagctggctgcacagagactgtacgaggccctg acgccagttcactga >gi568815596r:9390180_9655605|GENSCAN_predicted_peptide_4|213_aa MALCEAAGCGSALLWPRLLLFGDSITQLLQATWMYTCRKCDVLNRGFSGYNTRWAKIILP RLIRKGNSLDIPVAVTIFFGANDSALKDENPKQHIPLEEYAANLKSMVQYLKSVDIPENR VILITPTPLCETAWEEQCIIQGCKLNRLNSVVGEYANACLQVAQDCGTDVLDLWTLMQDS QGKWSPVGKSYYHVDAESPVTFGGPESLSRSGP >gi568815596r:9390180_9655605|GENSCAN_predicted_CDS_4|642_bp atggcgctgtgcgaggccgcgggctgcgggagtgccctgctctggcctcgcttgttgctc ttcggggactccatcacccagttgctccaggccacctggatgtatacgtgcagaaaatgt gatgttctgaatcgtggattttcaggttacaataccaggtgggccaaaattatccttcca agattaatcaggaaaggaaacagtttggacatcccagtagcagttacaattttctttggg gccaatgacagtgcactaaaagatgagaatcccaagcagcacattcccctggaggagtac gctgcgaacctaaagagcatggtgcagtacctgaagtccgtggacatccctgagaatcga gtcattctcatcacgccgaccccactttgtgaaacagcctgggaagaacagtgcatcata caaggttgcaaactaaatcgcctgaactctgttgttggtgaatatgccaatgcgtgttta caagtggcccaagactgtgggactgacgtacttgacctgtggaccctgatgcaggacagc cagggaaaatggtcaccagtgggaaagtcatactaccacgtggatgccgagtcaccagtc acctttggaggccctgagagcctaagcagatctggaccctag >gi568815596r:9390180_9655605|GENSCAN_predicted_peptide_5|701_aa MRQSLLFLTSVVPFVLAPRPPDDPGFGPHQRLEKLDSLLSDYDILSLSNIQQHSVRKRDL QTSTHVETLLTFSALKSAYKFILELVHRVKRRADPDPMKNTCKLLVVADHRFYRYMGRGE ESTTTNYLIELIDRVDDIYRNTSWDNAGFKGYGIQIEQIRILKSPQEVKPGEKHYNMAKS YPNEEKDAWDVKMLLEQFSFDIAEEASKVCLAHLFTYQDFDMGTLGLAYVGSPRANSHGG VCPKAYYSPVGKKNIYLNSGLTSTKNYGKTILTKEADLVTTHELGHNFGAEHDPDGLAEC APNEDQGGKYVMYPIAVSGDHENNKMFSNCSKQSIYKTIESKAQECFQERSNKVCGNSRV DEGEECDPGIMYLNNDTCCNSDCTLKEGVQCSDRNSPCCKNCQFETAQKKCQEAINATCK GVSYCTGNSSECPPPGNAEDDTVCLDLGKCKDGKCIPFCEREQQLESCACNETDNSCKVC CRDLSGRCVPYVDAEQKNLFLRKGKPCTVGFCDMNGKCEKRVQDVIERFWDFIDQLSINT FGKFLADNIVGSVLVFSLIFWIPFSILVHCVDKKLDKQYESLSLFHPSNVEMLSSMDSAS VRIIKPFPAPQTPGRLQPAPVIPSAPAAPKLDHQRMDTIQEDPSTDSHMDEDGFEKDPFP NSSTAAKSFEDLTDHPVTRSEKAASFKLQRQNRVDSKETEC >gi568815596r:9390180_9655605|GENSCAN_predicted_CDS_5|2106_bp atgaggcagtctctcctattcctgaccagcgtggttcctttcgtgctggcgccgcgacct ccggatgacccgggcttcggcccccaccagagactcgagaagcttgattctttgctctca gactacgatattctctctttatctaatatccagcagcattcggtaagaaaaagagatcta cagacttcaacacatgtagaaacactactaactttttcagctttgaaaagtgcatacaaa ttcatattagagcttgttcatcgagtgaaaagaagagctgacccagatcccatgaagaac acgtgtaaattattggtggtagcagatcatcgcttctacagatacatgggcagaggggaa gagagtacaactacaaattacttaatagagctaattgacagagttgatgacatctatcgg aacacttcatgggataatgcaggttttaaaggctatggaatacagatagagcagattcgc attctcaagtctccacaagaggtaaaacctggtgaaaagcactacaacatggcaaaaagt tacccaaatgaagaaaaggatgcttgggatgtgaagatgttgctagagcaatttagcttt gatatagctgaggaagcatctaaagtttgcttggcacaccttttcacataccaagatttt gatatgggaactcttggattagcttatgttggctctcccagagcaaacagccatggaggt gtttgtccaaaggcttattatagcccagttgggaagaaaaatatctatttgaatagtggt ttgacgagcacaaagaattatggtaaaaccatccttacaaaggaagctgacctggttaca actcatgaattgggacataattttggagcagaacatgatccggatggtctagcagaatgt gccccgaatgaggaccagggagggaaatatgtcatgtatcccatagctgtgagtggcgat cacgagaacaataagatgttttcaaactgcagtaaacaatcaatctataagaccattgaa agtaaggcccaggagtgttttcaagaacgcagcaataaagtttgtgggaactcgagggtg gatgaaggagaagagtgtgatcctggcatcatgtatctgaacaacgacacctgctgcaac agcgactgcacgttgaaggaaggtgtccagtgcagtgacaggaacagtccttgctgtaaa aactgtcagtttgagactgcccagaagaagtgccaggaggcgattaatgctacttgcaaa ggcgtgtcctactgcacaggtaatagcagtgagtgcccgcctccaggaaatgctgaagat gacactgtttgcttggatcttggcaagtgtaaggatgggaaatgcatccctttctgcgag agggaacagcagctggagtcctgtgcatgtaatgaaactgacaactcctgcaaggtgtgc tgcagggacctttctggccgctgtgtgccctatgtcgatgctgaacaaaagaacttattt ttgaggaaaggaaagccctgtacagtaggattttgtgacatgaatggcaaatgtgagaaa cgagtacaggatgtaattgaacgattttgggatttcattgaccagctgagcatcaatact tttggaaagtttttagcagacaacatcgttgggtctgtcctggttttctccttgatattt tggattcctttcagcattcttgtccattgtgtggataagaaattggataaacagtatgaa tctctgtctctgtttcaccccagtaacgtcgaaatgctgagcagcatggattctgcatcg gttcgcattatcaaaccctttcctgcgccccagactccaggccgcctgcagcctgcccct gtgatcccttcggcgccagcagctccaaaactggaccaccagagaatggacaccatccag gaagaccccagcacagactcacatatggacgaggatgggtttgagaaggaccccttccca aatagcagcacagctgccaagtcatttgaggatctcacggaccatccggtcaccagaagt gaaaaggctgcctcctttaaactgcagcgtcagaatcgtgttgacagcaaagaaacagag tgctaa >gi568815596r:9390180_9655605|GENSCAN_predicted_peptide_6|421_aa XSSSLLLSPGEGEEGQSSPEPHTPSQVIANIYHVFAKCQDNSKHLKGISSFYICRNPRRQ VQFPDEDVQEVCVTVVAAGFRLNTFQQFVCCRILQSEAGHISLELLHTSSRGSSRCGSAL ALALLALRPGPGPAPAMEKTELIQKAKLAEQAERYDDMATCMKAVTEQGAELSNEERNLL SVAYKNVVGGRRSAWRVISSIEQKTDTSDKKLQLIKDYREKVESELRSICTTVLEYRFED QENHRCDNEERRIVDIRHWVIKVPVNSWKPAKDLELLDKYLIANATNPESKVFYLKMKGD YFRYLAEVACGDDRKQTIDNSQGAYQEAFDISKKEMQPTHPIRLGLALNFSVFYYEILNN PELACTLAKTAFDEAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDSAGEECDAAEGAE N >gi568815596r:9390180_9655605|GENSCAN_predicted_CDS_6|1266_bp naaagctcttcactgctgctcagccctggagagggggaggaggggcaaagctcccccgaa ccccacaccccctcgcaagtgattgctaatatttatcatgtgtttgccaagtgccaggac aatagcaaacacctcaaaggcattagctcattttatatttgcaggaaccctaggaggcaa gtccagtttccagatgaggacgttcaggaagtttgtgtgacggttgtagcagctggattc cgtctgaacacattccagcaatttgtctgctgcaggatccttcagagtgaggctggccac atcagtctggagcttcttcataccagctctcgaggctcctcccgctgcgggtcggcgctc gccctcgctctcctcgccctccgccccggccccggccccgcgcccgccatggagaagact gagctgatccagaaggccaagctggccgagcaggccgagcgctacgacgacatggccacc tgcatgaaggcagtgaccgagcagggcgccgagctgtccaacgaggagcgcaacctgctc tccgtggcctacaagaacgtggtcgggggccgcaggtccgcctggagggtcatctctagc atcgagcagaagaccgacacctccgacaagaagttgcagctgattaaggactatcgggag aaagtggagtccgagctgagatccatctgcaccacggtgctggaatataggtttgaagac caggaaaatcacagatgtgataatgaagaaaggagaatagttgatattagacattgggta ataaaagtccctgttaattcctggaaaccagctaaagatctggaattgttggataaatat ttaatagccaatgcaactaatccagagagtaaggtcttctatctgaaaatgaagggtgat tacttccggtaccttgctgaagttgcgtgtggtgatgatcgaaaacaaacgatagataat tcccaaggagcttaccaagaggcatttgatataagcaagaaagagatgcaacccacacac ccaatccgcctggggcttgctcttaacttttctgtattttactatgagattcttaataac ccagagcttgcctgcacgctggctaaaacggcttttgatgaggccattgctgaacttgat acactgaatgaagactcatacaaagacagcaccctcatcatgcagttgcttagagacaac ctaacactttggacatcagacagtgcaggagaagaatgtgatgcggcagaaggggctgaa aactaa