GENSCAN 1.0    Date run: 2-Nov-116    Time: 18:51:20

Sequence gi568815586r:120362022_120569609 : 207588 bp : 46.99% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -   1114   1022   93  1  0  112   98   150 0.960  18.56
 1.05 Intr -   2734   2693   42  1  0   97  100    14 0.737   1.94
 1.04 Intr -   6071   5987   85  1  1   53   79    84 0.608   3.82
 1.03 Intr -   6252   6171   82  1  1   35   99   134 0.986   7.90
 1.02 Intr -   6466   6396   71  0  2   66   40    64 0.626  -1.77
 1.01 Init -   6960   6812  149  0  2  121   84    97 0.680  10.17
 1.00 Prom -  21238  21199   40                              -4.76

 2.04 PlyA -  22349  22344    6                               1.05
 2.03 Term -  27003  26925   79  2  1   77   38   106 0.823   1.74
 2.02 Intr -  31337  31276   62  2  2   77  107    32 0.718   1.53
 2.01 Init -  33777  33724   54  0  0   92   90    59 0.814   5.78
 2.00 Prom -  50144  50105   40                              -1.96

 3.00 Prom +  50277  50316   40                              -3.86
 3.01 Init +  76106  76208  103  1  1   85   84    74 0.818   5.65
 3.02 Intr +  76358  76500  143  0  2   85  107   240 0.941  25.67
 3.03 Term +  78759  78842   84  2  0   92   44    51 0.665  -1.25
 3.04 PlyA +  81379  81384    6                               1.05

 4.03 PlyA -  81970  81965    6                               1.05
 4.02 Term -  82934  82851   84  2  0   97   43   101 0.996   4.15
 4.01 Init -  84351  84205  147  0  0   97   99   298 0.636  31.89
 4.00 Prom -  84399  84360   40                             -16.83

 5.00 Prom +  84417  84456   40                             -16.89
 5.01 Init +  84460  84540   81  0  0   53  103   115 0.999   8.49
 5.02 Intr +  84636  84808  173  2  2  121   97   359 0.997  38.84
 5.03 Intr +  95055  95158  104  0  2   70   98   142 0.974  13.22
 5.04 Term +  96606  96613    8  1  2  121   42     0 0.606  -3.27
 5.05 PlyA +  97680  97685    6                              -0.45

 6.05 PlyA -  99376  99371    6                               1.05
 6.04 Term - 100141  99998  144  1  0  117   53    47 0.825   2.01
 6.03 Intr - 102086 101929  158  0  2   51   90   191 0.677  15.33
 6.02 Intr - 103766 103675   92  1  2   98   47   103 0.594   6.84
 6.01 Init - 107588 107401  188  2  2   34   93   527 0.997  46.33
 6.00 Prom - 109773 109734   40                              -4.16

 7.00 Prom + 123020 123059   40                              -4.46
 7.01 Init + 134401 134532  132  0  0   76  100   232 0.897  23.34
 7.02 Term + 136052 136189  138  1  0  116   42   130 0.993   9.16
 7.03 PlyA + 136452 136457    6                               1.05

 8.08 PlyA - 137380 137375    6                               1.05
 8.07 Term - 141864 141763  102  0  0  124   31   107 0.999   6.88
 8.06 Intr - 142060 141949  112  0  1   89   80    94 0.951   9.08
 8.05 Intr - 142962 142874   89  0  2   60   52    24 0.638  -5.23
 8.04 Intr - 148102 147996  107  2  2   86   80   177 0.842  16.63
 8.03 Intr - 154767 154546  222  1  0   96  105   172 0.998  17.80
 8.02 Intr - 160342 160193  150  2  0   48   95   110 0.989   7.83
 8.01 Init - 160970 160604  367  2  1   77   50   262 0.583  18.49
 8.00 Prom - 161681 161642   40                              -9.46

 9.00 Prom + 163427 163466   40                              -3.06
 9.01 Init + 164287 164645  359  0  2   60   33   289 0.175  17.18
 9.02 Intr + 172314 172368   55  0  1   60   57    69 0.437  -0.02
 9.03 Term + 172637 172936  300  1  0   92   40   340 0.837  24.82
 9.04 PlyA + 173575 173580    6                               1.05

10.00 Prom + 176028 176067   40                              -6.36
10.01 Init + 181924 181975   52  0  1   64  105    22 0.674   0.82
10.02 Intr + 184384 184580  197  2  2  106   64   136 0.981  12.13
10.03 Intr + 190478 190677  200  1  2  109   89    25 0.987   2.85
10.04 Intr + 192697 192787   91  1  1  111   89    90 0.963  11.40
10.05 Intr + 195261 195445  185  2  2   54   78    10 0.516  -4.81
10.06 Intr + 195525 195661  137  0  2   39  105    74 0.578   4.41
10.07 Intr + 198705 198865  161  1  2   70  100   230 0.997  22.11
10.08 Intr + 201326 201602  277  1  1   12  100   199 0.628  10.59
10.09 Intr + 201789 201922  134  1  2   44   78   210 0.999  15.96
10.10 Intr + 203051 203168  118  1  1   76   94    15 0.984   0.94
10.11 Intr + 203407 203508  102  2  0   57   84   152 0.998  11.85
10.12 Intr + 204804 204959  156  1  0   87   97    74 0.475   8.18

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_1|174_aa
MADPGPAGPPRSPGPRPLRPGARRSRGPFVSLLLPQQDVHRGTQLADYAGPARPSASRGP
GGRQEAQRERGEGEGLREYFGQFGEVKECLVMRDPLTKRSRGFGFVTFMDQAGVDKVLAQ
SRHELDSKTIDPKVAFPRRAQPKMVTRTKKIFVGGLSVNTTVEDVKQYFEQFGK

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_1|522_bp
atggccgatccgggtccggccgggcctccccggagcccgggcccgcgccccctgcgccct
ggcgcccggcgctcacgcgggccctttgtgtctctcctcctcccgcagcaagatgttcat
cgggggactcagttggcagactacgcaggcccagctcggccctcggcttcccggggcccc
ggtgggcgccaggaggctcagcgggaacggggcgagggcgaagggctgcgcgaatacttc
ggccagttcggggaggtgaaggagtgtctggtgatgcgggaccccctgaccaagagatcc
aggggtttcggcttcgtcactttcatggaccaggcgggggtggataaagtgctggcgcaa
tcgcggcacgagctcgactccaaaacaattgaccctaaggtggccttccctcggcgagca
cagcccaagatggtgactcgaacgaagaagatctttgtgggggggctgtcggtgaacacc
acggtggaggacgtgaagcaatattttgagcagtttgggaag

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_2|64_aa
MGRARWLTPVIPALLEAEVPIMHPLSACSLCHPVNALVRSRDADWLRAGPWAQRCVSIIV
RDCE

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_2|195_bp
atgggccgggcacggtggctcacacctgtaatcccagcacttttggaggctgaggtcccc
atcatgcacccgctcagtgcttgttctctctgccatcctgtcaatgcccttgtgagatca
cgtgatgccgactggctccgagctgggccctgggctcagcgctgtgtgagcatcattgta
cgggactgtgaatag

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_3|109_aa
MAVVGVSSVSRLLGRSRPQLGRPMSSGAHGEEGSARMWKTLTFFVALPGVAVSMLNVYLK
SHHGEHERPEFIAYPHLRIRTKKLAFELQVSVQVEKKLALVISVSTVKV

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_3|330_bp
atggcggtagttggtgtgtcctcggtttctcggctgctgggtcggtcccgcccacagctg
gggcggcctatgtcgagtggcgcccatggcgaagagggctcagctcgcatgtggaagact
ctcaccttcttcgtcgcgctccccggggtggcagtcagcatgctgaatgtgtacctgaag
tcgcaccacggagagcacgagagacccgagttcatcgcctacccccatctccgcatcagg
accaagaaattagcatttgagcttcaagtcagtgtccaagttgaaaagaaattggcatta
gttatttctgtttccacagtgaaggtctag

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_4|76_aa
MNSVGEACTDMKREYDQCFNRWFAEKFLKGDSSGDPCTDLFKRYQQCVQKAIKEKEIPIE
GLEFMGHGKEKPENSS

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_4|231_bp
atgaacagtgtgggggaggcatgcacggacatgaagcgcgagtacgaccagtgcttcaat
cgctggttcgccgagaaatttctcaagggggacagctccggggacccgtgcaccgacctc
ttcaagcgctaccagcagtgtgttcagaaagcaataaaggagaaagagattcctattgaa
ggactggagttcatgggccatggcaaagaaaagcctgaaaattcttcttga

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_5|121_aa
MWSRLVWLGLRAPLGGRQGFTSKADPQGSGRITAAVIEHLERLALVDFGSREAVARLEKA
IAFADRLRAVDTDGVEPMESVLEDRCLYLRSDNVVEGNCADELLQNSHRVVEEYFVAPPG
R

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_5|366_bp
atgtggtcgcggttggtgtggctgggccttcgggcccctctgggcgggcgccagggcttc
acctccaaggcggatcctcagggcagtggccggatcacggctgcggtgatcgagcacctg
gagcgtctagcgcttgtggacttcggcagccgcgaggcagtggcgcgactggagaaagct
atcgccttcgccgaccggctacgcgccgtggacacagacggggtggagcccatggaatcg
gtcctggaggacagatgtctatacctgagatccgacaatgtggtagaaggcaactgtgct
gatgaattactacaaaactcccatcgcgtcgtggaggagtactttgtggcccccccaggt
aggtga

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_6|193_aa
MSGWADERGGEGDGRIYVGNLPTDVREKDLEDLFYKYGRIREIELKNRHGLVPFAFVRFE
DPRDAEDAIYGRNGYDYGQCRLRVEFPRTYGGRGSWQDLKDHMREAGDVCYADVQKDGVG
MVEYLRKEDMEYALRKLDDTKFRSHEGETSYIRVYPERSTSYGYSRSRSGSRGRDSPYQS
RGSPHYFSPFRPY

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_6|582_bp
atgtcgggctgggcggacgagcgcggcggcgagggcgacgggcgcatctacgtggggaac
cttccgaccgacgtgcgcgagaaggacttggaggacctgttctacaagtacggccgcatc
cgcgagatcgagctcaagaaccggcacggcctcgtgcccttcgccttcgtgcgcttcgag
gacccccgagatgcagaggatgctatttatggaagaaatggttatgattatggccagtgt
cggcttcgtgtggagttccccaggacttatggaggtcggggcagctggcaggacctgaag
gatcacatgcgagaagctggggatgtctgttatgctgatgtgcagaaggatggagtgggg
atggtcgagtatctcagaaaagaagacatggaatatgccctgcgtaaactggatgacacc
aaattccgctctcatgagggtgaaacttcctacatccgagtttatcctgagagaagcacc
agctatggctactcacggtctcggtctgggtcaaggggccgtgactctccataccaaagc
aggggttccccacactacttctctcctttcaggccctactga

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_7|89_aa
MCDRKAVIKNADMSEEMQQDSVECATQALEKYNIEKDIAAHIKKEFDKKYNPTWHCIVGR
NFGSYVTHETKHFIYFYLGQVAILLFKSG

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_7|270_bp
atgtgcgaccgaaaggccgtgatcaaaaatgcggacatgtcggaagagatgcaacaggac
tcggtggagtgcgctactcaggcgctggagaaatacaacatagagaaggacattgcggct
catatcaagaaggaatttgacaagaagtacaatcccacctggcattgcatcgtggggagg
aacttcggtagttatgtgacacatgaaaccaaacacttcatctacttctacctgggccaa
gtggccattcttctgttcaaatctggttaa

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_8|382_aa
MRFAKKHNKKGLKKMQANSARATSARADAIKALVKPKEVKPEIPKGVSRKLDPLAYIAHP
QAWEMCSFLHCQGAQAMSVKGQGQRSNQGPGCSSSLGSQRCPDTYEGFRVDISVCQCEDR
RTVYQVFESVAKKYDVMNDMMSLGIHRVWKDLLLWKMHPLPGTQLLDVAGGTGDIAFRFL
NYVQSQHQRKQKRQLRAQQNLSWEEIAKEYQNEEDSLGGSRVVVCDINKEMLKVGKQKAL
AQGYRAGLAWVLGDAEELPFDDDKFDIYTIAFGIRNVTHIDQALQEAHRVLKPGGRFLCL
EFSQVNNPLISRLYDLYSFQVIPVLGEVIAGDWKSYQYLVESIRRFPSQEEFKDMIEDAG
FHKVTYESLTSGIVAIHSGFKL

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_8|1149_bp
atgcgctttgccaagaagcacaacaagaaaggcctaaaaaagatgcaggccaacagtgcc
agggccacgagtgcacgtgctgacgctatcaaggcccttgtaaagcccaaggaggttaag
cccgagatcccaaagggtgtcagccgcaagctcgatccacttgcctacattgcccacccc
caggcttgggaaatgtgctcattcctgcattgccaaggggctcaagctatgtcggtcaaa
ggccaaggccaaagatcaaaccaaggcccaggctgcagctccagcttaggctcccaaagg
tgcccggacacctacgaaggcttcagagtagatatctctgtctgccaatgtgaggacaga
aggactgtctatcaggtgtttgaaagtgtggctaagaagtatgatgtgatgaatgatatg
atgagtcttggtatccatcgtgtttggaaggatttgctgctctggaagatgcacccgctt
cctgggacccagctgcttgatgttgctggaggcacaggtgacattgcattccggttcctt
aattatgttcagtcccagcatcagagaaaacagaagaggcagttaagggcccaacaaaat
ttatcctgggaagaaattgccaaagagtaccagaatgaagaagattccttgggcgggtct
cgtgtcgtggtgtgtgacatcaacaaggagatgctaaaggttggaaagcagaaagccttg
gctcaaggatacagagctggacttgcatgggtattaggagatgctgaagaactgcccttt
gatgatgacaagtttgatatttacaccattgcctttgggatccggaatgtcacacacatt
gatcaggcactccaggaagctcatcgggtgctgaaaccaggaggacggtttctctgtctg
gaatttagccaagtgaacaatcccctcatatccaggctttatgatctatatagcttccag
gtcatccctgtcctgggagaggtcatcgctggagactggaagtcctatcagtaccttgta
gagagtatccgaaggtttccgtctcaggaagagttcaaggacatgatagaagatgcaggc
tttcacaaggtgacttacgaaagtctaacatcaggcattgtggccattcattctggcttc
aaactttaa

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_9|237_aa
MKMLIMLEVKFKLSVSGVIGEIDDCGNIDTAATGETLDIQPEELQEDELTNMNKKWSCDK
KVEDVPEEVMPAKTFTLKEFLEVFHNIESTKDKISEVDKNLERNMAICQGIEKIFHTTRS
PSGFFEMLFGDSSPFPEQFEKPRKETGKNVAMKAENRCRRRPPPALNAMSLGPRRARSAP
TAVAAEAPVDAAELPQRRRHRLRHGQEQRLQQLLRLFGQQQRATAAPLRLGGASRRV

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_9|714_bp
atgaaaatgctgataatgctggaagtgaaattcaaattgagtgtcagtggagttatagga
gaaatagatgactgtgggaatattgacactgctgccactggagagactctagacatacag
ccagaggaactgcaggaagatgaacttaccaacatgaacaagaaatggagctgtgacaaa
aaggttgaagatgtcccagaagaagtgatgccagcaaaaactttcacattaaaggagttc
ttggaggtatttcataacattgaaagcacaaaggataaaatatcagaagtagataaaaac
ttagaaagaaatatggcaatttgccaaggcatagaaaagatatttcatacgacaagaagt
ccatccgggttctttgagatgctgtttggcgactcgtcgccattcccggagcagtttgag
aagccaaggaaggaaacagggaaaaatgtcgccatgaaggccgagaaccgctgccgccgc
cgacccccgccggccctgaacgccatgagcctgggtccccgccgcgcccgctccgctccg
actgccgtcgccgccgaggcccccgttgatgccgctgagctcccccaacgccgccgccac
cgcctccgacatggacaagaacagcggctccaacagctcctccgcctcttcgggcagcag
caaagggcaacagccgccccgctccgcctcggcggggccagccggcgagtctaa

>gi568815586r:120362022_120569609|GENSCAN_predicted_peptide_10|604_aa
MLGMVAHACNPSTLGGRDGKNSSGSKRYNRKRELSYPKNESFNNQSRRSSSQKSKTFNKM
PPQRGGGSSKLFSSSFNGGRRDEVAEAQRAEFSPAQFSGPKKINLNHLLNFTFEPRGQTG
HFEGSGHGSWGKRNKWGHKPFNKELFLQANCQFVVSEDQDYTAHFADPDTLVNWDFVEQV
RICSHEVPSCPICLYPPTAAKITRCGHIFCWACILHYLSLSEKTWSKCPICYSSVHKKDL
KSVVATESHQYVVGDTITMQLMKREKGVLVALPKSKWMNVDHPIHLGDEQHSQYSKLLLA
SKEQVLHRVVLEEKVALEQQLAEEKHTPESCFIEAAIQELKGVLEYLSAFDEETTEVCSL
DTPSRPLALPLVEEEEAVSEPEPEGLPEACDDLELADDNLKEGTICTESSQQEPITKSGF
TRLSSSPCYYFYQAEDGQHMFLHPVNVRCLVREYGSLERSPEKISATVVEIAGYSMSEDV
RQRHRYLSHLPLTCEFSICELALQPPVVSKETLEMFSDDIEKRKRQRQKKAREERRRERR
IEIEENKKQGKYPEVHIPLENLQQFPAFNSYTCSSDSALGPTSTEGHGALSISPLSRSPG
SHAX

>gi568815586r:120362022_120569609|GENSCAN_predicted_CDS_10|1812_bp
atgctgggcatggtggctcatgcctgtaatcccagtactttgggaggccgagatggaaag
aactccagtggatccaagcgttataatcgcaaacgtgaactttcctaccccaaaaatgaa
agttttaacaaccagtcccgtcgctccagttcacagaaaagcaagacttttaacaagatg
cctcctcaaaggggcggcggcagcagcaaactctttagctcttcttttaatggtggaaga
cgagatgaggtagcagaggctcaacgggcagagtttagccctgcccagttctctggtcct
aagaagatcaacctgaaccacttgttgaatttcacttttgaaccccgtggccagacgggt
cactttgaaggcagtggacatggtagctggggaaagaggaacaagtggggacataagcct
tttaacaaggaactctttttacaggccaactgccaatttgtggtgtctgaagaccaagac
tacacagctcattttgctgatcctgatacattagttaactgggactttgtggaacaagtg
cgcatttgtagccatgaagtgccatcttgcccaatatgcctctatccacctactgcagcc
aagataacccgttgtggacacatcttctgctgggcatgcatcctgcactatctttcactg
agtgagaagacgtggagtaaatgtcccatctgttacagttctgtgcataagaaggatctc
aagagtgttgttgccacagagtcacatcagtatgttgttggtgataccattacgatgcag
ctgatgaagagggagaaaggggtgttggtggctttgcccaaatccaaatggatgaatgta
gaccatcccattcatctaggagatgaacagcacagccagtactccaagttgctgctggcc
tctaaggagcaggtgctgcaccgggtagttctggaggagaaagtagcactagagcagcag
ctggcagaggagaagcacactcccgagtcctgctttattgaggcagctatccaggagctc
aagggtgtgctggagtatctgtctgccttcgatgaagaaaccacggaagtttgttctctg
gacactccttctagacctcttgctctccctctggtagaagaggaggaagcagtgtctgaa
ccagagcctgaggggttgccagaggcctgtgatgacttggagttagcagatgacaatctt
aaagaggggaccatttgcactgagtccagccagcaggaacccatcaccaagtcaggcttc
acacgcctcagcagctctccttgttactacttttaccaagcggaagatggacagcatatg
ttcctgcaccctgtgaatgtgcgctgcctcgtgcgggagtacggcagcctggagaggagc
cccgagaagatctcagcaactgtggtggagattgctggctactccatgtctgaggatgtt
cgacagcgtcacagatatctctctcacttgccactcacctgtgagttcagcatctgtgaa
ctggctttgcaacctcctgtggtctctaaggaaaccctagagatgttctcagatgacatt
gagaagaggaaacgtcagcgccaaaagaaggctcgggaggaacgccgccgagagcgcagg
attgagatagaggagaacaagaaacagggcaagtacccagaagtccacattcccctcgag
aatctacagcagtttcctgccttcaattcttatacctgctcctctgattctgctttgggt
cccaccagcaccgagggccatggggccctctccatttctcctctcagcagaagtccaggt
tcccatgcagnn