GENSCAN 1.0 Date run: 2-Nov-116 Time: 18:08:48 Sequence gi568815595r:50507610_50708594 : 200985 bp : 51.69% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 7945 7940 6 1.05 1.06 Term - 11106 10688 419 1 2 45 38 242 0.250 10.71 1.05 Intr - 17565 17483 83 2 2 64 94 74 0.367 5.38 1.04 Intr - 23224 22903 322 2 1 59 22 165 0.004 2.47 1.03 Intr - 30387 30363 25 0 1 99 100 14 0.002 1.78 1.02 Intr - 32047 31989 59 2 2 96 77 7 0.003 -0.51 1.01 Init - 35532 35466 67 0 1 86 59 40 0.203 2.08 1.00 Prom - 41055 41016 40 1.49 2.07 PlyA - 41095 41090 6 -1.75 2.06 Term - 43358 43222 137 0 2 75 47 152 0.043 8.29 2.05 Intr - 49252 49164 89 0 2 102 18 24 0.002 -3.09 2.04 Intr - 52128 52052 77 0 2 125 66 65 0.347 6.81 2.03 Intr - 53455 53308 148 0 1 80 86 245 0.996 24.25 2.02 Intr - 54138 54113 26 0 2 142 76 22 0.994 3.71 2.01 Init - 58090 57857 234 1 0 75 77 158 0.808 11.59 2.00 Prom - 60834 60795 40 -5.71 3.00 Prom + 60947 60986 40 -8.38 3.01 Init + 61199 61360 162 1 0 63 56 83 0.583 2.31 3.02 Intr + 61771 61936 166 0 1 51 68 73 0.342 1.65 3.03 Intr + 63113 63161 49 0 1 48 111 10 0.202 -2.47 3.04 Intr + 64506 64599 94 0 1 119 92 67 0.608 10.67 3.05 Intr + 69443 69577 135 1 0 102 99 62 0.899 9.97 3.06 Intr + 69900 69964 65 2 2 61 92 81 0.971 3.81 3.07 Intr + 70217 70266 50 2 2 92 80 29 0.915 1.31 3.08 Intr + 71212 71317 106 2 1 77 48 179 0.995 12.57 3.09 Intr + 72235 72354 120 1 0 121 48 86 0.572 8.01 3.10 Intr + 72507 72620 114 0 0 67 94 113 0.909 9.87 3.11 Intr + 73916 74065 150 2 0 47 95 57 0.704 2.09 3.12 Intr + 76221 76332 112 0 1 104 63 34 0.136 3.38 3.13 Term + 85080 85307 228 2 0 33 53 141 0.486 2.06 3.14 PlyA + 87918 87923 6 1.05 4.06 PlyA - 89950 89945 6 1.05 4.05 Term - 100533 99998 536 1 2 52 42 628 0.999 49.41 4.04 Intr - 100984 100764 221 0 2 95 105 227 0.982 23.57 4.03 Intr - 102920 102764 157 0 1 89 86 8 0.667 0.28 4.02 Intr - 104187 104022 166 0 1 102 86 15 0.754 2.75 4.01 Init - 106518 106513 6 0 0 88 94 0 0.616 1.51 4.00 Prom - 108878 108839 40 -4.31 5.00 Prom + 109099 109138 40 -4.41 5.01 Init + 109453 109632 180 0 0 62 101 93 0.617 5.16 5.02 Intr + 109915 110175 261 0 0 36 86 299 0.559 22.72 5.03 Intr + 132757 132896 140 0 2 69 68 187 0.988 14.57 5.04 Intr + 134098 134162 65 1 2 87 87 32 0.495 1.85 5.05 Intr + 134644 134723 80 2 2 127 100 94 0.999 14.27 5.06 Intr + 136800 136923 124 2 1 75 92 119 0.993 11.66 5.07 Intr + 138101 138176 76 0 1 85 76 82 0.998 5.77 5.08 Intr + 138531 138655 125 0 2 70 101 193 0.999 19.53 5.09 Intr + 139131 139216 86 1 2 107 66 77 0.997 7.44 5.10 Intr + 139514 139594 81 1 0 43 98 84 0.966 5.23 5.11 Term + 140285 140437 153 1 0 96 36 208 0.995 14.63 5.12 PlyA + 141659 141664 6 1.05 6.07 PlyA - 143675 143670 6 -0.45 6.06 Term - 147425 147397 29 0 2 121 54 -1 0.004 -1.68 6.05 Intr - 149488 149397 92 0 2 125 75 -11 0.005 1.44 6.04 Intr - 159944 159863 82 0 1 79 94 111 0.672 9.99 6.03 Intr - 162144 162090 55 0 1 5 76 45 0.064 -5.96 6.02 Intr - 165936 165813 124 2 1 37 70 75 0.489 1.69 6.01 Init - 167656 166863 794 1 2 82 75 327 0.534 23.09 6.00 Prom - 175570 175531 40 -1.71 7.03 PlyA - 175848 175843 6 1.05 7.02 Term - 178187 177880 308 0 2 67 41 136 0.766 2.43 7.01 Init - 180143 180089 55 2 1 88 101 2 0.587 2.89 7.00 Prom - 185014 184975 40 -0.31 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 28452 28541 90 1 0 88 82 63 0.889 6.16 S.002 Intr + 47050 47139 90 2 0 70 105 64 0.853 6.76 S.003 Init - 150135 150058 78 0 0 88 100 70 0.961 9.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:50507610_50708594|GENSCAN_predicted_peptide_1|324_aa MGCGGEPHGTQSRSCKKKTSSISIVNSPVARAPPPSSLPAGQLLSTCELPTPDANGLLIQ KHLDFKGGSGGGQALLKELQAQKSWDSTRTGCLPLLPALAAAPDDSGSWYYPPVYPTCSP HPRALSPVPASLRPDVGTLRARGRPCRRRRVGQGHLDGDQGGKIKSSINPPEMEYLEHAQ FYLPTALDEQPLTVHNAGSFWEAGCCAGQLYKASDGVFLKLDEHNMELPCASDVRRVQSV GGGVLRDVANGVHPWMLLIDVVEVALAPSAKISAPSMRDLALAAEGTMDSAMIPPCLAPW ESQSERGGSSRPRHTQYDVFDMDI >gi568815595r:50507610_50708594|GENSCAN_predicted_CDS_1|975_bp atggggtgtggaggagagcctcatgggactcaatccaggtcctgcaagaagaagaccagc agcatttccatcgtcaactcccctgtggcccgggctccccctccctccagcctccccgca ggtcagctactttctacctgtgagctccccaccccagacgctaatgggctcctcattcag aagcacctggacttcaaaggaggctctggaggaggccaggcattactaaaggaattgcag gcacagaagagttgggactccactagaactggctgccttcccctgctaccggctctcgca gcagccccagacgactcaggctcctggtactacccccctgtgtacccgacgtgttctccc cacccccgtgccctgtcccccgtccctgcgtccctgcgccccgatgttggcaccctgcgg gcccgggggcggccgtgccggcgccgcagagtagggcagggccacctggacggtgatcaa ggagggaagataaaatcctccataaatcccccagaaatggagtatctggagcatgctcaa ttctacctgcccacagccttagatgagcagcccctcactgtgcacaatgcaggctccttc tgggaagctgggtgttgtgctgggcaactatacaaggcatcagatggtgttttcctgaaa cttgatgagcacaatatggagttgccatgtgccagtgatgtgcgccgggtgcagtctgtt gggggtggggtgctcagggacgtggccaacggagtccacccttggatgctccttatagat gtcgtggaagtagccttggctccatcagcaaaaatctctgccccttccatgagggacctt gcactggccgctgagggcaccatggattcagcaatgattccaccctgccttgccccctgg gagtcccagtctgagagaggaggcagttccagacccagacatacccagtacgatgtgttt gacatggacatctag >gi568815595r:50507610_50708594|GENSCAN_predicted_peptide_2|236_aa MNSRTASARGWFSSRPPTSESDLEPATDGPASETTTLSPEATTFNDTRIPDAAGGTAGVG TMLLSFGIITVIGLAVALVLYIRKKKRLEKLRHQLMPMYNFDPTEEQDELEQELLEHGRD AASVQAATSVQAMQGKTTLPSQGPLQRPSRLVFTDVANAIHASYSQSCRKTKFPASSQVF QSVGSINIGVKALLQQALCWALVLRGTQSVLDLEETHQQESQDGSIDSNDSTLRRK >gi568815595r:50507610_50708594|GENSCAN_predicted_CDS_2|711_bp atgaactccaggaccgcatctgctaggggctggttcagcagccgcccacccacctctgag tctgacctggaacctgccacagatgggccagcctccgagaccactaccctcagcccagag gccaccacctttaatgacaccagaatccctgatgcagctggtggcacggccggcgtgggt accatgcttctgtcctttgggatcatcacggtgataggcctggctgtggccttggttttg tacatcaggaagaagaagaggctggagaagctacgccaccagctcatgcccatgtacaac ttcgaccccacggaggaacaagatgagttggagcaggagctgctggagcatgggcgggac gccgcctctgtacaggctgctacttctgtgcaggccatgcagggcaagactactctgccc tcccagggcccactgcagagacccagccggctggtgtttaccgatgtggccaatgccatc catgcctcttattcccagtcctgcaggaaaaccaagttcccagcaagttcccaggtcttc cagagtgtggggtcaataaatattggggtgaaagctcttctacagcaggccctctgctgg gctttggtgctgaggggcacccagtcggttctggacttggaagagacccaccagcaggag agccaggatggcagcattgacagcaacgacagcaccctcaggaggaagtga >gi568815595r:50507610_50708594|GENSCAN_predicted_peptide_3|516_aa MLTRHTLRDAAEYFIEGATEVQRSQHLEELPIESRFESVKERPLGSGRQSADPLGSPGRN PPAIGTLKLGYGVRPSLFLKAPTPEAGRPRERATGHARPGVQFGAASPAETELWAQPTPA KNVKKTMPVQYILGEWDFQGLSLRMVPPVFIPRPETEELVEWVLEEVAQRSHAVGSPGSP LILEVGCGSGAISLSLLSQLPQSRVIAVDKREAAISLTHENAQRLRLQDRIWIIHLDMTS ERSWTHLPWGPMDLIVSNPPYVFHQDMEQLAPEIRSYEDPAALDGGEEGMDIITHILALA PRLLKDSGYEWDGSPSSIFLEVDPRHPELVSSWLQSRPDLYLNLVAVRRDFCGSGTLGNS FPYEKLALLSPGPTPTGNLFTLSQVECACWHLNVHSPGMESKRSPYLCWHLLYHAFRNLL IPLISGAPCGSGIPKFSKCLSWGGPWSWKRKQSLCQSHHLPPTLLLLRRPNKVLCALQGQ RRLLAVAVGGDLEHCARGKAGPVEFAIPAFIYSSVP >gi568815595r:50507610_50708594|GENSCAN_predicted_CDS_3|1551_bp atgttgacaagacacactttgcgagacgctgccgagtattttatagaaggcgcaactgag gtccagaggtcccaacacctggaggagttgccaatagagtcaagattcgaatcggtaaaa gagcggcccctggggtctggccgccagtcggccgaccccctggggtctcctgggaggaac ccaccagcgataggaacactgaagctgggctacggcgtccgcccgagccttttcttaaag gcgccgaccccggaagcggggcgtccgagggagcgcgcgacgggccacgcacgtccgggc gtccagttcggggcagcttctccggctgagactgagctatgggctcagcccactccagct aaaaatgtgaagaaaacgatgccggtgcagtacatccttggagagtgggacttccagggg ctcagcctaaggatggtgcccccagtgtttattcctcggccagaaacagaggaactggtt gagtgggtgctggaagaggtggcccagaggtcccatgctgtgggatccccaggcagcccc ctcattctggaggtgggctgcggatcaggagccatctccctcagcctgctgagccagctc ccccagagccgagtcattgctgtggataagcgggaagctgctatctctctgacccatgag aatgctcagaggcttcggttgcaggacaggatttggatcatccacctcgacatgacctca gaaaggagctggacacacctgccctggggccccatggacctgattgtcagcaaccctccc tacgtcttccaccaggacatggagcagctggcccctgagatccgcagctatgaagacccc gcggccctggatggtggggaggagggcatggacatcattacccacattctggccttggca ccccggctcctgaaagactctgggtatgaatgggatgggtctcctagtagtatcttctta gaagtggacccaaggcacccggagcttgtcagcagctggcttcagagccggcctgacctg taccttaatcttgtggctgtgcgcagggacttctgtgggagtgggaccttgggcaactcc tttccctatgagaagctggctcttctgagtccagggccaacgccaactggcaacctcttt actcttagtcaagtggaatgtgcatgctggcatctgaatgtccattcgccaggcatggag agcaagagaagcccatacctctgctggcaccttctgtaccatgccttcagaaaccttctt atccccctcatctctggggccccctgtggatctggcatacccaagttcagtaaatgtcta tcatggggtgggccctggagctggaagaggaaacagagcctctgccagagtcaccacctg cccccaaccctacttctcctccgaaggcccaacaaggtcctgtgcgccctccaagggcag aggaggctgctggctgtggctgtgggaggagacctggagcactgtgcccgaggcaaagct ggccccgtggagtttgctattccagcttttatttactcctctgtaccctga >gi568815595r:50507610_50708594|GENSCAN_predicted_peptide_4|361_aa MQAVRATASQSLSCARAPREPTQHALRAHWFPPAAAVQPSPHSGVAAAAGTWSSAFRGEH PLVSSGLLLGVREQSFRLLRSKAGTHMYLEHTSHCPHHDDDTAMDTPLPRPRPLLAVERT GQRPLWAPSLELPKPVMQPLPAGAFLEEVAEGTPAQTESEPKVLDPEEDLLCIAKTFSYL RESGWYWGSITASEARQHLQKMPEGTFLVRDSTHPSYLFTLSVKTTRGPTNVRIEYADSS FRLDSNCLSRPRILAFPDVVSLVQHYVASCTADTRSDSPDPAPTPALPMPKEDAPSDPAL PAPPPATAVHLKLVQPFVRRSSARSLQHLCRLVINRLVADVDCLPLPRRMADYLRQYPFQ L >gi568815595r:50507610_50708594|GENSCAN_predicted_CDS_4|1086_bp atgcaggcggtccgcgccactgcctctcagtccctgtcctgcgcccgcgcgccccgggag cctacccagcacgcgctccgcgcccactggttccctccagccgccgccgtccagccgagt ccccactccggagtcgccgctgccgcggggacatggtcctctgcgttcaggggtgagcac ccccttgtaagctcagggctactgttgggtgtcagggaacaaagttttagactgctgcgc tccaaagcgggcacacacatgtacctagaacacaccagccactgtccccaccatgatgat gacacagccatggacacacccctgcccagacctcgtcctttgctggctgtggagcggact gggcagcggcccctgtgggccccgtccctggaactgcccaagccagtcatgcagcccttg cctgctggggccttcctcgaggaggtggcagagggtaccccagcccagacagagagtgag ccaaaggtgctggacccagaggaggatctgctgtgcatagccaagaccttctcctacctt cgggaatctggctggtattggggttccattacggccagcgaggcccgacaacacctgcag aagatgccagaaggcacgttcttagtacgtgacagcacgcaccccagctacctgttcacg ctgtcagtgaaaaccactcgtggccccaccaatgtacgcattgagtatgccgactccagc ttccgtctggactccaactgcttgtccaggccacgcatcctggcctttccggatgtggtc agccttgtgcagcactatgtggcctcctgcactgctgatacccgaagcgacagccccgat cctgctcccaccccggccctgcctatgcctaaggaggatgcgcctagtgacccagcactg cctgctcctccaccagccactgctgtacacctaaaactggtgcagccctttgtacgcaga agcagtgcccgcagcctgcaacacctgtgccgccttgtcatcaaccgtctggtggccgac gtggactgcctgccactgccccggcgcatggccgactacctccgacagtaccccttccag ctctga >gi568815595r:50507610_50708594|GENSCAN_predicted_peptide_5|456_aa MTRRERGVEWGSGGGGGGEGGEVTWAPAARLSALGFLRPPAPATASPAPRRYLSKVRCRQ KRQAGAASERPAGAMDGETAEEQGGPVPPPVAPGGPGLGGAPGGRREPKKYAVTDDYQLS KQVLGLGVNGKVLECFHRRTGQKCALKLLYDSPKARQEVDHHWQASGGPHIVCILDVYEN MHHGKRCLLIIMECMEGGELFSRIQERGDQAFTEREAAEIMRDIGTAIQFLHSHNIAHRD VKPENLLYTSKEKDAVLKLTDFGFAKETTQNALQTPCYTPYYVAPEVLGPEKYDKSCDMW SLGVIMYILLCGFPPFYSNTGQAISPGMKRRIRLGQYGFPNPEWSEVSEDAKQLIRLLLK TDPTERLTITQFMNHPWINQSMVVPQTPLHTARVLQEDKDHWDEVKEEMTSALATMRVDY DQVKIKDLKTSNNRLLNKRRKKQAGSSSASQGCNNQ >gi568815595r:50507610_50708594|GENSCAN_predicted_CDS_5|1371_bp atgacgcgcagggagcggggggtggagtgggggagtgggggaggggggggtggggagggg ggggaggtcacgtgggcgccggcagcgcgactctcggccctgggatttctgcggccgcca gctcccgcgaccgcctctcctgcccctcgccggtacctcagcaaggtgcgttgccgccag aagcgccaggctggggccgcctctgagcgccccgcgggggccatggatggtgaaacagca gaggagcaggggggccctgtgcccccgccagttgcacccggcggacccggcttgggcggt gctccgggggggcggcgggagcccaagaagtacgcagtgaccgacgactaccagttgtcc aagcaggtgctgggcctgggtgtgaacggcaaagtgctggagtgcttccatcggcgcact ggacagaagtgtgccctgaagctcctgtatgacagccccaaggcccggcaggaggtagac catcactggcaggcttctggcggcccccatattgtctgcatcctggatgtgtatgagaac atgcaccatggcaagcgctgtctcctcatcatcatggaatgcatggaaggtggtgagttg ttcagcaggattcaggagcgtggcgaccaggctttcactgagagagaagctgcagagata atgcgggatattggcactgccatccagtttctgcacagccataacattgcccaccgagat gtcaagcctgaaaacctactctacacatctaaggagaaagacgcagtgcttaagctcacc gattttggctttgctaaggagaccacccaaaatgccctgcagacaccctgctatactccc tattatgtggcccctgaggtcctgggtccagagaagtatgacaagtcatgtgacatgtgg tccctgggtgtcatcatgtacatcctcctttgtggcttcccacccttctactccaacacg ggccaggccatctccccggggatgaagaggaggattcgcctgggccagtacggcttcccc aatcctgagtggtcagaagtctctgaggatgccaagcagctgatccgcctcctgttgaag acagaccccacagagaggctgaccatcactcagttcatgaaccacccctggatcaaccaa tcgatggtagtgccacagaccccactccacacggcccgagtgctgcaggaggacaaagac cactgggacgaagtcaaggaggagatgaccagtgccttggccactatgcgggtagactac gaccaggtgaagatcaaggacctgaagacctctaacaaccggctcctcaacaagaggaga aaaaagcaggcaggcagctcctctgcctcacagggctgcaacaaccagtag >gi568815595r:50507610_50708594|GENSCAN_predicted_peptide_6|391_aa MAVPGAAPRARRPGDNAAGTAAGPAVAHRESRRGRSGGAGQWAAHPSQAVRDAAATPATS GPRGSAGGPAAPGGAGEGSPPVRRRCSRAAACFFTAPPPGSRRRRGSARPSGLAGRPRMR HSLPRRGSRCPGSRRPEPARESTRRCCAPAEGNPREAIGLPGPGPASRIAGPGPSAAANP GGGEAGQVGDSLYPDSPLASASSASCLRPEMATGLLVGEGRSRKGGHDTIFRPIPVSPTV LSERSPEPPCAFPGPAGWSRHEGTRGSRGVQEHNERANCSKKIDPSPVATMEVFYGMKPH SYKTLQAQKRKTAQNSTDIPNQDRGGNQNGFYIRDASEDNACKVLNTEPGTRAWKLAACS FPKPPFTHARHPNLLQAALPASGKHSDFKAS >gi568815595r:50507610_50708594|GENSCAN_predicted_CDS_6|1176_bp atggccgtgccgggcgcggccccgcgggcgcggcgaccgggcgacaacgcggcggggacg gccgcgggcccggctgtggcgcaccgcgagtcgaggcggggacggtcaggcggcgcgggg cagtgggcggcgcatccctcacaggccgtccgggatgccgccgccacccccgccacgtct ggaccgcgaggctccgcgggtggcccagcagcccctggcggcgcgggcgaaggctcgcca cctgtccgtcgccgctgctcccgtgccgccgcttgcttcttcacagctccgccgcccggc tcccggcgccgccgcggctcggctcgccccagcgggctcgcaggacgcccgcgcatgcgc cactccctgccgcgcaggggctcgcgctgcccgggctcgcgccgtcccgagcccgcgcgc gaatccacgcgccgctgctgcgcgccagcggagggaaatccgcgagaagctattggactg cccggccccggaccggcttctagaatcgctggtccggggccctcagctgcagcaaacccc ggagggggcgaagccggtcaggtgggcgactccctctacccggactccccactcgcttcc gcttccagtgcctcctgcctgcgccccgagatggccacaggcctcctggtcggagagggg agatccagaaaaggaggacacgacaccattttcagacccatcccagtgtcacccacagtg ctgtcggaaaggtccccggagccaccctgtgcattcccagggcccgcaggatggagtcgc cacgagggaaccaggggtagcaggggtgtccaagaacataacgaaagagctaactgttcc aagaagatagacccctccccggtggccaccatggaagtattctatggaatgaagccacat tcttacaaaactctccaggcccaaaagagaaaaacagcccaaaatagcacagatattcca aatcaggacagaggtgggaatcagaatggcttctacatcagggatgcttctgaggataat gcctgcaaagttctcaacacagagcccggcacaagagcgtggaaactggccgcctgctcc ttccctaaaccccctttcacacatgcacggcatcctaacctcctccaggcagccctccca gcttcagggaagcactctgactttaaagcatcctga >gi568815595r:50507610_50708594|GENSCAN_predicted_peptide_7|120_aa MTGQLSNFTTQVKEVTSECHGQPLSHVSFELFADKVPKTAENFHALTTGEKGFDNKGSSF HIIIPGFLCQSGDLTCHNGVASPATRRSDENFILKHTLTGTSNFPIDVSEEETNIPVLTS >gi568815595r:50507610_50708594|GENSCAN_predicted_CDS_7|363_bp atgactggacagctttctaatttcactactcaggtcaaagaggtcacttcagaatgccat ggacagcccttgagccacgtctcctttgagctgtttgcagacaaagttccaaagacagca gaaaacttccatgctctaaccactggagaaaaaggatttgataataagggttccagcttt catataattattccagggtttttgtgtcagagtggtgacttaacatgccataatggtgtg gcaagtccagctacaaggagatctgatgagaacttcatcctgaagcatacactcactgga actagcaattttcctatagatgtcagtgaggaagaaacaaatatccctgtattaacatca taa