GENSCAN 1.0	Date run: 5-Nov-116	Time: 03:49:07

Sequence gi568815589r:104504674_104705627 : 200954 bp : 36.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +     58    477  420  0  0   82   52   232 0.060  15.61
 1.02 Intr +  16334  16502  169  1  1   29   71   147 0.031   5.70
 1.03 Term +  18181  18347  167  2  2   21   38   125 0.083  -2.10
 1.04 PlyA +  19049  19054    6                               1.05

 2.02 PlyA -  19204  19199    6                               1.05
 2.01 Sngl -  22185  21580  606  0  0   87   44   205 0.427  11.94
 2.00 Prom -  23061  23022   40                              -4.95

 3.02 PlyA -  25002  24997    6                               1.05
 3.01 Sngl -  31699  31097  603  1  0   87   39   314 0.384  22.34
 3.00 Prom -  36941  36902   40                              -3.45

 4.00 Prom +  51024  51063   40                              -3.75
 4.01 Sngl +  64846  65457  612  0  0   99   42   420 0.802  34.44
 4.02 PlyA +  65929  65934    6                               1.05

 5.00 Prom +  80677  80716   40                              -3.75
 5.01 Init +  86074  86226  153  0  0   72   42    95 0.619   3.33
 5.02 Term +  87696  87857  162  2  0   14   48   214 0.941   6.95
 5.03 PlyA +  89931  89936    6                               1.05

 6.03 PlyA -  90047  90042    6                               1.05
 6.02 Term -  94594  93784  811  0  1   17   36   355 0.004  14.86
 6.01 Init - 100855 100002  854  1  2   38   89   510 0.027  40.26
 6.00 Prom - 103032 102993   40                              -6.95

 7.02 PlyA - 103180 103175    6                               1.05
 7.01 Sngl - 113213 112575  639  2  0   83   42   318 0.553  22.63
 7.00 Prom - 118256 118217   40                              -7.65

 8.00 Prom + 118552 118591   40                              -3.65
 8.01 Init + 119817 120027  211  2  1   68   57   144 0.551   8.29
 8.02 Term + 120487 121028  542  2  2   32   38   251 0.520   7.93
 8.03 PlyA + 121762 121767    6                               1.05

 9.02 PlyA - 121917 121912    6                               1.05
 9.01 Sngl - 129770 129555  216  2  0   68   53   253 0.942  14.72
 9.00 Prom - 144952 144913   40                              -4.05

10.03 PlyA - 145937 145932    6                               1.05
10.02 Term - 152285 152108  178  1  1   65   48   167 0.297   6.58
10.01 Init - 156528 156419  110  0  2   23   84    85 0.196   1.34
10.00 Prom - 160597 160558   40                              -5.95

11.00 Prom + 171547 171586   40                              -4.25
11.01 Init + 180831 180944  114  2  0   74  102    98 0.712  10.06
11.02 Term + 189824 190789  966  1  0   52   48   320 0.004  15.46
11.03 PlyA + 191390 191395    6                               1.05

12.00 Prom + 194375 194414   40                              -5.85
12.01 Sngl + 200094 200609  516  2  0   21   49   328 0.838  17.99
12.02 PlyA + 200705 200710    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 130471 130218  254  2  2    7   48   242 0.851   6.92
S.002 Init - 170109 169997  113  0  2    5  107   136 0.815   6.93

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_1|251_aa
MVEMMSVLPLSLCGNSIINHFTCEILAILKLVCVDTSLVQLIMLVISVLLLPMPMLLICI
SYAFILASILRISSVEGRSKAFSTCTAHLMVVVLFYGTALSMHLKPSAVDSQEIDKFMAL
VYAGQTPMLNPIIYSLRNKEPPLVIPRQTGSGVDLQQTPADLQQRGLTVGRKTNKQKGIA
HPLRDPIRRSPTSKTKEKGNLHRKLRLKAAVKTTGDVQQGYAVLGHVRRIHRKRNKKDCN
RFTEEFKLTGM

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_1|756_bp
atggtggaaatgatgtctgtgctgccactgtctctctgtggtaatagcatcatcaatcat
ttcacttgtgaaattctggccatcttgaaattggtttgtgtggacacctccctggtgcag
ttaatcatgctggtgatcagtgtacttcttctccccatgccaatgctactcatttgtatc
tcttatgcatttatcctcgccagtatcctgagaatcagctcagtggaaggtcgaagtaaa
gccttttcaacgtgcacagcccacctgatggtggtagttttgttctatgggacggctctc
tccatgcacctgaagccctccgctgtagattcacaggaaatagacaaatttatggctttg
gtgtatgccggacaaacccccatgttgaatcctatcatctatagtctacggaacaaagag
cctccgctggtgatacccaggcaaacagggtctggggtggaccttcagcaaactccagca
gacctgcagcagaggggcttgactgttggaaggaaaactaacaaacagaaaggaatagca
catccactcagagaccccatccgaaggtcaccaacatcaaagaccaaagaaaaaggaaat
ttacataggaaattaaggctaaaagcagctgtaaaaacgactggtgatgtgcagcaagga
tatgctgtgcttggccatgtcagaagaatccacagaaaaagaaacaaaaaggattgtaat
aggtttactgaagagtttaaactcactggaatgtaa

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_2|201_aa
MAFDRYVAICNPLRYPIIMNKVVYVLLTSVSWLSGGINSTVQTSLAMRWPFCGNNIINHF
LCEILAVLKLACSDISVNIVTLAVSNIAFLVLPLLVIFFSYMFILYTILRTNSATGRHKA
FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLLGKDNLQATEGLVSMFYGVVTPMLNPIIYS
LRNKDVKAAIKYLLSRKAINQ

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_2|606_bp
atggcatttgatcgttatgtggccatctgtaaccctctgagataccccatcatcatgaac
aaggtggtgtatgtactgctgacttctgtatcatggctttctggtggaatcaattcaact
gtgcaaacatcacttgccatgcgatggcctttctgtgggaacaatattattaatcatttc
ttatgcgagatcttagctgtcctaaaattagcttgttctgatatatctgtcaatattgtt
accctagcagtgtcaaatattgctttcctagttcttcctctgctcgtgatttttttctcc
tatatgttcatcctctacaccatcttgcgaacgaactcggccacaggaagacacaaggca
ttttctacatgctcagctcacctgactgtggtgatcatattttatggtaccatcttcttt
atgtatgcaaaacctaagtcccaggacctccttgggaaagacaacttgcaagctacagag
gggcttgtttccatgttttatggggttgtgacccccatgttaaaccccataatctatagc
ttgagaaataaagatgtaaaagctgctataaaatatttgctgagcaggaaagctattaac
cagtaa

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_3|200_aa
MAFDRYVAICNPLRYPIILSKVAYVLMASVSWLSGGINSAVQTLLAMRLPFCGNNIINHF
ACEILAVLKLACADISLNIITMVISNMAFLVLPLMVIFFSYMFILYTILQMNSATGRRKA
FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLIGEEKLQALDKLISLFYGVVTPMLNPILYS
LRNKDVKAAVKYLLNKKPIH

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_3|603_bp
atggcatttgatcgttatgtggccatctgcaacccactgagataccccatcatcctgagc
aaggtggcgtatgtattgatggcttctgtgtcctggctgtccggtggaataaattcagct
gtgcaaacattacttgccatgagactgcctttctgtgggaataatattatcaatcatttc
gcatgtgaaatattagctgtcctcaagctggcctgtgctgatatatccctcaatattatc
accatggtgatatcaaatatggccttcctggttcttccactgatggtcatttttttctcc
tatatgttcatcctctacaccatcttgcaaatgaattcagccacaggaagacgcaaggca
ttttccacgtgctcagctcacctgactgtggtgatcatattttacggtaccatcttcttt
atgtatgcgaaaccgaagtctcaagacctgattggggaagaaaaattgcaagcattagac
aagctcatttctctgttttatggggtagtgacacccatgctgaatcctatactctatagc
ttgagaaataaggatgtaaaagctgctgtaaaatatttgctgaacaaaaaaccaattcac
taa

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_4|203_aa
MALDRYVAICYPLRYPVIMSKGAYVAMAAGSWVTGLVDSVVQTAFAMQLPFCANNVIKHF
VCEILAILKLACADISINVISMTGSNLIVLVIPLLVISISYIFIVATILRIPSTEGKHKA
FSTCSAHLTVVIIFYGTIFFMYAKPESKASVDSGNEDIIEALISLFYGVMTPMLNPLIYS
LRNKDVKAAVKNILCRKNFSDGK

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_4|612_bp
atggcactggaccgctatgtggccatctgctacccactgagataccctgtcatcatgagc
aagggtgcctatgtggccatggcagctgggtcctgggtcactgggcttgtggactcagta
gtgcagacagcttttgcaatgcagttaccattctgtgctaataatgtcattaaacatttt
gtctgtgaaattctggctatcttgaaactggcctgtgctgatatttcaatcaatgtgatt
agtatgacagggtcgaatctgattgttctggttattccattgttagtaatttccatctct
tacatatttattgttgccactattctgaggattccttccactgaaggaaaacataaggcc
ttctccacctgctcagcccacctgacagtggtgattatattctatggaaccatcttcttc
atgtacgcaaagcctgagtctaaagcctctgttgattcaggtaatgaagacatcattgag
gccctcatctcccttttctatggagtgatgactcccatgcttaatcctctcatctatagt
ctgcgaaacaaggatgtaaaggctgctgtcaaaaacatactgtgtaggaaaaacttttct
gatggaaaatga

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_5|104_aa
MTVVIVFYGTILFMYMKAKSKDSAFDKLIALFYGIVTPMLNPIIYSLRNTEYDVLEPVQN
KEDVNVEERLSSGCQSPSSMKGAYTQEEGVDPSTWLRTTHAKKS

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_5|315_bp
atgacagtggtgattgtgttttatgggacaatcctcttcatgtacatgaaggcaaagtcc
aaagactctgcttttgacaaactgattgccctgttctatggcatagtcacccccatgctc
aatcctatcatctatagcctgaggaatacagagtatgatgttttggagcctgtccagaat
aaggaagatgtcaacgtggaagagaggctcagttcagggtgtcaaagtccaagcagcatg
aaaggagcatacacacaagaggaaggtgtggatcctagcacctggttgaggaccactcat
gcaaagaaaagctga

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_6|554_aa
MYVVILLGNGTLILISILDPHLHTPMYFFLGNLSFLDICYTTTSIPSTLVSFLSERKTIS
LSGCAVQMFLGLAMGTTECVLLGMMAFDRYVAICNPLRYPIIMSKDAYVPMAAGSWIIGA
VNSAVQSVFVVQLPFCRNNIINHFTCEILAVMKLACADISDNEFIMLVATTLFILTPLLL
IIVSYTLIIVSIFKISSSEGRSKASSTCSAHLTVVIIFYGTILFMYMKPKSKETLNSDDL
DATDKIISMFYGVMTPMMNPLIYSLRNKDVKEAVKHLLNRRFFSNILDPHLHTPMYFFLG
NLSFLDICYTTTSIPSTLVSFLSERKTISLSGCAVQMFLSLAMGTTECVLLGVMAFDRYV
AICNPLRYPIIMSKDAYVPMAAGSWIIGAVNSAVQTVFVVQLPFCRNNIINHFTCEILAV
MKLACADISGNEFILLVTTTLFLLTPLLLIIVSYTLIILSIFKISSSEGRSKPSSTCSAR
LTVVITFCGTIFLMYMKPKSQETLNSDDLDATDKLIFIFYRVMTPMMNPLIYSLRNKDVK
EAVKHLLRRKNFNK

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_6|1665_bp
atgtatgtggtcatccttctggggaatggtactctcattttaatcagcatcttggaccct
caccttcacacccctatgtacttctttctggggaacctctccttcttggacatctgctac
accaccacctctattccctccacgctagtgagcttcctttcagaaagaaagaccatttcc
ctttctggctgtgcagtgcagatgttcctcggcttggccatggggacaacagagtgtgtg
cttctgggcatgatggcctttgaccgctatgtggctatctgcaaccctctgagatatccc
atcatcatgagtaaggatgcctatgtacccatggcagctgggtcctggatcataggagct
gtcaattctgcagtacaatcagtgtttgtggtacaattgcctttctgcaggaataacatc
atcaatcatttcacctgtgaaattctggctgtcatgaaactggcctgtgctgacatctca
gacaatgagttcatcatgcttgtggccacaacattgttcatattgacacctttgttatta
atcattgtctcttacacgttaatcattgtgagcatcttcaaaattagctcttccgagggg
agaagcaaagcttcctctacctgttcagcccatctgactgtggtcataatattctatggg
accatcctcttcatgtacatgaagcccaagtctaaagagacacttaattcggatgacttg
gatgctaccgacaaaattatatccatgttctatggggtgatgactcccatgatgaatcct
ttaatctacagtcttagaaacaaggatgtgaaagaggcagtaaaacacctactgaacaga
aggttctttagcaacatcttggaccctcaccttcacacccctatgtacttctttctgggg
aacctctccttcttggacatctgctacaccaccacctctattccctccacgctagtgagc
ttcctttcagaaagaaagaccatttccctttctggctgtgcagtgcagatgttcctcagc
ttggccatggggacaacagagtgtgtgcttctgggcgtgatggcctttgaccgctatgtg
gctatctgcaaccctctgagatatcccatcatcatgagtaaggatgcctatgtacccatg
gcagctgggtcctggatcataggagctgtcaattctgcagtacaaacagtgtttgtggta
caattgcctttctgcaggaataacatcatcaatcatttcacctgtgaaattctagctgtc
atgaaactggcctgtgctgacatctcaggcaatgagttcatcctgcttgtgaccacaaca
ttgttcctattgacacctttgttattaattattgtctcttacacgttaatcattttgagc
atcttcaaaattagctcttcggaggggagaagcaaaccttcctctacctgctcagctcgt
ctgactgtggtgataacattctgtgggaccatcttcctcatgtacatgaagcccaagtct
caagagacacttaattcagatgacttggatgccactgacaaacttatattcatattctac
agggtgatgactcccatgatgaatcctttaatctacagtcttagaaacaaggatgtgaag
gaggcagtaaaacacctactgagaagaaaaaattttaacaagtaa

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_7|212_aa
MGTTECVLLGMMAFDRYVAICNPLRYPIIMSKNAYVPMAVGSWFAGIVNSAVQTTFVVQL
PFCRKNVINHFSCEILAVMKLACADISGNEFLMLVATILFTLMPLLLIVISYSLIISSIL
KIHSSEGRSKAFSTCSAHLTVVIIFYGTILFMYMKPKSKETLNSDDLDATDKIISMFYGV
MTPMMNPLIYSLRNKDVKEAVKHLPNRRFFSK

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_7|639_bp
atggggacaacagagtgtgtgcttctgggcatgatggcctttgaccgctatgtggctatc
tgcaaccctctgagatatcccatcatcatgagcaagaatgcctatgtacccatggctgtt
gggtcctggtttgcagggattgtcaactctgcagtacaaactacatttgtagtacaattg
cctttctgcaggaagaatgtcatcaatcatttctcatgtgaaattctagctgtcatgaag
ttggcctgtgctgacatctcaggcaatgagttcctcatgcttgtggccacaatattgttc
acattgatgccactgctcttgatagttatctcttactcattaatcatttccagcatcctc
aagattcactcctctgaggggagaagcaaagctttctctacctgctcagcccatctgact
gtggtcataatattctatgggaccatcctcttcatgtatatgaagcccaagtctaaagag
acacttaattcagatgacttggatgctaccgacaaaattatatccatgttctatggggtg
atgactcccatgatgaatcctttaatctacagtcttagaaacaaggatgtgaaagaggca
gtaaaacacctaccgaacagaaggttctttagcaagtga

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_8|250_aa
MWKRLWNWVTGRRWKSLEGSEEDRKMWESLELPTDLLNGFTQNVDSNMDNKVQAEVVSDG
DEKLVGSWSKAAPDVAKRGQCRARAMASEGVSLKPQKLPHGVEPASAPKSRIGVWKPLPR
LQKMYGNTWMSRQKFAAGMGCSWRTSDRAVQEGKVGLEPPHRVPTGALPSGAVRRGPPPS
RLQNGRSTNSLHHLPGKATDTQYQPVKVAEKEAVLCKATGSELPKTMGMHFLHQCDLDMR
PESKEIILEL

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_8|753_bp
atgtggaagcgactttggaactgggtaacaggcagaagatggaagagtttggagggctca
gaagaagacaggaaaatgtgggaaagtttagaacttcctacagacttgctgaatggcttt
acccaaaatgttgatagcaatatggataataaggtccaggctgaggtggtctcagatgga
gatgagaaacttgttgggagctggagcaaagctgctccagatgtggctaaaaggggccaa
tgcagagctcgggccatggcttcagagggggtaagcctcaagcctcagaagcttccacat
ggtgttgagcctgcaagtgcaccgaagtcaagaattggggtttggaaacctctacctaga
cttcagaagatgtatggaaacacctggatgtccaggcagaagtttgctgcagggatgggg
tgctcatggagaacttctgatagggcagtgcaggagggaaaagtggggttggaaccccca
cacagagtccctactggggcactgcctagtggagctgtgagaagagggccaccaccctcc
agactccagaatggtagatctaccaacagcttgcaccatttgcctggaaaagccacagac
actcaataccagcctgtgaaagtagctgagaaggaggctgtactctgcaaagccacaggg
tcagagctgcccaagaccatgggaatgcacttcttgcatcagtgtgacctggatatgaga
ccagagtcaaaggagatcattttggagctttaa

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_9|71_aa
MPVFTELQAAEPERAVTFPGGSDLGTPRAKAVTSLGGLRLLASPSFRVPPPRLDASAQHG
SCCGTLSPAAG

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_9|216_bp
atgcccgtgttcactgagctgcaggcggcagaaccagagagagctgtaacatttcctggg
ggctcagacctcgggactcccagagcaaaagctgtaacatcccttgggggtctgcggttg
ctggcatctccaagttttcgggtgccaccacctcgtctagacgccagtgcccaacacgga
agctgctgtggcacgctgagtccagccgcaggctga

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_10|95_aa
MSSNAQRSGSHTLSNACKWLELTVMLVGAIRLGLNAITILFMYAKPKAKDSSGADKEQVT
DKIISLFYGVVTPMLNPLIYSLRNKDVKAAVKSIL

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_10|288_bp
atgagtagtaatgcccagagatctgggtctcatactctctccaatgcctgcaagtggttg
gaactgacagtgatgctggttggggcaatcaggcttggactgaatgccataaccatcctt
ttcatgtatgcaaagcccaaggctaaagactcttctggtgcagacaaagaacaagtcaca
gacaaaatcatctccctgttctatggagtggtgacacctatgcttaatcctcttatctat
agtttgaggaacaaagacgtgaaggcagctgtgaagagtatactgtga

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_11|359_aa
MEIENLEDKATENRQTDQREPQIFVFTIPKYRLISTLYQAGDLEHMETRNYSAMTEFFLV
GLSQYPELQLFLFLLCLIMYMIILLGNSLLIIITILDSRLHTPMYFFLGNLSFLDICYTS
SSIPPMLIIFMSERKSISFIGCALQMVVSLGLGSTECVLLAVMAYDHYVAICNPLRYSII
MNGVLYVQMAAWSWIIGCLTSLLQTVLTMMLPFCGNNVIDHITCEILALLKLVCSDITIN
VLIMTVTNIVSLVILLLLIFISYVFILSSILRINCAEGRKKAFSTCSAHSIVVILFYGSA
LFMYMKPKSKNTNTSDEIIGLSYGVVSPMLNPIIYSLRNKEVKEAVKKVLSRHLHLLKM

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_11|1080_bp
atggagattgaaaatctggaagataaagccacagaaaataggcagacagaccaacgagag
cctcaaatttttgtgtttactatacccaaataccggctgatctctacactataccaggca
ggtgacctagaacacatggagacaagaaattactctgccatgactgaattctttctggtg
gggctttcccaatatccagagctccagctttttctgttcctgctctgcctcatcatgtac
atgataatcctcctgggaaatagcctcctcattatcatcaccatcttggattctcgcctc
catactcccatgtatttctttcttggaaacctctcattcttggacatctgttacacatcc
tcatccattcctccaatgcttattatatttatgtctgagagaaaatccatctccttcatt
ggctgtgctctgcagatggttgtgtcccttggcttgggctccactgagtgtgtcctcctg
gctgtgatggcctatgaccactatgtggccatctgcaacccactgaggtactccatcatc
atgaacggagtgctgtatgtgcaaatggctgcatggtcctggatcataggctgtctgacc
tccctattgcaaacagttctgacaatgatgttgcctttctgtgggaataatgtcattgat
catattacctgtgaaattttggcccttctaaaacttgtttgttcagatatcaccatcaat
gtgcttatcatgacagtgacaaatattgtttcactggtgattcttctactgttaattttc
atctcctatgtgtttattctctcttccatcctgagaattaattgtgctgagggaagaaag
aaagccttctctacctgttcagcgcactcgattgtggtcatcttattctacggttcagcc
ctttttatgtacatgaaacccaagtcaaagaacactaatacatctgatgagattattggg
ctgtcttatggagtggtaagcccaatgttaaatcccatcatctatagcctcaggaataaa
gaggtcaaagaggctgtaaagaaagtcctgagcagacatctgcatttattgaaaatgtga

>gi568815589r:104504674_104705627|GENSCAN_predicted_peptide_12|171_aa
MTELKNTARELREAYTSINGQINQAEERISEIEDQLDKIKHEDNIREKRVKRNEQSLQDT
WDYVKRPNLCLIGVPESDGENGTKLENTLHDIIQENFPNPARQANIQIQEIQRTPQRYFL
RTATPRCIIIRFTKVEMKEKMLREARGKGWVTHKGKPIRLTADISAEILQA

>gi568815589r:104504674_104705627|GENSCAN_predicted_CDS_12|516_bp
atgacagagctgaaaaacacagcacgagaacttcgtgaagcatacacaagtatcaatggc
caaatcaatcaagcagaagaaaggatatcagagattgaagatcaacttgataaaataaag
catgaagacaacattagagaaaaaagagtgaaaaggaatgaacaaagcctccaagataca
tgggactatgtgaaaagaccaaacctgtgtttgattggtgtacctgaaagtgatggggag
aatggaaccaaattggaaaacacccttcacgatatcatccaggagaacttccccaaccca
gcaagacaggccaacattcaaattcaggaaatacaaagaacaccacaaagatatttcttg
agaacagcaaccccaagatgcataatcatcagattcaccaaggttgaaatgaaggaaaaa
atgttaagggaagccagagggaaaggttgggttacccacaaagggaagcccatcagacta
acagcggatatctctgcagaaatcctacaagcctga