GENSCAN 1.0 Date run: 5-Nov-116 Time: 03:49:44 Sequence gi568815589r:104435773_104636813 : 201041 bp : 36.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 9064 9265 202 1 1 70 56 104 0.331 3.77 1.02 Term + 18312 18371 60 2 0 105 51 87 0.539 3.53 1.03 PlyA + 18514 18519 6 1.05 2.06 PlyA - 18625 18620 6 1.05 2.05 Term - 24314 24161 154 1 1 89 48 137 0.663 6.21 2.04 Intr - 35216 35119 98 2 2 87 84 30 0.286 0.39 2.03 Intr - 37881 37777 105 0 0 73 75 90 0.560 5.69 2.02 Intr - 42729 42597 133 2 1 46 119 14 0.409 0.03 2.01 Init - 60178 59985 194 1 2 67 60 168 0.361 10.39 2.00 Prom - 65755 65716 40 -3.75 3.00 Prom + 66704 66743 40 -8.95 3.01 Init + 68809 69378 570 0 0 73 52 333 0.010 23.33 3.02 Intr + 85235 85403 169 1 1 29 71 147 0.031 5.70 3.03 Term + 87082 87248 167 2 2 21 38 125 0.082 -2.10 3.04 PlyA + 87950 87955 6 1.05 4.02 PlyA - 88105 88100 6 1.05 4.01 Sngl - 91086 90481 606 0 0 87 44 205 0.427 11.94 4.00 Prom - 91962 91923 40 -4.95 5.02 PlyA - 93903 93898 6 1.05 5.01 Sngl - 100600 99998 603 1 0 87 39 314 0.384 22.34 5.00 Prom - 105842 105803 40 -3.45 6.00 Prom + 119925 119964 40 -3.75 6.01 Sngl + 133747 134358 612 0 0 99 42 420 0.802 34.44 6.02 PlyA + 134830 134835 6 1.05 7.00 Prom + 149578 149617 40 -3.75 7.01 Init + 154975 155127 153 0 0 72 42 95 0.619 3.33 7.02 Term + 156597 156758 162 2 0 14 48 214 0.941 6.95 7.03 PlyA + 158832 158837 6 1.05 8.03 PlyA - 158948 158943 6 1.05 8.02 Term - 163495 162685 811 0 1 17 36 355 0.004 14.86 8.01 Init - 169756 168903 854 1 2 38 89 510 0.027 40.26 8.00 Prom - 171933 171894 40 -6.95 9.02 PlyA - 172081 172076 6 1.05 9.01 Sngl - 182114 181476 639 2 0 83 42 318 0.553 22.63 9.00 Prom - 187157 187118 40 -7.65 10.00 Prom + 187453 187492 40 -3.65 10.01 Init + 188718 188928 211 2 1 68 57 144 0.551 8.29 10.02 Term + 189388 189929 542 2 2 32 38 251 0.520 7.93 10.03 PlyA + 190663 190668 6 1.05 11.02 PlyA - 190818 190813 6 1.05 11.01 Sngl - 198671 198456 216 2 0 68 53 253 0.966 14.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_1|87_aa XCFLPSNNRLQVLQIWILGLTSAICQGLQPQTEGCTVSFPTFEVLGLGLASLLLSLQTAY HGTSPCDCSLRVTTIEKKNKELGKVQR >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_1|264_bp nnatgctttctgccctcgaacaacagactccaagttcttcaaatttggattcttggactt acatcagcgatttgccagggccttcagccacagactgaaggctgcactgtcagcttccct acttttgaggttttgggacttggactggcttccttgctcctcagcttgcagacggcctat catgggacttcaccttgtgattgtagtttaagagtgaccacaattgagaaaaagaacaag gaactgggcaaagttcagagatga >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_2|227_aa MSKKNVQGTKSKRWGHKLHQKAVCSVKQEHGAKPRRLEVMGHKIQPTKDSNRQTKAYIGE KSNHWIWRENKSGLDSFGTQKLFPVPWGMEVARTVGWGSLDFDRHQNDLAALNVAKGYRY ELVFAKILDFEIHRVPFEDLWTVEDGGSPFAFYHDWKLPEASPEAEATILPVQPEEPCYF TDLVHSLSQVSIRHPPYGDVINKRPSTVNSPDFPPGTRFISMNDFYE >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_2|684_bp atgtctaagaaaaatgttcaaggaacaaagtcaaagaggtggggccacaaacttcatcaa aaggcggtttgcagtgtgaagcaggaacacggagcaaagccaaggcgtctggaagtcatg ggacacaagatccaaccaacaaaagattcaaatcgtcagacgaaggcctatattggtgaa aaaagtaaccattggatttggagagaaaataaaagtgggctggactcttttggaacccag aaacttttccctgtcccctggggcatggaggtggcaagaactgtggggtggggttcattg gattttgataggcatcaaaatgacctggctgcacttaacgttgcaaagggctacaggtat gagctggtatttgctaagatcttggattttgaaattcatagagttccatttgaggatcta tggactgtggaggatggcggctccccctttgctttctaccatgattggaagcttcctgag gcctccccagaagcagaagccactatacttcctgtacagcctgaagaaccgtgttacttc acagacctggttcattccttgtctcaggtctccatccggcacccaccctatggagacgtc attaataaacgtccatccacagtaaacagcccagactttccacctggcactaggtttatt tcaatgaatgacttttatgagtga >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_3|301_aa MGSTECVLLPMMAYDRYVAICNPLRYPVIMNRRTCVQIAAGSWMTGCLTAMVEMMSVLPL SLCGNSIINHFTCEILAILKLVCVDTSLVQLIMLVISVLLLPMPMLLICISYAFILASIL RISSVEGRSKAFSTCTAHLMVVVLFYGTALSMHLKPSAVDSQEIDKFMALVYAGQTPMLN PIIYSLRNKEPPLVIPRQTGSGVDLQQTPADLQQRGLTVGRKTNKQKGIAHPLRDPIRRS PTSKTKEKGNLHRKLRLKAAVKTTGDVQQGYAVLGHVRRIHRKRNKKDCNRFTEEFKLTG M >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_3|906_bp atgggctccactgagtgtgtgctcctgcccatgatggcatatgaccggtatgtggccatc tgcaaccccctgagataccctgtcatcatgaataggagaacctgtgtgcagattgcagct ggctcctggatgacaggctgtctcactgccatggtggaaatgatgtctgtgctgccactg tctctctgtggtaatagcatcatcaatcatttcacttgtgaaattctggccatcttgaaa ttggtttgtgtggacacctccctggtgcagttaatcatgctggtgatcagtgtacttctt ctccccatgccaatgctactcatttgtatctcttatgcatttatcctcgccagtatcctg agaatcagctcagtggaaggtcgaagtaaagccttttcaacgtgcacagcccacctgatg gtggtagttttgttctatgggacggctctctccatgcacctgaagccctccgctgtagat tcacaggaaatagacaaatttatggctttggtgtatgccggacaaacccccatgttgaat cctatcatctatagtctacggaacaaagagcctccgctggtgatacccaggcaaacaggg tctggggtggaccttcagcaaactccagcagacctgcagcagaggggcttgactgttgga aggaaaactaacaaacagaaaggaatagcacatccactcagagaccccatccgaaggtca ccaacatcaaagaccaaagaaaaaggaaatttacataggaaattaaggctaaaagcagct gtaaaaacgactggtgatgtgcagcaaggatatgctgtgcttggccatgtcagaagaatc cacagaaaaagaaacaaaaaggattgtaataggtttactgaagagtttaaactcactgga atgtaa >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_4|201_aa MAFDRYVAICNPLRYPIIMNKVVYVLLTSVSWLSGGINSTVQTSLAMRWPFCGNNIINHF LCEILAVLKLACSDISVNIVTLAVSNIAFLVLPLLVIFFSYMFILYTILRTNSATGRHKA FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLLGKDNLQATEGLVSMFYGVVTPMLNPIIYS LRNKDVKAAIKYLLSRKAINQ >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_4|606_bp atggcatttgatcgttatgtggccatctgtaaccctctgagataccccatcatcatgaac aaggtggtgtatgtactgctgacttctgtatcatggctttctggtggaatcaattcaact gtgcaaacatcacttgccatgcgatggcctttctgtgggaacaatattattaatcatttc ttatgcgagatcttagctgtcctaaaattagcttgttctgatatatctgtcaatattgtt accctagcagtgtcaaatattgctttcctagttcttcctctgctcgtgatttttttctcc tatatgttcatcctctacaccatcttgcgaacgaactcggccacaggaagacacaaggca ttttctacatgctcagctcacctgactgtggtgatcatattttatggtaccatcttcttt atgtatgcaaaacctaagtcccaggacctccttgggaaagacaacttgcaagctacagag gggcttgtttccatgttttatggggttgtgacccccatgttaaaccccataatctatagc ttgagaaataaagatgtaaaagctgctataaaatatttgctgagcaggaaagctattaac cagtaa >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_5|200_aa MAFDRYVAICNPLRYPIILSKVAYVLMASVSWLSGGINSAVQTLLAMRLPFCGNNIINHF ACEILAVLKLACADISLNIITMVISNMAFLVLPLMVIFFSYMFILYTILQMNSATGRRKA FSTCSAHLTVVIIFYGTIFFMYAKPKSQDLIGEEKLQALDKLISLFYGVVTPMLNPILYS LRNKDVKAAVKYLLNKKPIH >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_5|603_bp atggcatttgatcgttatgtggccatctgcaacccactgagataccccatcatcctgagc aaggtggcgtatgtattgatggcttctgtgtcctggctgtccggtggaataaattcagct gtgcaaacattacttgccatgagactgcctttctgtgggaataatattatcaatcatttc gcatgtgaaatattagctgtcctcaagctggcctgtgctgatatatccctcaatattatc accatggtgatatcaaatatggccttcctggttcttccactgatggtcatttttttctcc tatatgttcatcctctacaccatcttgcaaatgaattcagccacaggaagacgcaaggca ttttccacgtgctcagctcacctgactgtggtgatcatattttacggtaccatcttcttt atgtatgcgaaaccgaagtctcaagacctgattggggaagaaaaattgcaagcattagac aagctcatttctctgttttatggggtagtgacacccatgctgaatcctatactctatagc ttgagaaataaggatgtaaaagctgctgtaaaatatttgctgaacaaaaaaccaattcac taa >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_6|203_aa MALDRYVAICYPLRYPVIMSKGAYVAMAAGSWVTGLVDSVVQTAFAMQLPFCANNVIKHF VCEILAILKLACADISINVISMTGSNLIVLVIPLLVISISYIFIVATILRIPSTEGKHKA FSTCSAHLTVVIIFYGTIFFMYAKPESKASVDSGNEDIIEALISLFYGVMTPMLNPLIYS LRNKDVKAAVKNILCRKNFSDGK >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_6|612_bp atggcactggaccgctatgtggccatctgctacccactgagataccctgtcatcatgagc aagggtgcctatgtggccatggcagctgggtcctgggtcactgggcttgtggactcagta gtgcagacagcttttgcaatgcagttaccattctgtgctaataatgtcattaaacatttt gtctgtgaaattctggctatcttgaaactggcctgtgctgatatttcaatcaatgtgatt agtatgacagggtcgaatctgattgttctggttattccattgttagtaatttccatctct tacatatttattgttgccactattctgaggattccttccactgaaggaaaacataaggcc ttctccacctgctcagcccacctgacagtggtgattatattctatggaaccatcttcttc atgtacgcaaagcctgagtctaaagcctctgttgattcaggtaatgaagacatcattgag gccctcatctcccttttctatggagtgatgactcccatgcttaatcctctcatctatagt ctgcgaaacaaggatgtaaaggctgctgtcaaaaacatactgtgtaggaaaaacttttct gatggaaaatga >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_7|104_aa MTVVIVFYGTILFMYMKAKSKDSAFDKLIALFYGIVTPMLNPIIYSLRNTEYDVLEPVQN KEDVNVEERLSSGCQSPSSMKGAYTQEEGVDPSTWLRTTHAKKS >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_7|315_bp atgacagtggtgattgtgttttatgggacaatcctcttcatgtacatgaaggcaaagtcc aaagactctgcttttgacaaactgattgccctgttctatggcatagtcacccccatgctc aatcctatcatctatagcctgaggaatacagagtatgatgttttggagcctgtccagaat aaggaagatgtcaacgtggaagagaggctcagttcagggtgtcaaagtccaagcagcatg aaaggagcatacacacaagaggaaggtgtggatcctagcacctggttgaggaccactcat gcaaagaaaagctga >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_8|554_aa MYVVILLGNGTLILISILDPHLHTPMYFFLGNLSFLDICYTTTSIPSTLVSFLSERKTIS LSGCAVQMFLGLAMGTTECVLLGMMAFDRYVAICNPLRYPIIMSKDAYVPMAAGSWIIGA VNSAVQSVFVVQLPFCRNNIINHFTCEILAVMKLACADISDNEFIMLVATTLFILTPLLL IIVSYTLIIVSIFKISSSEGRSKASSTCSAHLTVVIIFYGTILFMYMKPKSKETLNSDDL DATDKIISMFYGVMTPMMNPLIYSLRNKDVKEAVKHLLNRRFFSNILDPHLHTPMYFFLG NLSFLDICYTTTSIPSTLVSFLSERKTISLSGCAVQMFLSLAMGTTECVLLGVMAFDRYV AICNPLRYPIIMSKDAYVPMAAGSWIIGAVNSAVQTVFVVQLPFCRNNIINHFTCEILAV MKLACADISGNEFILLVTTTLFLLTPLLLIIVSYTLIILSIFKISSSEGRSKPSSTCSAR LTVVITFCGTIFLMYMKPKSQETLNSDDLDATDKLIFIFYRVMTPMMNPLIYSLRNKDVK EAVKHLLRRKNFNK >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_8|1665_bp atgtatgtggtcatccttctggggaatggtactctcattttaatcagcatcttggaccct caccttcacacccctatgtacttctttctggggaacctctccttcttggacatctgctac accaccacctctattccctccacgctagtgagcttcctttcagaaagaaagaccatttcc ctttctggctgtgcagtgcagatgttcctcggcttggccatggggacaacagagtgtgtg cttctgggcatgatggcctttgaccgctatgtggctatctgcaaccctctgagatatccc atcatcatgagtaaggatgcctatgtacccatggcagctgggtcctggatcataggagct gtcaattctgcagtacaatcagtgtttgtggtacaattgcctttctgcaggaataacatc atcaatcatttcacctgtgaaattctggctgtcatgaaactggcctgtgctgacatctca gacaatgagttcatcatgcttgtggccacaacattgttcatattgacacctttgttatta atcattgtctcttacacgttaatcattgtgagcatcttcaaaattagctcttccgagggg agaagcaaagcttcctctacctgttcagcccatctgactgtggtcataatattctatggg accatcctcttcatgtacatgaagcccaagtctaaagagacacttaattcggatgacttg gatgctaccgacaaaattatatccatgttctatggggtgatgactcccatgatgaatcct ttaatctacagtcttagaaacaaggatgtgaaagaggcagtaaaacacctactgaacaga aggttctttagcaacatcttggaccctcaccttcacacccctatgtacttctttctgggg aacctctccttcttggacatctgctacaccaccacctctattccctccacgctagtgagc ttcctttcagaaagaaagaccatttccctttctggctgtgcagtgcagatgttcctcagc ttggccatggggacaacagagtgtgtgcttctgggcgtgatggcctttgaccgctatgtg gctatctgcaaccctctgagatatcccatcatcatgagtaaggatgcctatgtacccatg gcagctgggtcctggatcataggagctgtcaattctgcagtacaaacagtgtttgtggta caattgcctttctgcaggaataacatcatcaatcatttcacctgtgaaattctagctgtc atgaaactggcctgtgctgacatctcaggcaatgagttcatcctgcttgtgaccacaaca ttgttcctattgacacctttgttattaattattgtctcttacacgttaatcattttgagc atcttcaaaattagctcttcggaggggagaagcaaaccttcctctacctgctcagctcgt ctgactgtggtgataacattctgtgggaccatcttcctcatgtacatgaagcccaagtct caagagacacttaattcagatgacttggatgccactgacaaacttatattcatattctac agggtgatgactcccatgatgaatcctttaatctacagtcttagaaacaaggatgtgaag gaggcagtaaaacacctactgagaagaaaaaattttaacaagtaa >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_9|212_aa MGTTECVLLGMMAFDRYVAICNPLRYPIIMSKNAYVPMAVGSWFAGIVNSAVQTTFVVQL PFCRKNVINHFSCEILAVMKLACADISGNEFLMLVATILFTLMPLLLIVISYSLIISSIL KIHSSEGRSKAFSTCSAHLTVVIIFYGTILFMYMKPKSKETLNSDDLDATDKIISMFYGV MTPMMNPLIYSLRNKDVKEAVKHLPNRRFFSK >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_9|639_bp atggggacaacagagtgtgtgcttctgggcatgatggcctttgaccgctatgtggctatc tgcaaccctctgagatatcccatcatcatgagcaagaatgcctatgtacccatggctgtt gggtcctggtttgcagggattgtcaactctgcagtacaaactacatttgtagtacaattg cctttctgcaggaagaatgtcatcaatcatttctcatgtgaaattctagctgtcatgaag ttggcctgtgctgacatctcaggcaatgagttcctcatgcttgtggccacaatattgttc acattgatgccactgctcttgatagttatctcttactcattaatcatttccagcatcctc aagattcactcctctgaggggagaagcaaagctttctctacctgctcagcccatctgact gtggtcataatattctatgggaccatcctcttcatgtatatgaagcccaagtctaaagag acacttaattcagatgacttggatgctaccgacaaaattatatccatgttctatggggtg atgactcccatgatgaatcctttaatctacagtcttagaaacaaggatgtgaaagaggca gtaaaacacctaccgaacagaaggttctttagcaagtga >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_10|250_aa MWKRLWNWVTGRRWKSLEGSEEDRKMWESLELPTDLLNGFTQNVDSNMDNKVQAEVVSDG DEKLVGSWSKAAPDVAKRGQCRARAMASEGVSLKPQKLPHGVEPASAPKSRIGVWKPLPR LQKMYGNTWMSRQKFAAGMGCSWRTSDRAVQEGKVGLEPPHRVPTGALPSGAVRRGPPPS RLQNGRSTNSLHHLPGKATDTQYQPVKVAEKEAVLCKATGSELPKTMGMHFLHQCDLDMR PESKEIILEL >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_10|753_bp atgtggaagcgactttggaactgggtaacaggcagaagatggaagagtttggagggctca gaagaagacaggaaaatgtgggaaagtttagaacttcctacagacttgctgaatggcttt acccaaaatgttgatagcaatatggataataaggtccaggctgaggtggtctcagatgga gatgagaaacttgttgggagctggagcaaagctgctccagatgtggctaaaaggggccaa tgcagagctcgggccatggcttcagagggggtaagcctcaagcctcagaagcttccacat ggtgttgagcctgcaagtgcaccgaagtcaagaattggggtttggaaacctctacctaga cttcagaagatgtatggaaacacctggatgtccaggcagaagtttgctgcagggatgggg tgctcatggagaacttctgatagggcagtgcaggagggaaaagtggggttggaaccccca cacagagtccctactggggcactgcctagtggagctgtgagaagagggccaccaccctcc agactccagaatggtagatctaccaacagcttgcaccatttgcctggaaaagccacagac actcaataccagcctgtgaaagtagctgagaaggaggctgtactctgcaaagccacaggg tcagagctgcccaagaccatgggaatgcacttcttgcatcagtgtgacctggatatgaga ccagagtcaaaggagatcattttggagctttaa >gi568815589r:104435773_104636813|GENSCAN_predicted_peptide_11|71_aa MPVFTELQAAEPERAVTFPGGSDLGTPRAKAVTSLGGLRLLASPSFRVPPPRLDASAQHG SCCGTLSPAAG >gi568815589r:104435773_104636813|GENSCAN_predicted_CDS_11|216_bp atgcccgtgttcactgagctgcaggcggcagaaccagagagagctgtaacatttcctggg ggctcagacctcgggactcccagagcaaaagctgtaacatcccttgggggtctgcggttg ctggcatctccaagttttcgggtgccaccacctcgtctagacgccagtgcccaacacgga agctgctgtggcacgctgagtccagccgcaggctga