GENSCAN 1.0 Date run: 6-Nov-116 Time: 11:03:01 Sequence gi568815596r:6749853_6965696 : 215844 bp : 43.02% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 540 668 129 2 0 66 42 148 0.976 8.15 1.02 Intr + 1589 1743 155 1 2 50 59 98 0.576 1.97 1.03 Intr + 4847 4962 116 2 2 37 127 17 0.291 0.59 1.04 Intr + 9579 9696 118 1 1 112 52 32 0.178 1.52 1.05 Intr + 11268 11436 169 0 1 58 97 34 0.191 1.25 1.06 Intr + 12361 12548 188 0 2 24 109 85 0.147 2.69 1.07 Term + 22867 22954 88 1 1 52 40 171 0.093 5.83 1.08 PlyA + 22999 23004 6 1.05 2.04 PlyA - 23171 23166 6 1.05 2.03 Term - 23332 23194 139 0 1 87 45 41 0.493 -2.86 2.02 Intr - 23531 23407 125 2 2 109 81 82 0.555 8.98 2.01 Init - 31189 31088 102 1 0 77 84 68 0.770 5.54 2.00 Prom - 33577 33538 40 -4.26 3.02 PlyA - 36060 36055 6 1.05 3.01 Sngl - 40088 39819 270 2 0 80 47 139 0.821 4.28 3.00 Prom - 40294 40255 40 -4.06 4.06 PlyA - 40341 40336 6 1.05 4.05 Term - 46412 46283 130 1 1 96 47 69 0.023 1.25 4.04 Intr - 67462 67363 100 2 1 52 96 71 0.061 3.47 4.03 Intr - 71296 71143 154 1 1 120 -13 80 0.062 0.75 4.02 Intr - 71571 71439 133 2 1 92 72 46 0.740 4.05 4.01 Init - 84443 84352 92 2 2 61 94 35 0.284 1.36 4.00 Prom - 86308 86269 40 -4.96 5.08 PlyA - 87448 87443 6 1.05 5.07 Term - 90826 90711 116 2 2 117 44 74 0.615 4.63 5.06 Intr - 99511 99427 85 1 1 62 82 21 0.428 -1.71 5.05 Intr - 100121 100037 85 1 1 97 19 98 0.395 3.62 5.04 Intr - 101831 101598 234 1 0 73 56 146 0.469 6.70 5.03 Intr - 111533 111332 202 0 1 57 96 155 0.988 11.54 5.02 Intr - 113726 113612 115 2 1 46 94 158 0.995 12.22 5.01 Init - 115844 115170 675 2 0 87 94 950 0.995 88.67 5.00 Prom - 124022 123983 40 -6.06 6.00 Prom + 127890 127929 40 -4.66 6.01 Init + 127949 128294 346 1 1 82 105 237 0.565 20.48 6.02 Intr + 133519 133680 162 2 0 43 54 138 0.604 5.85 6.03 Intr + 137083 137312 230 2 2 65 106 240 0.994 21.09 6.04 Intr + 140324 140473 150 1 0 73 1 175 0.322 7.66 6.05 Intr + 143819 143851 33 1 0 105 73 40 0.479 2.62 6.06 Term + 145926 146090 165 2 0 54 43 130 0.528 3.02 6.07 PlyA + 147451 147456 6 1.05 7.02 PlyA - 148736 148731 6 1.05 7.01 Sngl - 154092 153493 600 0 0 79 49 312 0.994 22.60 7.00 Prom - 166092 166053 40 -3.56 8.10 PlyA - 166291 166286 6 1.05 8.09 Term - 171787 171423 365 2 2 91 44 132 0.467 3.83 8.08 Intr - 173827 173673 155 0 2 47 44 53 0.221 -3.48 8.07 Intr - 174585 174378 208 1 1 56 32 162 0.322 5.54 8.06 Intr - 175230 175040 191 2 2 19 70 53 0.235 -4.17 8.05 Intr - 176872 176717 156 0 0 26 69 121 0.430 3.13 8.04 Intr - 179147 178962 186 1 0 28 94 118 0.120 5.20 8.03 Intr - 181315 181207 109 2 1 55 76 39 0.061 -1.26 8.02 Intr - 186210 186141 70 0 1 95 77 75 0.111 5.85 8.01 Init - 206624 206463 162 2 0 85 57 101 0.277 6.43 8.00 Prom - 212279 212240 40 -3.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 20313 20227 87 1 0 85 86 55 0.884 4.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:6749853_6965696|GENSCAN_predicted_peptide_1|320_aa MGSGEENVCVLVGNGTEKRKSIFSAKIAVCVPFECTRREGKQKLKWGDIAQIVLTLNAPT FAGRGTSLGLRHIDESSDNTESHVHCDINSSLNLFLCVSLGNSTCTVISLCFLNPELFDS LGSHLYSACNSCNGSVQPLLLMLHSVPGKLDASSMFNFADTSISLVPSSRFLFRGQVFTD IMFMFDSPWTPAGALYLNCGATHKGHGQSGCRFARCSYPSDTQELFEGQTLPAEVTTVPA EEDAFPQEHGHKETVSKAALSVSLLLRFWDSDWLPGSSDCRHPIVGLHFVIRLSPSFLGQ TFTVEQKAAYNIAQIKAKKN >gi568815596r:6749853_6965696|GENSCAN_predicted_CDS_1|963_bp atgggaagtggagaagagaacgtctgcgtcttagtgggaaatggcacagagaagaggaaa tctattttctcagccaaaattgccgtgtgcgttcccttcgaatgcacacgaagggaagga aaacagaagctaaaatggggagacatagcacagattgttctaactctcaatgcccctaca tttgctggtagaggtacgtccttaggtctacggcacattgacgagtctagtgacaatacg gaatcccacgtccactgtgacattaactcatcactcaaccttttcctctgtgtctccctg ggcaacagcacctgtaccgtcattagtctttgctttctcaacccagagctgtttgactca ctgggttcacacctctattctgcttgcaattcatgtaacggttctgtacagcctctcctg ctcatgctgcacagcgttccgggaaagctggatgcatcgagcatgtttaatttcgctgat acatccatctctttggtcccttcatcacgatttctttttagaggccaggtgttcacagac ataatgttcatgtttgactcaccctggacccctgctggggcactgtatctaaactgtggg gcaacccataaaggccatggccaatctggttgtagatttgcaagatgttcttatcctagt gatacccaagagctgtttgagggacagactctcccagctgaagtgacaactgtcccagct gaagaagatgcctttccccaggagcatggccacaaggagacagtgtccaaggctgcactg tcagtctccttacttttgagattttgggactcggactggcttcctggctcctcagattgc agacatcctattgtgggacttcactttgtgatcagattgtccccgtccttcctcgggcag accttcaccgtggagcagaaagctgcctacaatatcgcccaaatcaaagccaagaagaat tag >gi568815596r:6749853_6965696|GENSCAN_predicted_peptide_2|121_aa MWKDDANLAGSQRENICFNRQEMLGCRESPRKSQAAEDRTGERAAASRGLTTGWHCCSME SCQRRPRGPGHVEHKSVSCRPGPVSDCGLWVEPMGSVRVELALPRPTAPGAATSRVTRAA S >gi568815596r:6749853_6965696|GENSCAN_predicted_CDS_2|366_bp atgtggaaagatgatgccaatctggctggcagccagagagagaacatctgttttaataga caggaaatgctgggatgccgtgagagcccccgtaagagccaggccgctgaggaccgcaca ggggagcgggcggcggcatcgcgaggactgaccactgggtggcactgttgctccatggag agctgccaacggcgtccccggggacccggccacgttgagcacaaaagcgtctcctgccgc ccggggccggtttctgactgtggactctgggttgagcccatggggtcagtgcgcgtggag cttgcactgcccagacccacagccccaggagcagccacctcccgtgtcacccgggcggcg tcctga >gi568815596r:6749853_6965696|GENSCAN_predicted_peptide_3|89_aa MPECFPNAMRNSTWNYRLELMCKSIGLVRVQLDELTFLQDATTNAAPLESNAEGEVEATA RPSTLHDALGCPVPDIVWLASLRSSCYKE >gi568815596r:6749853_6965696|GENSCAN_predicted_CDS_3|270_bp atgcctgagtgctttcctaatgccatgagaaattcaacgtggaattacaggcttgagtta atgtgcaagagtatcgggttggttcgtgtgcagctagatgagctgacttttctgcaggat gctactaccaatgcagcccctctagagagcaatgctgaaggggaagtagaggctacagcg aggccatccactcttcatgatgcattaggatgtcctgttcctgacatagtctggctggct tcattacgctcatcctgctacaaagaataa >gi568815596r:6749853_6965696|GENSCAN_predicted_peptide_4|202_aa MSCDDRGRDWTDAAESQEHQNMIITTKRWKRLTGAQVKKEPQTYFLPCPFRPLGADISFA AVRPSLPRSLELNAGMRAPTLPPTPAAELRFLSTAVRFQPRSDASDVISSSCLWTWFSQG LSPCVHEFPGLAEAEDTCSTCLPQLVDHGLLSWVGHIQAGQHFTPRSPHSLNSTNAGFGE TDFADIWEPYSGPIREQATTYE >gi568815596r:6749853_6965696|GENSCAN_predicted_CDS_4|609_bp atgtcatgtgatgatagaggtagagattggactgatgcagctgaaagccaggaacaccaa aatatgatcatcactaccaaaagatggaagaggctcacaggagcccaggtcaagaaggaa ccacagacgtacttcctcccctgccccttccgtcccctgggagctgacatttcctttgca gcagtccgtcccagtttacccaggagcctggagctcaacgctgggatgagagcacccacg cttcctccaactcctgccgcagagctgcgtttcctcagtactgctgtccgcttccagccc cgctccgacgcatctgacgtcatttcctcgtcttgtctctggacctggttctcccaaggc ctcagtccctgtgtccatgagtttccaggcctggcagaggctgaggacacatgcagcacc tgcctgccccagctggtggaccatgggttgctgtcctgggttggacatatccaggcaggc cagcattttactcctcgctcccctcattctttaaactccaccaacgcaggctttggagaa acagactttgctgacatttgggagccatacagtggccccatcagagaacaggcaacaact tatgaatga >gi568815596r:6749853_6965696|GENSCAN_predicted_peptide_5|503_aa MAFARRLLRGPLSGPLLGRRGVCAGAMAPPRRFVLELPDCTLAHFALGADAPGDADAPDP RLAALLGPPERSYSLCVPVTPDAGCGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCP GGQAGGAQQGFLLRDPLDDPDTRQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQ DGRRLQVGCAQVVPVPEPPLHPVVPDLPSSVVFPDREAARAVLEECTSFIPEARAVLDLV DQCPKQIQKGKFQVVAIEGLDATGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPT IIRRAFYSLGNYIVASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVY QWPEDLLKPDLILLLTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENP GCHVVDASPSREKVLQTSQILEPSVPAEAVYVSWLPKDTPAEPFLGPSWEEPYTILLSTC SAVKFVGLESWIHHTRVKAWNDP >gi568815596r:6749853_6965696|GENSCAN_predicted_CDS_5|1512_bp atggccttcgcccgccggctcctgcgcgggccactgtcggggccgctgctcgggcggcgc ggggtctgcgctggggccatggctccgccgcgccgcttcgtcctggagcttcccgactgc accctggctcacttcgccctaggcgccgacgcccccggcgacgcagacgcccccgacccc cgcctggcggcgctgctggggcccccggagcgcagctactcgctgtgcgtgcccgtgacc ccggacgccggctgcggggcccgggtccgggcggcgcggctgcaccagcgcctgctgcac cagctgcgccgcggccccttccagcggtgccagctgctcaggctgctctgctactgcccg ggcggccaggccggcggcgcacagcaaggcttcctgctgcgcgaccccctggatgaccct gacacccggcaagcgctgctcgagctgctgggcgcctgtcaggaggcaccacgcccgcac ttgggcgagttcgaggccgacccgcgcggccagctgtggcagcgcctctgggaggtgcaa gacggcaggcggctgcaggtgggctgcgcacaggtcgtgcccgtcccggagcccccgctg cacccggtggtgccagacttgcccagttccgtggtcttcccggaccgggaagccgcccgg gccgttttggaggagtgtacctcctttattcctgaagcccgggcagtgcttgacctggtc gaccagtgcccaaaacagatccagaaaggaaagttccaggttgttgccatcgaaggactg gatgccacgggtaaaaccacggtgacccagtcagtggcagattcacttaaggctgtcctc ttaaagtcaccaccctcttgcattggccagtggaggaagatctttgatgatgaaccaact atcattagaagagctttttactctttgggcaattatattgtggcctccgaaatagctaaa gaatctgccaaatctcctgtgattgtagacaggtactggcacagcacggccacctatgcc atagccactgaggtgagtgggggtctccagcacctgcccccagcccatcaccctgtgtac cagtggccagaggacctgctcaaacctgaccttatcctgctgctcactgtgagtcctgag gagaggttgcagaggctgcagggccggggcatggagaagaccagggaagaagcagaactt gaggccaacagtgtgtttcgtcaaaaggtagaaatgtcctaccagcggatggagaatcct ggctgccatgtggttgatgccagcccctccagagaaaaggtcctgcagacgtcccaaatt ctagagccaagtgttcctgcagaggctgtctatgtgtcctggctgcccaaggacactcct gcagagccatttttgggtcccagctgggaggaaccttataccatccttctctccacctgc tcggcagtgaagtttgtgggactggaatcttggattcatcacactcgagtcaaggcctgg aatgacccctga >gi568815596r:6749853_6965696|GENSCAN_predicted_peptide_6|361_aa MWVLTPAAFAGKLLSVFRQPLSSLWRSLVPLFCWLRATFWLLATKRRKQQLVLRGPDETK EEEEDPPLPTTPTSVNYHFTRQCNYKCGFCFHTAKTSFVLPLEEAKRGLLLLKEAGMEKI NFSGGEPFLQDRGEYLGKLVRFCKVELRLPSVSIVSNGSLIRERWFQNYGEYLDILAISC DSFDEEVNVLIGRGQGKKNHVENLQKLRRWCRDYRVAFKINSVINRFNVEEDMTEQIKAL NPVRWKVFQCLLIEGENCGEDALREAERFVIGDEEFERFLERHKEVSCLVPESNQKMKDS YLILDEYMRFLNCRKGRKDPSKSILDVGVEEAIKFSGFDEKMFLKRGGKYIWSKADLKLD W >gi568815596r:6749853_6965696|GENSCAN_predicted_CDS_6|1086_bp atgtgggtgcttacacctgctgcttttgctgggaagctcttgagtgtgttcaggcaacct ctgagctctctgtggaggagcctggtcccgctgttctgctggctgagggcaaccttctgg ctgctagctaccaagaggagaaagcagcagctggtcctgagagggccagatgagaccaaa gaggaggaagaggaccctcctctgcccaccaccccaaccagcgtcaactatcacttcact cgccagtgcaactacaaatgcggcttctgtttccacacagccaaaacatcctttgtgctg ccccttgaggaagcaaagagaggattgcttttgcttaaggaagctggtatggagaagatc aacttttcaggtggagagccatttcttcaagaccggggagaatacctgggcaagttggtg aggttctgcaaagtagagttgcggctgcccagcgtgagcatcgtgagcaatggaagcctg atccgggagaggtggttccagaattatggtgagtatttggacattctcgctatctcctgt gacagctttgacgaggaagtcaatgtccttattggccgtggccaaggaaagaagaaccat gtggaaaaccttcaaaagctgaggaggtggtgtagggattatagagtcgctttcaagata aattctgtcattaatcgtttcaacgtggaagaggacatgacggaacagatcaaagcacta aaccctgtccgctggaaagtgttccagtgcctcttaattgagggtgagaattgtggagaa gatgctctaagagaagcagaaagatttgttattggtgatgaagaatttgaaagattcttg gagcgccacaaagaagtgtcctgcttggtgcctgaatctaaccagaagatgaaagactcc taccttattctggatgaatatatgcgctttctgaactgtagaaagggacggaaggaccct tccaagtccatcctggatgttggtgtagaagaagctataaaattcagtggatttgatgaa aagatgtttctgaagcgaggaggaaaatacatatggagtaaggctgatctgaagctggat tggtag >gi568815596r:6749853_6965696|GENSCAN_predicted_peptide_7|199_aa MRKNQHKNAENSKNHNASSPPKDHNSPPARKQNWMQNEFDRLTEVDFRRWVTTNSSKLKE HVLTQCKEAKNLDKRLQELLTRITSLEKNINDLIELKNTAQELHEAYTSINSRIDQVEEM ISKIEDQLNEIKREDKIREKRMQRNKQSLQEIWDYVKRPNLRLIGVPERDEDNGTKLENT SGYHPGELPQPSKTGQHSN >gi568815596r:6749853_6965696|GENSCAN_predicted_CDS_7|600_bp atgaggaaaaaccagcacaaaaatgctgaaaattccaaaaaccacaatgcctcttctcct ccaaaggatcacaactccccaccagcaaggaaacaaaactggatgcagaatgagtttgac agactgacagaggtagacttcagaaggtgggtaacaacaaactcctccaagctaaaggag catgtcctaacccaatgcaaggaagctaagaaccttgataaaaggttacaggaactgcta actagaataaccagtttagagaagaacataaatgacctgatagagctgaaaaacacagca caagaacttcatgaagcatacacaagtatcaatagccgaatcgatcaagtggaagaaatg atatcaaagattgaagatcaacttaatgaaataaagcgtgaagacaagattagagaaaaa agaatgcaaaggaacaaacaaagcctccaagaaatatgggattatgtgaaaagaccaaat ctacgtttaattggtgtacctgaaagagacgaggataatggaaccaagttggaaaacact tcaggatatcatccaggagaacttccccaacctagcaagacaggccaacattcaaattag >gi568815596r:6749853_6965696|GENSCAN_predicted_peptide_8|533_aa MGIYATTYRNDIQKDVWCTDEIRELKAASSGGTSDSLVGLLHNHFRYTPWVQTKAEPPKF PKGTVKAVKNRNLTDIEALTTTAALGDASGLCRAHCPSSDMNSTQSDLQVHGIREQMDAD FIPCHPQNAKLLRELTGSEGATQNPQKWQEEDGPVSDHHRRQAATAGDTGNAGKSREKTQ IHFNKYLLSASYEKPVCRVLEGYKDNQDRTGIPSPHCTHTEPIDGTHRLQSSSAYAIKIR RVAIEVARKLTEGPRCFLTFSIRQRASPKVSGSPHPSQSKGSLGAHSLCYAYSQDMRSDG HIQALGSNHSTLCISLHIAGKGHVLCFPCIAQCLGPPSNTATEVSPAVTLDLTYMVSTQP RSAPCGSIFSSVQCQCQRRASPQSPTLAILQLEGKAAFVMSLQESLGSWHMDDYSLLKVM GSILICRPLGSKPGFKIDDVCAMLDNIPALCFTLLLCQLGIATYACAAGLCENCIHDASK ALKIRQIHNEYQPPPPPLVPNTPEMIKYWLTHCMPPRSLLLSGTLHPAYKPAP >gi568815596r:6749853_6965696|GENSCAN_predicted_CDS_8|1602_bp atggggatctatgcaacaacctacagaaatgacatccaaaaagatgtgtggtgtacagat gagattagggagctgaaagctgcttcttctggaggcacctctgactctcttgttgggcta cttcacaaccatttcaggtacacgccatgggtacagacgaaggccgagccgcccaagttc cctaaaggaacagtgaaagcagtgaaaaacagaaacctgacagatattgaagcactgacc acgaccgctgcactaggagatgcttctggactctgccgggcacactgtcccagttctgac atgaacagtactcagtcagacctgcaggtgcatggtatcagggagcaaatggatgctgac ttcattccatgtcacccccaaaatgccaagctgctcagggaactgacagggtctgaaggg gccacgcagaacccccagaagtggcaggaagaggatgggccagtttctgaccatcaccga aggcaggctgccacagcaggagacactgggaatgcaggaaaaagcagggaaaagacgcag attcatttcaacaaatatttattgagcgcctcctatgagaagcctgtgtgccgagtgcta gaggggtacaaagacaaccaggacaggactgggattcccagcccacactgtacccacacc gaacctattgatggaacacacaggctacaaagctccagtgcatacgctatcaaaattcgc cgtgtggcaatcgaggtcgccagaaaactcacagaaggacccaggtgcttcctgacattc tccatccgtcaaagagccagccccaaggtctctggaagcccacacccttcccaaagcaaa ggctcccttggagcacacagcctgtgttatgcttactcccaggacatgcgctcggatggg cacattcaggcgcttggttctaaccattccacattatgcatctcccttcacatcgcaggc aaaggccacgtcctatgcttcccgtgcatcgcacagtgcctcgggccccccagcaacacg gccacggaagtctcgccagctgttactctggatctcacttacatggtgtctacacaacca cgttctgctccgtgcggctccatcttctcatctgtgcaatgccagtgtcagaggagagca tctccacagtcacccacattggcaattttacagctggaaggaaaggcagcatttgtgatg agccttcaagaatcattaggaagctggcacatggatgactacagcctcctcaaggtcatg ggatctatccttatctgcaggcctctgggttcaaagccaggcttcaaaattgacgacgtg tgtgcgatgttggacaacatccctgctctctgcttcactctgctcctctgtcagctaggg atagcaacatacgcctgtgctgcagggctgtgtgagaactgcatccacgatgcaagcaaa gccctgaagatcaggcaaatccataatgagtatcaaccacctcctccacctctagtgcca aacacacctgagatgataaaatactggttgacacactgcatgccaccaaggagcttgctt ttgtctggaacactgcatccagcttataagcctgctccttag