GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:14:48 Sequence gi568815578r:34181037_34395586 : 214550 bp : 45.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 302 297 6 1.05 1.07 Term - 362 348 15 2 0 119 43 7 0.003 -2.46 1.06 Intr - 10452 10380 73 2 1 62 99 70 0.517 4.91 1.05 Intr - 15742 15667 76 2 1 87 80 96 0.118 7.27 1.04 Intr - 20617 20548 70 1 1 66 63 41 0.008 -1.85 1.03 Intr - 32753 32567 187 1 1 59 82 82 0.050 4.39 1.02 Intr - 34510 33420 1091 1 2 87 64 382 0.042 24.54 1.01 Init - 34856 34749 108 2 0 83 89 53 0.961 5.12 1.00 Prom - 44323 44284 40 -3.66 2.00 Prom + 51449 51488 40 -5.86 2.01 Init + 52741 52846 106 0 1 92 47 61 0.064 2.78 2.02 Intr + 60371 60453 83 0 2 44 84 62 0.015 0.76 2.03 Intr + 71120 71223 104 1 2 102 10 66 0.210 -0.83 2.04 Intr + 79329 79498 170 0 2 130 106 119 0.824 17.49 2.05 Intr + 81745 81857 113 2 2 91 116 71 0.909 10.30 2.06 Term + 87955 88131 177 0 0 88 53 254 0.983 19.59 2.07 PlyA + 91692 91697 6 1.05 3.17 PlyA - 91876 91871 6 1.05 3.16 Term - 100129 99998 132 1 0 101 54 209 0.972 16.89 3.15 Intr - 104598 104404 195 0 0 110 98 300 0.997 32.81 3.14 Intr - 109413 109296 118 2 1 60 94 299 0.997 28.17 3.13 Intr - 109602 109515 88 1 1 61 94 113 0.995 8.23 3.12 Intr - 109902 109695 208 0 1 123 53 431 0.999 41.75 3.11 Intr - 110495 110383 113 0 2 83 105 247 0.999 26.00 3.10 Intr - 111471 111322 150 1 0 52 99 286 0.902 26.23 3.09 Intr - 113120 113045 76 2 1 96 114 76 0.979 10.09 3.08 Intr - 114549 114359 191 1 2 95 78 388 0.999 37.80 3.07 Intr - 122295 122207 89 2 2 16 109 79 0.570 2.41 3.06 Intr - 122620 122482 139 2 1 43 117 102 0.388 8.12 3.05 Intr - 130368 130245 124 0 1 49 36 87 0.287 -0.24 3.04 Intr - 135160 135050 111 1 0 74 59 83 0.561 4.58 3.03 Intr - 151728 151569 160 2 1 93 29 126 0.346 7.19 3.02 Intr - 152858 152828 31 0 1 106 77 30 0.756 0.89 3.01 Init - 154840 154768 73 1 1 71 109 18 0.707 2.08 3.00 Prom - 166904 166865 40 -2.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:34181037_34395586|GENSCAN_predicted_peptide_1|539_aa MDKQALLGLNPNADSDFRLRALAYFEQLKISPDAWQNISLSGPSFFFDILSVVDLNPRGV DLYLRILMAINSELVDRDVVHTSEEAHRNTLIKDTMREQCIPNLVESWYQISQNYQYTNS EVMCQCLEVVGAYVSWIDLSLLANDRFINMLLGHMSIEVLREEACDCLFEVVNKGMDPVD KMKLVESLCQVLQSAGFFSIDQEEDVDFFSALTRKKMLARFSKPVNGMGQSWIVSWSKLT KNGNIKNAQEALQAIETKVALMLQLLIHEDDDISSNIGFCYDYLHILKQLTVLSDQQKAN VEAIMLAVMKKLTYDEEYNFENEGQDKAMFVEYRKQLKLVLDRLAQVSRELLLASVRRVF SSTLQNWQTTRFMEVEVAIRLLYMLAEALPVSISWCSLLSCPLQKGILRSGVCTFLHRMI ICLEEEVFPFIPSASEHILKDCEAKDLQEFIPLINQITAKFKVYGSGSTRTRIYQRQLET NSCLSNSCPVSKMQTSKGSREYFTPQNRLRRSCSRPPTDRWDQRAFNGDDSVLREVQLS >gi568815578r:34181037_34395586|GENSCAN_predicted_CDS_1|1620_bp atggataaacaggctctattagggctaaatccaaatgctgattcagactttagactaagg gccctggcctattttgagcagttaaagatttccccagatgcctggcagaatatctcacta agtggcccaagttttttttttgacattctctcagtagtggacctaaatccaaggggagta gatctgtacctgcgaatcctcatggctatcaactcagagttggtggatcgtgatgtggtg catacatcagaggaggctcacaggaatactctcataaaagataccatgagggaacagtgc attccaaatctggtggaatcatggtaccaaatatcacaaaattatcagtatactaattct gaagtgatgtgtcagtgccttgaagtagttggggcttatgtctcttggattgacttatcc cttttagccaatgataggtttataaatatgctgctaggtcatatgtcaatagaagttcta cgggaagaagcatgtgactgtttatttgaagttgtaaataaaggaatggaccctgttgat aaaatgaaactagtagaatctttgtgtcaagtattacagtctgctggctttttcagcatt gaccaggaagaagatgttgacttcttttcagcattgaccaggaagaagatgttggccaga ttttctaagccggtaaacggaatgggacagtcatggatagttagttggagtaaattaact aagaatgggaatattaagaatgctcaagaggcactacaagctattgaaacaaaagtggca ctgatgttgcagctactaattcatgaggatgatgatatttcttctaatattggattttgt tatgattatcttcatattttgaaacagcttacagtgctctcggatcagcaaaaagctaat gtggaggcaatcatgttggccgttatgaaaaaattgacttatgatgaagaatataacttt gaaaatgagggtcaagataaagccatgtttgtagaatatagaaaacaactgaagttagtg ttggacaggcttgctcaagtttcacgagagttactactggcctctgttcgcagagttttt agttctacactgcagaattggcagactacacggttcatggaagttgaagtagcaataaga ttgctgtatatgttggcagaagctcttccagtatctatctcatggtgctcacttctcagt tgtcccttacaaaagggtattctcagaagtggagtctgtactttccttcatcgaatgatt atttgcctggaggaagaagtttttccgttcatcccatctgcttcagaacatatactcaaa gattgtgaagcaaaagacctccaggagttcattcctcttatcaaccagattacggccaaa ttcaaggtatatggctccggctccactcgcacccgaatctatcaaaggcagctggagacc aattcttgtctctccaacagctgccctgtgagcaaaatgcagacctctaagggctctagg gagtacttcacccctcagaaccggctacgcaggtcctgctccaggcctcccacagaccgc tgggaccagagggccttcaatggggatgacagtgtgctcagggaggtacaactctcctag >gi568815578r:34181037_34395586|GENSCAN_predicted_peptide_2|250_aa MMWPQLADQTRTAKPLPEKKSTGMRYQWNSLEEVKVIDEQAANLYSCKVKKEVPWPGLFA GKHVGRETEKRNKTQRQSIEKEQWAQETGAQYTEDPHWPPGMDVTRLLLATLLVFLCFFT ANSHLPPEEKLRDDRSLRSNSSVNLLDVPSVSIVAQRPQKHLTLLLLSLFEALNKKSKQI GRKAAEKKRSSKKEASMKKVVRPRTPLSAPCVATRNSCKPPAPACCDPCASCQCRFFRSA CSCRVLSLNC >gi568815578r:34181037_34395586|GENSCAN_predicted_CDS_2|753_bp atgatgtggccacaattggctgatcagactagaacagcaaaaccattacctgagaaaaag tcaacaggaatgagataccagtggaatagcttggaagaggtcaaagttatagatgagcaa gcagcaaatctctacagctgcaaggtgaaaaaggaagttccttggcctgggctctttgcg ggaaagcatgtgggacgagagactgagaaaagaaataagacacagagacaaagtatagag aaagaacagtgggcccaggagaccggtgctcagtatacagaggacccgcactggcctcct gggatggatgtcacccgcttactcctggccaccctgctggtcttcctctgcttcttcact gccaacagccacctgccacctgaggagaagctccgagatgacaggagcctgagaagcaac tcctctgtgaacctactggatgtcccttctgtctctattgtggctcaacgtccccagaaa catctgactttgctccttttgtctctctttgaagcgctgaacaagaaatccaaacagatc ggcagaaaagcagcagaaaagaaaagatcttctaagaaggaggcttcgatgaagaaagtg gtgcggccccggacccccctatctgcgccctgcgtggccacccgcaacagctgcaagccg ccggcacccgcctgctgcgacccgtgcgcctcctgccagtgccgcttcttccgcagcgcc tgctcctgccgcgtgctcagcctcaactgctga >gi568815578r:34181037_34395586|GENSCAN_predicted_peptide_3|665_aa MYPQVSSILEAFMLEILSFSFQWAAFPPHKTLKEWREHVSECGIWPATPSTDTGASSMRA RGHTRCIASRGTWRHPGKDACDPEAPERTPRAAIMKYHAFGGLQQQKFILSQFWKPEVQN QGIGRKLHWGLDLEKGHQQLTSRAAFSGVGGDPQGSTVSSFLYIQPDEEIEAQGSELAEF QFRWVLTLNQLAEPARGRPGWLSGLSRRSLPDRSCSQTEAQPPSPVSITSAASMSDKLPY KVADIGLAAWGRKALDIAENEMPGLMRMRERYSASKPLKGARIAGCLHMTVETAVLIETL VTLGAEVQWSSCNIFSTQDHAAAAIAKAGIPVYAWKGETDEEYLWCIEQTLYFKDGPLNM ILDDGGDLTNLIHTKYPQLLPGIRGISEETTTGVHNLYKMMANGILKVPAINVNDSVTKS KFDNLYGCRESLIDGIKRATDVMIAGKVAVVAGYGDVGKGCAQALRGFGARVIITEIDPI NALQAAMEGYEVTTMDEACQEGNIFVTTTGCIDIILGRHFEQMKDDAIVCNIGHFDVEID VKWLNENAVEKVNIKPQVDRYRLKNGRRIILLAEGRLVNLGCAMGHPSFVMSNSFTNQVM AQIELWTHPDKYPVGVHFLPKKLDEAVAEAHLGKLNVKLTKLTEKQAQYLGMSCDGPFKP DHYRY >gi568815578r:34181037_34395586|GENSCAN_predicted_CDS_3|1998_bp atgtacccacaggtctcctctatcctggaggcattcatgcttgagatcctgagcttctca ttccaatgggcagcttttcctccgcataagaccctgaaggaatggagggagcacgtgagt gagtgcgggatctggccagccactccaagcactgacacaggagcaagctccatgcgggcc cgtggacacaccaggtgtattgcctcaagggggacatggcggcatccaggcaaggatgcc tgcgaccctgaagccccagagaggactcccagagctgccattatgaagtaccacgcattt ggtggcttacaacaacagaaatttattctctcacagttctggaagccagaagtccaaaat caaggcattggcaggaagctccactggggcctggacctggagaagggacatcaacagtta acgtccagggcagctttcagcggcgtgggcggggatccccagggaagcactgttagcagt ttcttgtacatccagcctgatgaggaaattgaggcacagggaagtgaactggcggaattt cagttccgctgggttttgacactgaatcaactggctgaaccagcgagaggacggcccggc tggctctcaggcctctcccggcgctcccttccggaccgttcctgttcccagactgaggcc cagcccccttcgcccgtttccatcacgagtgccgccagcatgtctgacaaactgccctac aaagtcgccgacatcggcctggctgcctggggacgcaaggccctggacattgctgagaac gagatgccgggcctgatgcgtatgcgggagcggtactcggcctccaagccactgaagggc gcccgcatcgctggctgcctgcacatgaccgtggagacggccgtcctcattgagaccctc gtcaccctgggtgctgaggtgcagtggtccagctgcaacatcttctccacccaggaccat gcggcggctgccattgccaaggctggcattccggtgtatgcctggaagggcgaaacggac gaggagtacctgtggtgcattgagcagaccctgtacttcaaggacgggcccctcaacatg attctggacgacgggggcgacctcaccaacctcatccacaccaagtacccgcagcttctg ccaggcatccgaggcatctctgaggagaccacgactggggtccacaacctctacaagatg atggccaatgggatcctcaaggtgcctgccatcaatgtcaatgactccgtcaccaagagc aagtttgacaacctctatggctgccgggagtccctcatagatggcatcaagcgggccaca gatgtgatgattgccggcaaggtagcggtggtagcaggctatggtgatgtgggcaagggc tgtgcccaggccctgcggggtttcggagcccgcgtcatcatcaccgagattgaccccatc aacgcactgcaggctgccatggagggctatgaggtgaccaccatggatgaggcctgtcag gagggcaacatctttgtcaccaccacaggctgtattgacatcatccttggccggcacttt gagcagatgaaggatgatgccattgtgtgtaacattggacactttgacgtggagatcgat gtcaagtggctcaacgagaacgccgtggagaaggtgaacatcaagccgcaggtggaccgg tatcggttgaagaatgggcgccgcatcatcctgctggccgagggtcggctggtcaacctg ggttgtgccatgggccaccccagcttcgtgatgagtaactccttcaccaaccaggtgatg gcgcagatcgagctgtggacccatccagacaagtaccccgttggggttcatttcctgccc aagaagctggatgaggcagtggctgaagcccacctgggcaagctgaatgtgaagttgacc aagctaactgagaagcaagcccagtacctgggcatgtcctgtgatggccccttcaagccg gatcactaccgctactga