GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:20:58 Sequence gi568815578f:34160375_34369164 : 208790 bp : 46.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8735 8758 24 0 0 82 100 21 0.249 0.70 1.02 Intr + 12060 12093 34 1 1 76 75 47 0.390 -0.52 1.03 Term + 13867 14005 139 1 1 100 44 89 0.745 3.14 1.04 PlyA + 14078 14083 6 1.05 2.08 PlyA - 15026 15021 6 1.05 2.07 Term - 15098 15042 57 2 0 104 40 36 0.019 -1.91 2.06 Intr - 31114 31042 73 0 1 62 99 70 0.655 4.91 2.05 Intr - 36404 36329 76 0 1 87 80 96 0.126 7.27 2.04 Intr - 41279 41210 70 2 1 66 63 41 0.007 -1.85 2.03 Intr - 53415 53229 187 2 1 59 82 82 0.050 4.39 2.02 Intr - 55172 54082 1091 2 2 87 64 382 0.042 24.54 2.01 Init - 55518 55411 108 0 0 83 89 53 0.961 5.12 2.00 Prom - 64985 64946 40 -3.66 3.00 Prom + 72111 72150 40 -5.86 3.01 Init + 73403 73508 106 1 1 92 47 61 0.064 2.78 3.02 Intr + 81033 81115 83 1 2 44 84 62 0.015 0.76 3.03 Intr + 91782 91885 104 2 2 102 10 66 0.210 -0.83 3.04 Intr + 99991 100160 170 1 2 130 106 119 0.824 17.49 3.05 Intr + 102407 102519 113 0 2 91 116 71 0.909 10.30 3.06 Term + 108617 108793 177 1 0 88 53 254 0.983 19.59 3.07 PlyA + 112354 112359 6 1.05 4.17 PlyA - 112538 112533 6 1.05 4.16 Term - 120791 120660 132 2 0 101 54 209 0.972 16.89 4.15 Intr - 125260 125066 195 1 0 110 98 300 0.997 32.81 4.14 Intr - 130075 129958 118 0 1 60 94 299 0.997 28.17 4.13 Intr - 130264 130177 88 2 1 61 94 113 0.995 8.23 4.12 Intr - 130564 130357 208 1 1 123 53 431 0.999 41.75 4.11 Intr - 131157 131045 113 1 2 83 105 247 0.999 26.00 4.10 Intr - 132133 131984 150 2 0 52 99 286 0.902 26.23 4.09 Intr - 133782 133707 76 0 1 96 114 76 0.979 10.09 4.08 Intr - 135211 135021 191 2 2 95 78 388 0.999 37.80 4.07 Intr - 142957 142869 89 0 2 16 109 79 0.570 2.41 4.06 Intr - 143282 143144 139 0 1 43 117 102 0.388 8.12 4.05 Intr - 151030 150907 124 1 1 49 36 87 0.287 -0.24 4.04 Intr - 155822 155712 111 2 0 74 59 83 0.561 4.58 4.03 Intr - 172390 172231 160 0 1 93 29 126 0.348 7.19 4.02 Intr - 173520 173490 31 1 1 106 77 30 0.759 0.89 4.01 Init - 175502 175430 73 2 1 71 109 18 0.709 2.08 4.00 Prom - 187566 187527 40 -2.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:34160375_34369164|GENSCAN_predicted_peptide_1|65_aa XFFNKTGKDNFNVEATNIPSPPPEISADQNRSLKKLPEAGESSTRKEQKEQSSEYQPQQD KIHYV >gi568815578f:34160375_34369164|GENSCAN_predicted_CDS_1|198_bp ngtttcttcaacaaaactggaaaagataacttcaatgtagaagccaccaatatccccagt ccccctcctgaaattagtgctgatcagaacagaagtttgaagaaactacctgaggctgga gaaagtagcaccagaaaagagcagaaggaacaatcctcagagtaccagccccaacaagat aaaattcactatgtctga >gi568815578f:34160375_34369164|GENSCAN_predicted_peptide_2|553_aa MDKQALLGLNPNADSDFRLRALAYFEQLKISPDAWQNISLSGPSFFFDILSVVDLNPRGV DLYLRILMAINSELVDRDVVHTSEEAHRNTLIKDTMREQCIPNLVESWYQISQNYQYTNS EVMCQCLEVVGAYVSWIDLSLLANDRFINMLLGHMSIEVLREEACDCLFEVVNKGMDPVD KMKLVESLCQVLQSAGFFSIDQEEDVDFFSALTRKKMLARFSKPVNGMGQSWIVSWSKLT KNGNIKNAQEALQAIETKVALMLQLLIHEDDDISSNIGFCYDYLHILKQLTVLSDQQKAN VEAIMLAVMKKLTYDEEYNFENEGQDKAMFVEYRKQLKLVLDRLAQVSRELLLASVRRVF SSTLQNWQTTRFMEVEVAIRLLYMLAEALPVSISWCSLLSCPLQKGILRSGVCTFLHRMI ICLEEEVFPFIPSASEHILKDCEAKDLQEFIPLINQITAKFKVYGSGSTRTRIYQRQLET NSCLSNSCPVSKMQTSKGSREYFTPQNRLRRSCSRPPTDRWDQRAFNGDDSVLREPNTAV KKRQSVVFQELRD >gi568815578f:34160375_34369164|GENSCAN_predicted_CDS_2|1662_bp atggataaacaggctctattagggctaaatccaaatgctgattcagactttagactaagg gccctggcctattttgagcagttaaagatttccccagatgcctggcagaatatctcacta agtggcccaagttttttttttgacattctctcagtagtggacctaaatccaaggggagta gatctgtacctgcgaatcctcatggctatcaactcagagttggtggatcgtgatgtggtg catacatcagaggaggctcacaggaatactctcataaaagataccatgagggaacagtgc attccaaatctggtggaatcatggtaccaaatatcacaaaattatcagtatactaattct gaagtgatgtgtcagtgccttgaagtagttggggcttatgtctcttggattgacttatcc cttttagccaatgataggtttataaatatgctgctaggtcatatgtcaatagaagttcta cgggaagaagcatgtgactgtttatttgaagttgtaaataaaggaatggaccctgttgat aaaatgaaactagtagaatctttgtgtcaagtattacagtctgctggctttttcagcatt gaccaggaagaagatgttgacttcttttcagcattgaccaggaagaagatgttggccaga ttttctaagccggtaaacggaatgggacagtcatggatagttagttggagtaaattaact aagaatgggaatattaagaatgctcaagaggcactacaagctattgaaacaaaagtggca ctgatgttgcagctactaattcatgaggatgatgatatttcttctaatattggattttgt tatgattatcttcatattttgaaacagcttacagtgctctcggatcagcaaaaagctaat gtggaggcaatcatgttggccgttatgaaaaaattgacttatgatgaagaatataacttt gaaaatgagggtcaagataaagccatgtttgtagaatatagaaaacaactgaagttagtg ttggacaggcttgctcaagtttcacgagagttactactggcctctgttcgcagagttttt agttctacactgcagaattggcagactacacggttcatggaagttgaagtagcaataaga ttgctgtatatgttggcagaagctcttccagtatctatctcatggtgctcacttctcagt tgtcccttacaaaagggtattctcagaagtggagtctgtactttccttcatcgaatgatt atttgcctggaggaagaagtttttccgttcatcccatctgcttcagaacatatactcaaa gattgtgaagcaaaagacctccaggagttcattcctcttatcaaccagattacggccaaa ttcaaggtatatggctccggctccactcgcacccgaatctatcaaaggcagctggagacc aattcttgtctctccaacagctgccctgtgagcaaaatgcagacctctaagggctctagg gagtacttcacccctcagaaccggctacgcaggtcctgctccaggcctcccacagaccgc tgggaccagagggccttcaatggggatgacagtgtgctcagggagcccaacactgcagtg aaaaagagacagtctgtcgtcttccaggagcttagagattag >gi568815578f:34160375_34369164|GENSCAN_predicted_peptide_3|250_aa MMWPQLADQTRTAKPLPEKKSTGMRYQWNSLEEVKVIDEQAANLYSCKVKKEVPWPGLFA GKHVGRETEKRNKTQRQSIEKEQWAQETGAQYTEDPHWPPGMDVTRLLLATLLVFLCFFT ANSHLPPEEKLRDDRSLRSNSSVNLLDVPSVSIVAQRPQKHLTLLLLSLFEALNKKSKQI GRKAAEKKRSSKKEASMKKVVRPRTPLSAPCVATRNSCKPPAPACCDPCASCQCRFFRSA CSCRVLSLNC >gi568815578f:34160375_34369164|GENSCAN_predicted_CDS_3|753_bp atgatgtggccacaattggctgatcagactagaacagcaaaaccattacctgagaaaaag tcaacaggaatgagataccagtggaatagcttggaagaggtcaaagttatagatgagcaa gcagcaaatctctacagctgcaaggtgaaaaaggaagttccttggcctgggctctttgcg ggaaagcatgtgggacgagagactgagaaaagaaataagacacagagacaaagtatagag aaagaacagtgggcccaggagaccggtgctcagtatacagaggacccgcactggcctcct gggatggatgtcacccgcttactcctggccaccctgctggtcttcctctgcttcttcact gccaacagccacctgccacctgaggagaagctccgagatgacaggagcctgagaagcaac tcctctgtgaacctactggatgtcccttctgtctctattgtggctcaacgtccccagaaa catctgactttgctccttttgtctctctttgaagcgctgaacaagaaatccaaacagatc ggcagaaaagcagcagaaaagaaaagatcttctaagaaggaggcttcgatgaagaaagtg gtgcggccccggacccccctatctgcgccctgcgtggccacccgcaacagctgcaagccg ccggcacccgcctgctgcgacccgtgcgcctcctgccagtgccgcttcttccgcagcgcc tgctcctgccgcgtgctcagcctcaactgctga >gi568815578f:34160375_34369164|GENSCAN_predicted_peptide_4|665_aa MYPQVSSILEAFMLEILSFSFQWAAFPPHKTLKEWREHVSECGIWPATPSTDTGASSMRA RGHTRCIASRGTWRHPGKDACDPEAPERTPRAAIMKYHAFGGLQQQKFILSQFWKPEVQN QGIGRKLHWGLDLEKGHQQLTSRAAFSGVGGDPQGSTVSSFLYIQPDEEIEAQGSELAEF QFRWVLTLNQLAEPARGRPGWLSGLSRRSLPDRSCSQTEAQPPSPVSITSAASMSDKLPY KVADIGLAAWGRKALDIAENEMPGLMRMRERYSASKPLKGARIAGCLHMTVETAVLIETL VTLGAEVQWSSCNIFSTQDHAAAAIAKAGIPVYAWKGETDEEYLWCIEQTLYFKDGPLNM ILDDGGDLTNLIHTKYPQLLPGIRGISEETTTGVHNLYKMMANGILKVPAINVNDSVTKS KFDNLYGCRESLIDGIKRATDVMIAGKVAVVAGYGDVGKGCAQALRGFGARVIITEIDPI NALQAAMEGYEVTTMDEACQEGNIFVTTTGCIDIILGRHFEQMKDDAIVCNIGHFDVEID VKWLNENAVEKVNIKPQVDRYRLKNGRRIILLAEGRLVNLGCAMGHPSFVMSNSFTNQVM AQIELWTHPDKYPVGVHFLPKKLDEAVAEAHLGKLNVKLTKLTEKQAQYLGMSCDGPFKP DHYRY >gi568815578f:34160375_34369164|GENSCAN_predicted_CDS_4|1998_bp atgtacccacaggtctcctctatcctggaggcattcatgcttgagatcctgagcttctca ttccaatgggcagcttttcctccgcataagaccctgaaggaatggagggagcacgtgagt gagtgcgggatctggccagccactccaagcactgacacaggagcaagctccatgcgggcc cgtggacacaccaggtgtattgcctcaagggggacatggcggcatccaggcaaggatgcc tgcgaccctgaagccccagagaggactcccagagctgccattatgaagtaccacgcattt ggtggcttacaacaacagaaatttattctctcacagttctggaagccagaagtccaaaat caaggcattggcaggaagctccactggggcctggacctggagaagggacatcaacagtta acgtccagggcagctttcagcggcgtgggcggggatccccagggaagcactgttagcagt ttcttgtacatccagcctgatgaggaaattgaggcacagggaagtgaactggcggaattt cagttccgctgggttttgacactgaatcaactggctgaaccagcgagaggacggcccggc tggctctcaggcctctcccggcgctcccttccggaccgttcctgttcccagactgaggcc cagcccccttcgcccgtttccatcacgagtgccgccagcatgtctgacaaactgccctac aaagtcgccgacatcggcctggctgcctggggacgcaaggccctggacattgctgagaac gagatgccgggcctgatgcgtatgcgggagcggtactcggcctccaagccactgaagggc gcccgcatcgctggctgcctgcacatgaccgtggagacggccgtcctcattgagaccctc gtcaccctgggtgctgaggtgcagtggtccagctgcaacatcttctccacccaggaccat gcggcggctgccattgccaaggctggcattccggtgtatgcctggaagggcgaaacggac gaggagtacctgtggtgcattgagcagaccctgtacttcaaggacgggcccctcaacatg attctggacgacgggggcgacctcaccaacctcatccacaccaagtacccgcagcttctg ccaggcatccgaggcatctctgaggagaccacgactggggtccacaacctctacaagatg atggccaatgggatcctcaaggtgcctgccatcaatgtcaatgactccgtcaccaagagc aagtttgacaacctctatggctgccgggagtccctcatagatggcatcaagcgggccaca gatgtgatgattgccggcaaggtagcggtggtagcaggctatggtgatgtgggcaagggc tgtgcccaggccctgcggggtttcggagcccgcgtcatcatcaccgagattgaccccatc aacgcactgcaggctgccatggagggctatgaggtgaccaccatggatgaggcctgtcag gagggcaacatctttgtcaccaccacaggctgtattgacatcatccttggccggcacttt gagcagatgaaggatgatgccattgtgtgtaacattggacactttgacgtggagatcgat gtcaagtggctcaacgagaacgccgtggagaaggtgaacatcaagccgcaggtggaccgg tatcggttgaagaatgggcgccgcatcatcctgctggccgagggtcggctggtcaacctg ggttgtgccatgggccaccccagcttcgtgatgagtaactccttcaccaaccaggtgatg gcgcagatcgagctgtggacccatccagacaagtaccccgttggggttcatttcctgccc aagaagctggatgaggcagtggctgaagcccacctgggcaagctgaatgtgaagttgacc aagctaactgagaagcaagcccagtacctgggcatgtcctgtgatggccccttcaagccg gatcactaccgctactga