GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:59:46 Sequence gi568815578f:2002873_2217375 : 214503 bp : 44.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 12972 13124 153 2 0 39 41 175 0.610 5.82 1.02 PlyA + 13178 13183 6 1.05 2.05 PlyA - 17032 17027 6 1.05 2.04 Term - 17988 17923 66 0 0 98 42 60 0.325 0.34 2.03 Intr - 19423 19327 97 0 1 90 110 5 0.189 2.91 2.02 Intr - 24408 24338 71 0 2 104 70 55 0.012 3.28 2.01 Init - 34811 34665 147 2 0 62 87 102 0.866 7.59 2.00 Prom - 42632 42593 40 -5.66 3.00 Prom + 43492 43531 40 -7.46 3.01 Init + 43878 43929 52 2 1 82 74 38 0.735 3.12 3.02 Intr + 49472 49697 226 0 1 80 15 96 0.005 -1.46 3.03 Intr + 56368 56437 70 1 1 81 115 27 0.399 3.88 3.04 Intr + 65110 65130 21 0 0 95 113 11 0.541 2.04 3.05 Term + 71166 71453 288 2 0 -13 35 471 0.975 27.08 3.06 PlyA + 71672 71677 6 1.05 4.00 Prom + 80318 80357 40 -3.36 4.01 Init + 82928 83051 124 1 1 50 89 53 0.400 1.93 4.02 Intr + 92273 92473 201 0 0 116 103 -9 0.109 2.66 4.03 Intr + 99999 100493 495 1 0 87 97 717 0.245 65.47 4.04 Term + 113794 114506 713 2 2 77 38 629 0.954 50.46 4.05 PlyA + 115178 115183 6 1.05 5.19 PlyA - 115472 115467 6 1.05 5.18 Term - 116215 116103 113 2 2 25 54 58 0.011 -5.18 5.17 Intr - 123145 123006 140 0 2 53 80 74 0.447 3.21 5.16 Intr - 124878 124792 87 2 0 108 71 57 0.660 5.09 5.15 Intr - 125717 125558 160 0 1 98 82 12 0.501 0.65 5.14 Intr - 133352 133216 137 1 2 80 52 103 0.050 6.11 5.13 Intr - 134763 134600 164 0 2 79 36 74 0.395 0.07 5.12 Intr - 138715 138578 138 1 0 63 50 91 0.034 3.46 5.11 Intr - 142105 142055 51 1 0 92 98 -1 0.011 0.40 5.10 Intr - 149753 149735 19 1 1 107 64 24 0.022 -1.49 5.09 Intr - 152500 152369 132 0 0 66 95 33 0.031 1.56 5.08 Intr - 159070 158909 162 0 0 89 80 26 0.026 0.99 5.07 Intr - 159579 159492 88 1 1 52 75 40 0.005 -1.87 5.06 Intr - 166777 166658 120 2 0 59 21 103 0.012 1.17 5.05 Intr - 171295 171270 26 0 2 112 98 9 0.631 1.97 5.04 Intr - 171544 171477 68 1 2 78 95 66 0.491 4.00 5.03 Intr - 176000 175827 174 2 0 108 109 25 0.532 6.74 5.02 Intr - 186669 186644 26 1 2 81 94 26 0.056 0.24 5.01 Init - 198128 198026 103 2 1 51 64 117 0.258 6.00 5.00 Prom - 199081 199042 40 -5.86 6.00 Prom + 201661 201700 40 -7.86 6.01 Sngl + 204273 204833 561 2 0 104 44 295 0.964 22.84 6.02 PlyA + 205444 205449 6 1.05 7.03 PlyA - 206000 205995 6 1.05 7.02 Term - 208642 208582 61 0 1 97 44 66 0.579 0.38 7.01 Intr - 209984 209651 334 0 1 70 32 170 0.453 4.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 49472 49734 263 0 2 80 41 156 0.880 5.79 S.002 Init - 135872 135828 45 2 0 89 106 18 0.832 4.38 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:2002873_2217375|GENSCAN_predicted_peptide_1|50_aa PQANLSGSIIEEGTVIVGDDSSTRVIAPEHLPLQQDAEVEDSDTDDRDHV >gi568815578f:2002873_2217375|GENSCAN_predicted_CDS_1|153_bp cctcaggcaaatctttcaggaagtattatagaagaaggcactgttatcgtaggagatgac agctccacacgtgttattgcccctgaacacctcccgctgcaacaagatgcggaagtggaa gacagtgacactgatgatcgtgaccatgtgtag >gi568815578f:2002873_2217375|GENSCAN_predicted_peptide_2|126_aa MNPNPCGMDKVQADIGQYSLVVKSMEYGVRETWAHVVALPLTSCVTFSKAQKCLLPLPGL FLLLVPTPILEQRRGYFHISPQSPAAAYPSQIWSLLATFPAVWAQSTYYMLGAVLGAGEI PVTEID >gi568815578f:2002873_2217375|GENSCAN_predicted_CDS_2|381_bp atgaatccaaacccttgtggaatggataaagttcaagctgacatagggcagtacagcttg gtggttaagagcatggagtatggagtcagagagacctgggctcatgtcgtggctttgcca cttactagctgtgtgacattcagcaaggctcagaagtgcctgctcccactgcctggcctc ttcctgctcctggtgcccactccaattttggagcaaaggagaggatatttccacatcagc ccacaatctcctgctgctgcatacccaagtcagatctggagtctcttggccactttccca gcagtgtgggcccagagcacctactatatgttgggtgctgttctgggcgctggtgaaatt ccagtgaccgaaatagattaa >gi568815578f:2002873_2217375|GENSCAN_predicted_peptide_3|218_aa MSWAEGYEKTCDRATWQAETEFNVQDTYLKRPLDQHLMAESDQNRIGQKKKSSYNAGLIT ALAKPMDSSEARMGLQSCPKLGHDDQALMLLYQCNKHCGNITFQRLDVYSQRFTLKELYG VLTKNKKEKERKRKKKKEKEEEEQEEEEEKRRGRKKKKKKKKKRKKKKKRRREEKRKKRK KKKKKKGEEEEEYEEEREEKIIAILNNLSENRSRETIS >gi568815578f:2002873_2217375|GENSCAN_predicted_CDS_3|657_bp atgtcgtgggcggagggctatgagaagacctgtgacagggccacgtggcaagctgagaca gagtttaatgtgcaggacacttacttaaaaaggcccttggatcaacacttgatggcagag agtgaccaaaatagaattgggcagaagaagaagtcaagctacaatgcaggtctaataaca gcactggccaaacccatggacagctctgaagctagaatgggccttcaatcttgtcccaaa ttgggccatgatgaccaggcccttatgctcctatatcagtgtaacaagcactgtggcaac attacttttcaaagacttgatgtctattctcaaaggttcacactgaaggagctctatggg gttctcacgaagaataagaaggagaaggagaggaagaggaagaagaagaaggagaaggag gaggaggagcaggaggaggaggaggagaaaagaaggggaaggaagaagaagaaaaagaag aagaagaagaggaagaagaagaagaagagaagaagagaagaaaagaggaaaaagaggaaa aagaagaagaagaagaaaggagaggaggaggaggaatacgaagaagaaagagaagaaaaa attatagcaattctcaacaatctctcagaaaatagaagcagagagactatttcctaa >gi568815578f:2002873_2217375|GENSCAN_predicted_peptide_4|510_aa MPRCYYENSFNLMDALKGLRVPQVTPIAFGDDLSMRALALTRRSSLLSHGTTPRAGCPSG VLSPLKYRDGSPASALLFCLLTWDGSYLSELQHSRVKGGPMGSSHITDPMETGKDGARRG TQSPERKRRSPVPRAPSTKLRPAAAARAMDPVAAEAPGEAFLARRRPEGGGGSARPRYSL LAEIGRGSYGVVYEAVAGRSGARVAVKKIRCDAPENVELALAEFWALTSLKRRHQNVVQF EECVLQRNGLAQRMSHGNKSSQLYLRLVETSLKGERILGYAEEPCYLWFVMEFCEGGDLN QYVLSRRPDPATNKSFMLQLTSAIAFLHKNHIVHRDLKPDNILITERSGTPILKVADFGL SKVCAGLAPRGKEGNQDNKNVNVNKYWLSSACGSDFYMAPEVWEGHYTAKADIFALGIII WAMIERITFIDSETKKELLGTYIKQGTEIVPVGEALLENPKMELHIPQKRRTSMSEGIKQ LLKDMLAANPQDRPDAFELETRMDQVTCAA >gi568815578f:2002873_2217375|GENSCAN_predicted_CDS_4|1533_bp atgcctcggtgttattatgagaatagttttaacctcatggatgctttaaagggtctcagg gtcccccaggtaaccccgatcgcatttggtgatgatttgtccatgcgtgctctggccctt acaagacgaagctccctgctttcccatggcaccacccccagggctggctgcccctcgggt gtcctcagccccctgaagtacagagatggatcaccagcctcagccttactattctgcctt ctcacttgggacggttcttacctatccgagcttcagcactctcgtgtgaaaggagggcca atggggtccagccacatcaccgaccccatggaaacggggaaggacggcgcccgcagaggt acacaaagcccggagcggaaaaggcgaagcccagtgccgcgggcgcccagcacgaagctg aggccggcggcggcggcccgggccatggatccggtggcggccgaggccccgggcgaggcc ttcctggcgcggcgacggcctgagggcggtggcgggtccgcgcggccgcgttacagcctg ttggcggagatcgggcgcggcagctacggcgtggtttatgaggcagtggccgggcgcagc ggggcccgggtggcggtcaagaagatccgctgcgacgcccccgagaacgtggagctggcg ctggctgaattctgggccctcaccagcctcaagcggcgccaccagaacgtcgtgcagttt gaggagtgcgtcctgcagcgcaatgggttagcccagcgcatgagtcacggcaacaagagc tcgcagctttacctgcgcctggtggagacctcgctgaaaggagaaaggatcctgggttat gctgaggagccctgctatctctggtttgtcatggagttctgtgaaggtggagacctgaat cagtatgtcctgtcccggaggccagacccagccaccaacaaaagtttcatgctacagctg acgagcgccattgccttcctgcacaaaaaccatattgtgcacagggacctgaagccagac aacatcctcatcacagagcggtctggcacccccatcctcaaagtggccgactttggacta agcaaggtctgtgctgggctggcaccccgaggcaaagagggcaatcaagacaacaaaaat gtgaatgtgaataagtactggctgtcctcagcctgcggttcggacttctacatggctcct gaagtctgggagggacactacacagccaaggcggacatctttgccctgggcattatcatc tgggcaatgatagaaagaatcacttttattgactctgagaccaagaaggagctcctgggg acctacattaaacaggggactgagatcgtccctgttggtgaggcgctgctagaaaaccca aagatggagttgcacatcccccaaaaacgcaggacttccatgtctgaggggatcaagcag ctcttgaaagatatgttagctgctaacccacaggaccggcctgatgcctttgaacttgaa accagaatggaccaggtcacatgtgctgcttaa >gi568815578f:2002873_2217375|GENSCAN_predicted_peptide_5|635_aa MRSELSSWDYRKVEEERACGDIRHKDSGADNPIPAQHDEALPQLLTAAPLTLPEDFLQLQ ELLSLHGPAQKCPGAEGPGSSPWPLKNKRCWIDTTIPLAPQRTVRSLLDPALPVRMESWK CLESCSWKYFTHSSSHCKELGALKTSPLGPPPPLRSHPGPLDLQSSYEEDSGDAGDMAAN TVDKNDAPGMWGERQTRSMTRSLPIYYFSIFAPSSATHLCFHSELWAGGGDLLLPVVQVR QDGQHHGCWEKEDAARSGFAACPELCNTDVHTLPLTNTAGTRPTSLSPLDPSCLEQKFSR SRMCQMVRCFPIGYVYTEGRHLTFLIGKIGYFNEQEKLAAPIERSRCGVLSGLFHAQWTK ESFHTAKREGRRNGSTDKWDNTVFSLRVYRTGKMLGKRGARGQRLYNIYNRSFAVLHDNP EDHRRPSFKGKKTANFGKYICFTWRGSLKSKCGYLEQARPQNVSPESKLGVRGHRASPYP EDKCTESLSGSQLSQGPRTTSTMELGTRQDRRNGGKGKQWSFRTPEQQEGAMCSLKAFPA IPVGTLAPKMLRMGFCQGTRTTDTYDAFGSYWSPFLAAGARRWPNDSIPSVLGAFPAADA EFMKMEKVSDLAYHVTPEPSTYSYGSLNNGLLNDF >gi568815578f:2002873_2217375|GENSCAN_predicted_CDS_5|1908_bp atgaggtcagaactttcaagctgggactaccggaaggtggaggaggagcgagcgtgtggg gacatcaggcacaaggacagcggtgcagacaaccccatcccagctcagcacgatgaagct ctgccacagctgctcacggcagcaccactgactctgcctgaggacttcctccagctgcag gagctgctcagcctgcacgggccagcccagaagtgcccaggagctgagggccctgggagc agcccttggccactaaagaacaaaaggtgctggatagataccaccattcccctcgctccg cagaggacagttcgtagcctgctcgacccagctttgcctgtacgtatggagagctggaag tgcctggaaagctgttcctggaaatacttcacacacagctcatcccactgcaaggagctc ggggccttgaaaacgtcacccctgggaccaccaccaccattgcggtcacaccctggaccc ctggacttgcagtcctcatatgaggaggactcaggggatgctggggatatggcagcgaac acagtggataaaaatgatgcccctggcatgtggggggagagacagacaagaagcatgaca cgcagcctgcccatctattacttctccatatttgctccctcctctgcaactcacctatgc tttcattctgagctatgggcaggaggaggtgacctcctgctcccagtggtccaggtccgg caggatggacagcaccatggctgctgggagaaggaagatgctgccaggtcaggctttgct gcctgtccagaattatgcaacacagatgtccacacgctacctctcaccaacacagctggg accagacccacctctcttagccccttagatccctcttgcctggaacaaaagttctcaagg tcacggatgtgccagatggtccgctgttttcccattggatacgtctatactgagggcagg cacctaacatttttaataggaaaaatagggtactttaatgaacaagaaaagctggcagct ccaatagaaaggtccagatgtggggtcttgtctggactcttccatgcccagtggacaaag gagagcttccacactgccaaacgggagggtcgaaggaatgggagtactgacaaatgggat aacactgttttcagtcttcgggtatacagaacgggcaaaatgttaggaaaaagaggagca agagggcagagattatacaacatatacaacaggagcttcgcagtgctccatgacaaccca gaagaccacaggaggccttctttcaaggggaagaagacggccaactttggcaaatacatc tgtttcacgtggaggggttccttgaaatccaagtgtggctatttggagcaggcccggcct cagaatgtgtcccctgagtctaagttaggagtaagaggccacagggcttcaccctatcct gaagacaaatgcactgagagcctgagcgggagtcagttgtcccagggtcccagaaccaca tccaccatggaactaggaacacgccaggacagaaggaacgggggaaagggaaaacagtgg tccttcaggactccagagcagcaggaaggagctatgtgctctctgaaggcttttcctgct atacctgttggtaccctggcccctaagatgctcagaatgggcttctgccagggcaccagg accacagacacgtatgatgcctttggctcttattggagccctttccttgcagctggtgcc agacgctggcctaatgacagcattccctcagtgctgggtgctttcccagctgccgatgca gaattcatgaagatggagaaagtctctgatttggcttatcatgtcaccccagagcctagc acatacagttatgggtcgcttaacaatgggctgcttaacgatttctga >gi568815578f:2002873_2217375|GENSCAN_predicted_peptide_6|186_aa MATAGSQHRRRENEDRSRVTALAFEQPRESLCVLQPPLPFMSEVLSPQRSPQRAFPDAYE TGSAWAQVEEGPAWPPPRSPWGPCPFRRFSRVLHHGNSGVGSVGCEPSGWGSRRALSASP REVTQVTGGRRDLRAERGIQKVPGGGFRFLCGVSGWSWLGAAGARWPCASAPGKLSLQGP GGGRGQ >gi568815578f:2002873_2217375|GENSCAN_predicted_CDS_6|561_bp atggcaactgcaggcagccagcaccgacggcgggaaaacgaggaccggagccgcgttaca gccttggccttcgagcagcccagggagagtctctgcgtcctccagcccccgcttcccttc atgtcggaggtcctaagcccgcagcgaagtccgcaaagagctttccccgacgcctacgag accggaagtgcctgggcccaggtggaggaaggacccgcgtggccgcctccgcgctctccc tggggtccgtgccccttccggcgtttctcccgcgtccttcaccatggcaactcgggcgtc gggagcgtgggctgcgagccgagtgggtgggggagccgccgggctctatccgccagccca cgagaagtgacgcaggtgacgggaggccgcagggacctgagggcggagaggggaattcag aaagtgccgggcggcggcttccgcttcctctgcggggtctctgggtggtcgtggctggga gctgccggggcccggtggccgtgcgcctctgctccgggcaagctctccctgcaaggcccg ggcggcggcagggggcagtag >gi568815578f:2002873_2217375|GENSCAN_predicted_peptide_7|131_aa XWNHQDIKIHQLPESEAQPNRNEIFVDANRHYAALAPGIICSSSRSNLCDCGQVTVMRKP QLKHPPGLRASDITLEQFLLHREKQLNWQTVTVTNVSVNSALHPGAQAQALSCYTSLEVA TEGQKKESDAL >gi568815578f:2002873_2217375|GENSCAN_predicted_CDS_7|396_bp ncatggaaccaccaagacatcaaaatacaccagctgcctgaatctgaagcgcagccaaac agaaatgagatttttgtggatgctaataggcactatgcagccttagcaccagggatcatt tgctctagttctagatccaacttgtgtgactgtgggcaagtgactgttatgagaaagcct cagctaaaacaccctcctggtctcagagcttctgacatcaccctagagcagttcctcctc catcgtgaaaagcagttgaattggcaaactgtgactgtgacaaatgtctcagtaaacagc gcccttcatccaggtgcacaggcccaagccttgagctgctacacttctctggaagtggcc acagagggacaaaagaaggaatctgatgccctgtga