GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:41:19 Sequence gi568815587f:124963368_125186018 : 222651 bp : 43.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4992 5097 106 2 1 72 80 87 0.141 4.70 1.02 Intr + 7854 8094 241 1 1 -26 71 179 0.045 1.41 1.03 Intr + 20497 20617 121 1 1 35 96 -9 0.024 -4.90 1.04 Intr + 23375 23521 147 1 0 71 103 143 0.999 14.53 1.05 Intr + 23760 23963 204 2 0 68 32 93 0.507 1.10 1.06 Intr + 28094 28216 123 1 0 65 80 57 0.883 3.38 1.07 Intr + 29802 29876 75 2 0 91 76 27 0.731 1.51 1.08 Intr + 40500 40592 93 2 0 73 87 81 0.917 6.66 1.09 Intr + 41742 41845 104 2 2 49 106 106 0.813 7.47 1.10 Intr + 52749 52833 85 0 1 48 82 83 0.243 3.52 1.11 Intr + 57569 57611 43 1 1 100 76 36 0.116 1.41 1.12 Term + 63393 63502 110 1 2 53 34 81 0.032 -2.13 1.13 PlyA + 65987 65992 6 1.05 2.10 PlyA - 66671 66666 6 1.05 2.09 Term - 67239 66972 268 2 1 55 44 151 0.344 2.37 2.08 Intr - 67967 67672 296 2 2 114 37 174 0.681 10.71 2.07 Intr - 68883 68728 156 0 0 75 77 97 0.903 7.51 2.06 Intr - 69634 69481 154 0 1 67 100 150 0.941 14.17 2.05 Intr - 91932 91911 22 1 1 66 115 52 0.069 2.30 2.04 Intr - 95298 95124 175 0 1 -1 105 32 0.044 -4.49 2.03 Intr - 98078 97977 102 2 0 65 85 38 0.731 1.57 2.02 Intr - 99065 98950 116 0 2 103 98 65 0.720 9.17 2.01 Init - 99341 99188 154 2 1 92 29 104 0.603 3.19 2.00 Prom - 102508 102469 40 -5.56 3.00 Prom + 105661 105700 40 -5.36 3.01 Init + 106378 106389 12 0 0 70 89 20 0.396 -0.40 3.02 Intr + 109664 109788 125 1 2 76 71 74 0.655 3.88 3.03 Intr + 113390 113471 82 2 1 122 100 73 0.751 11.54 3.04 Intr + 113863 113956 94 0 1 93 91 62 0.993 6.54 3.05 Intr + 114083 114161 79 0 1 93 110 218 0.987 23.11 3.06 Intr + 115745 115880 136 2 1 76 80 165 0.983 15.07 3.07 Intr + 116317 116393 77 0 2 162 81 30 0.999 8.01 3.08 Intr + 117247 117413 167 1 2 131 92 359 0.992 40.20 3.09 Intr + 118331 118539 209 0 2 51 63 132 0.472 5.80 3.10 Intr + 118877 118967 91 1 1 95 106 172 0.999 19.27 3.11 Intr + 120448 120510 63 2 0 95 66 99 0.991 7.09 3.12 Intr + 120867 120952 86 1 2 56 74 91 0.913 4.04 3.13 Intr + 121458 121506 49 2 1 129 66 82 0.996 8.35 3.14 Intr + 121699 121772 74 2 2 80 105 124 0.999 12.43 3.15 Intr + 122028 122106 79 2 1 112 48 228 0.999 20.32 3.16 Intr + 122210 122307 98 0 2 80 91 102 0.987 9.43 3.17 Intr + 122587 122651 65 0 2 114 100 -5 0.365 0.82 3.18 Term + 126374 126803 430 2 1 57 41 243 0.221 11.27 3.19 PlyA + 127051 127056 6 1.05 4.04 PlyA - 133681 133676 6 1.05 4.03 Term - 134373 134239 135 0 0 104 47 99 0.982 5.42 4.02 Intr - 137936 137834 103 1 1 95 98 -13 0.982 0.58 4.01 Init - 138874 138765 110 1 2 79 94 151 0.994 12.49 4.00 Prom - 143951 143912 40 -4.36 5.00 Prom + 144860 144899 40 -3.56 5.01 Sngl + 150593 150994 402 1 0 39 54 643 0.646 52.27 5.02 PlyA + 151064 151069 6 -5.51 6.00 Prom + 151074 151113 40 -16.42 6.01 Sngl + 151139 151498 360 1 0 32 52 377 0.886 24.57 6.02 PlyA + 151538 151543 6 1.05 7.00 Prom + 171331 171370 40 -4.16 7.01 Init + 177135 177289 155 2 2 69 66 187 0.647 13.11 7.02 Term + 181616 181628 13 2 1 112 55 13 0.277 -1.93 7.03 PlyA + 182256 182261 6 1.05 8.00 Prom + 182491 182530 40 -1.46 8.01 Init + 188820 188834 15 2 0 102 81 26 0.004 3.67 8.02 Intr + 197215 197336 122 0 2 53 97 91 0.003 5.79 8.03 Intr + 203929 204058 130 1 1 54 32 106 0.093 2.30 8.04 Term + 208328 208447 120 1 0 106 43 74 0.500 3.17 8.05 PlyA + 209847 209852 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 189228 189152 77 0 2 74 75 67 0.865 2.31 S.002 Init - 189361 189356 6 1 0 47 116 10 0.814 0.01 S.003 Term - 195307 195124 184 0 1 100 44 141 0.805 7.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:124963368_125186018|GENSCAN_predicted_peptide_1|483_aa MADAPPPARLLPRSLISDCCASSEQGSVGVGPAKPERNSIDINKKDIHTKTPSVGHHHQR PKVDKTTKMWRNQSRKAENSKNQGISSPKGHSSSPATEQSWMENDFDELTEVGFRSAAPE DGAAGVGPTGISVHVHSGGSVSMSWGTHRHGAGSLQELDYEEPDYEESSSLVTDEKGKED LFGRGQQDQQAIHSEDKNKPFSRVQKVKFKNPLFVLMEEEEQKQLHFEGLQDILPEAQDY FLEAQGDLLETQGDLTGIQSVKPDTQAVEMKVQKVHFKEPYSDMTDEKGREDFSLADYQC LPPKSQDQDDIKNQNKHIKLPSSFEKWEIARGNTPGVPLAYDRYQSGLSTEFQAPLAFQS DVDKEEDKKERQKQYLRHRRLFMDIEREQVKEQQRQKEQKKKIENLFDIVRLSSICMVEG PLLLNILFIVRKKENTCETSSGDVKLAAVGEQMVFGYMDKFFSDDFRDFGAPVTQAVYTV PNR >gi568815587f:124963368_125186018|GENSCAN_predicted_CDS_1|1452_bp atggcggacgcccctcctccagccaggctgctgcctcgcagtttaatctcggactgctgc gctagcagtgagcaaggctctgtgggtgtgggacctgccaagccagaaaggaatagcatc gacatcaacaaaaaggacatccacaccaaaaccccatctgtaggtcaccatcatcaaaga ccaaaggtagataaaaccacaaagatgtggagaaaccagagcagaaaagctgaaaattct aaaaaccagggcatctcttctccaaagggtcacagctcctcgccagcaacagaacaaagc tggatggagaatgactttgatgagttgacagaagtaggcttcagaagtgctgcaccagag gatggagcggcgggagtggggcctactggcatcagtgtgcatgttcactctggtggcagt gtcagcatgtcatggggcactcacaggcatggggctggttccctccaggaacttgactat gaggaacctgactatgaggaatcttcatctcttgtaactgatgagaaagggaaagaagat ttgtttgggagaggccagcaggaccagcaggctatccattctgaagataagaacaaacct ttcagcagagttcagaaagtaaaattcaaaaatccattatttgttctgatggaagaggaa gaacaaaagcagttacattttgagggccttcaggatattctgccagaagcccaggattat tttctagaagcccaaggtgatttgctggaaacccagggtgatttgacaggaatccagagt gttaagccagatacccaggctgttgaaatgaaggttcagaaagtacactttaaggagcca tactctgatatgacagatgagaaagggagagaagacttttctctggcagactatcagtgt ttgcctcccaaatcccaggaccaggatgacatcaaaaatcagaacaagcatatcaaacta ccctcatcttttgagaaatgggagattgcaagaggaaatactcctggagtgcccttggct tatgataggtatcaatcaggattgagcactgaattccaagctccactggcatttcagtct gacgtggataaagaagaagataagaaagagcgtcaaaagcagtacctgagacatagacga cttttcatggatattgagagagaacaagttaaagaacaacaaaggcaaaaagaacaaaag aagaaaattgaaaatttgtttgacattgtaagactaagcagtatctgcatggttgagggc cccttgctgctgaatatcctgtttattgtgcgtaagaaggagaatacctgtgagacatca agtggagatgtcaaattggcagctgttggggaacagatggtttttggttacatggataaa ttctttagtgatgatttccgagattttggtgcacccgtcacccaagcagtgtacactgta cccaataggtag >gi568815587f:124963368_125186018|GENSCAN_predicted_peptide_2|480_aa MTMWQSLLLSFLEASSHQAGTSVVLKVTASSLKISGYVSTDNCEILVHEPKGVPRPYAQG QRSGVTTPPNRLGPRIMGLARTSPRIKKKRLLPVNSSHVSLFRGQGGSGNTPTNVGTLSC SLLLESLRRLTVDNIRTRFEEQLSAYQVKRRKVEPENCMCKIRNHEKAWYIEEMVHKTFF VADVYSIPNRRPPDFTRVTVHWEKGNNQTFRGLLDTDSELTLNPGAQNIIVVLQLKKGLM KGYINSPALCHNLIQRDLDHFPLPQDITLVHYIDDIMLIRSSEREVANTLNLLPAPMASW GVPCDQLAEEEKTTAWFTDGSARYAGITRKWTAAAFQSLSRTSLMDRPEGKSSQWAELRA MHLVVHFAWKEKWPDERLYTDLWAVANGLTGWLIILDFFHHVKGRGLSSLEYTLTLDMGL PILHTMLLPMLLYSQNALSTVMVFYTALPLTLQLKKCSSGLLLMELTGFISSPIILKQLD >gi568815587f:124963368_125186018|GENSCAN_predicted_CDS_2|1443_bp atgacaatgtggcagtcactcctgctctcctttctggaggcaagcagccaccaggcaggg acttctgtggtcctgaaagtgactgcctctagcctgaagatttcagggtatgtttccact gataattgtgagatccttgtacatgaacccaagggagtgcccaggccctacgcgcaaggt cagagaagtggagtaacaactccaccaaacagattaggacctagaataatgggcttggct cgtaccagccctagaatcaagaaaaagaggttacttcctgtgaattccagccacgtatcc ctgttcaggggccagggtggcagtgggaacactcccaccaatgttggaactctttcttgc tcgctcctgttggagagtctcagaaggcttactgtagacaacataagaactagatttgaa gaacagcttagtgcataccaagtgaagaggagaaaggtggaaccggaaaattgcatgtgc aaaataaggaaccatgaaaaagcatggtacattgaagaaatggtgcataagacattcttt gtagctgatgtctacagcatccccaacaggagacctccggactttaccagggtaactgtg cactgggaaaagggaaacaatcagacgtttcggggactactggacactgactctgagctg acactgaatccaggggcccaaaacatcattgtggtcctccagttaaagaaggggcttatg aaggggtatatcaactctccggctttgtgtcataatcttattcagagagaccttgatcac tttccacttccgcaagatatcacactggtccactacattgatgacattatgctgattaga tccagtgagcgagaagtagcaaacacactgaatttattgcctgcaccaatggcctcatgg ggagttccctgtgatcaactggcagaggaagagaagactacggcctggttcacagatggt tctgcacgatatgcaggcatcacccgaaagtggacagctgcagcatttcagtccctttcc aggacatccctgatggacagacctgaagggaaatcttcccagtgggcagaacttcgagca atgcacctggttgtgcactttgcatggaaggagaaatggccagatgagcgattatatact gacttgtgggctgtagccaatggtttgactggatggctgattatactggacttcttccat catgtaaagggcagaggtttgtcctcactggaatacacacttactctggatatgggtttg cctatcctgcacacaatgcttctgccaatgcttctgtactcacagaatgccttatccacc gtcatggtattctacacagcattacctctgactttacagctgaagaagtgcagcagtggg ctcctgctcatggaactcactggttttatcagttcccccatcatcctgaagcagctggat tga >gi568815587f:124963368_125186018|GENSCAN_predicted_peptide_3|671_aa MWRQGFYWQHHGGLQVVDFRELGSELAGTSGYHLEEGRAGILEPTRFRGLILLLTFLIYA CYHMSRKPISIVKSRLHQNCSEQIKPINDTHSLNDTMWCSWAPFDKDNYKELLGGVDNAF LIAYAIGMFISGVFGERLPLRYYLSAGMLLSGLFTSLFGLGYFWNIHELWYFVVIQVCNG LVQTTGWPSVVTCVGNWFGKGKRGFIMGIWNSHTSVGNILGSLIAGIWVNGQWGLSFIVP GIITAVMGVITFLFLIEPGGRQHMDAPTAGLISSAQGEPAENQDNPEDPGNSPCSIRESG LETVAKCSKGPCEEPAAISFFGALRIPGVVEFSLCLLFAKLVSYTFLYWLPLYIANVAHF SAKEAGDLSTLFDVGGIIGGIVAGLVSDYTNGRATTCCVMLILAAPMMFLYNYIGQDGIA SSIVMLIICGGLVNGPYALITTAVSADLGTHKSLKGNAKALSTVTAIIDGTGSIGAALGP LLAGLISPTGWNNVFYMLISADVLACLLLCRLVYKEILAWKVSLSRGSGARDLQPAVPEP PTRSMGSCGARASPTSTTPCSRAPSPIDHPRAEECERTARDWQAAPPAAPVRDPLGEASW APESDGDVESPYVLLRDCKHTNQHPVFSSRFVSAPVDTLYLAALVGPRRTFISSSGIVNT PIGTVYLAQGL >gi568815587f:124963368_125186018|GENSCAN_predicted_CDS_3|2016_bp atgtggcgccagggtttctactggcagcatcatgggggactccaggtggtggatttcagg gagttggggtcagagctggcggggacatcaggctaccacctggaggagggcagggctgga atcttggaacctaccaggttccgaggcctcatcctgctgctgaccttcctaatttacgcc tgctatcacatgtccaggaagcctatcagtatcgtcaagagccgtctgcaccagaactgc tcggagcagatcaaacccatcaatgatactcacagtctcaatgacaccatgtggtgcagc tgggccccatttgacaaggacaactataaggagttactagggggcgtggacaacgccttc ctcatcgcctatgccatcggcatgttcatcagtggggtttttggggagcggcttccgctc cgttactacctctcagctggaatgctgctcagtggccttttcacctcgctctttggcctg ggatatttctggaacatccacgagctctggtactttgtggtcatccaggtctgtaatgga ctcgtccagaccacaggctggccctctgtggtgacctgtgttggcaactggttcgggaag gggaagcgggggttcatcatgggcatctggaattcccacacatctgtgggcaacatcctg ggctccctgatcgccggcatctgggtgaacgggcagtggggcctgtcgttcatcgtgcct ggcatcattactgccgtcatgggcgtcatcaccttcctcttcctcatcgaacctggtggg aggcagcacatggacgcacccacagcagggctcatctcctctgctcagggtgagccagct gagaaccaggacaaccctgaggaccctgggaacagtccctgctctatcagggagagcggc cttgagactgtggccaaatgctccaaggggccatgcgaagagcctgctgccatcagcttc tttggggcgctccggatcccaggcgtggtcgagttctctctgtgtctgctgtttgccaag ctggtcagttacaccttcctctactggctgcccctctacatcgccaatgtggctcacttt agtgccaaggaggctggggacctgtctacactcttcgatgttggtggcatcataggcggc atcgtggcagggctcgtctctgactacaccaatggcagggccaccacttgctgtgtcatg ctcatcttggctgcccccatgatgttcctgtacaactacattggccaggacgggattgcc agctccatagtgatgctgatcatctgtgggggcctggtcaatggcccatacgcgctcatc accactgctgtctctgctgatctggggactcacaagagcctgaagggcaacgccaaagcc ctgtccacggtcacggccatcattgacggcaccggctccataggtgcggctctggggcct ctgctggctgggctcatctcccccacgggctggaacaatgtcttctacatgctcatctct gccgacgtcctagcctgcttgctcctttgccggttagtatacaaagagatcttggcctgg aaggtgtccctgagcagaggcagcggggctcgggacctgcagcccgcggtgcctgagcct cccacccgctccatgggctcctgtggggcccgagcctccccgacgagtaccaccccctgc tccagggcgcccagtcccatcgaccacccaagggctgaggagtgcgagcgcacggcgcgg gactggcaggcagctccacctgcagccccggtgcgggatccactaggtgaagccagctgg gctcctgagtctgatggggacgtggagagtccttatgtcctgctcagggattgtaaacac accaatcagcaccctgtgtttagctcaaggtttgtgagtgcaccagtcgacactctgtat ctagctgctctggtggggcctcggagaacctttatatctagctcagggattgtaaataca cccatcggcactgtgtatctagctcaaggtttataa >gi568815587f:124963368_125186018|GENSCAN_predicted_peptide_4|115_aa MAGTVLGVGAGVFILALLWVAVLLLCVLLSRASGAARFSVIFLFFGAVIITSVLLLFPRA GEFPAPEVEVKIVDDFFIGRYVLLAFLSAIFLGGLFLVLIHYVLEPIYAKPLHSY >gi568815587f:124963368_125186018|GENSCAN_predicted_CDS_4|348_bp atggctggcactgtgctcggagtcggtgcgggcgtgttcatcttagccctgctctgggtg gcagtgctgctgctgtgtgtgctgctgtccagagcctccggggcggcgaggttctctgtc atttttttattcttcggtgctgtgatcatcacatcagttctgttgcttttcccgcgagct ggtgaattcccagccccagaagtggaagttaagattgtggatgactttttcattggccgc tatgtcctgctggctttccttagtgccatcttccttggaggcctcttcttggttttaatc cattatgttctggagccgatctatgccaaaccactgcactcctactga >gi568815587f:124963368_125186018|GENSCAN_predicted_peptide_5|133_aa MTIKDLRAHIFANSVDNACVVLQIDNARLAADDFRVKYKTELVMCLSVESNVRGLHKVTD DTNVTRLQLETEMEALKEELLFMKKNQEKEVKGLQALIASSELTMEVNAPKSQDLSNIMA DLPAQYDELAQKN >gi568815587f:124963368_125186018|GENSCAN_predicted_CDS_5|402_bp atgaccatcaaggacttgagggctcatatctttgcaaattctgtggacaatgcttgtgtt gttctacagattgacaatgcccgtcttgctgctgatgactttagagtcaagtataagacg gagctggtcatgtgcctgtctgtggagagcaacgtccgtgggctccacaaggtcactgat gacaccaatgtcactcggctacagctggagacagagatggaggctctcaaggaggagctg ctcttcatgaagaagaaccaagagaaggaagtaaaaggtctacaagccctgattgccagc tctgagttgaccatggaggtaaatgcccccaaatctcaggacctcagcaacatcatggca gacctcccggcccagtatgatgagctggcccagaagaactga >gi568815587f:124963368_125186018|GENSCAN_predicted_peptide_6|119_aa MRNLKASLENSLRWVEARYAMQMEQLSGAVLHLDSELAQAQAEGQHQAQKYEALLNIKVK REAEIAATTTTAACWKMGRTSIVAMPWTTATLCKPSKRLSRIADDKVVSEINNTKVLRR >gi568815587f:124963368_125186018|GENSCAN_predicted_CDS_6|360_bp atgagaaatctgaaggccagcctggagaacagcctgaggtgggtggaggcccgctatgcc atgcaaatggagcagctcagtggggccgtgctgcacctagattcagagctggcacaggcc caggcagaagggcagcaccaggcccagaagtatgaggccctgctgaacatcaaggtcaag cgggaggctgagattgccgctactactactactgccgcctgctggaagatggggaggact tcaattgtggcgatgccctggacaacagcaactctatgcaaaccatccaaaagactgtcc aggatagcggatgacaaagtggtgtctgagatcaacaacaccaaagttctgaggcgctga >gi568815587f:124963368_125186018|GENSCAN_predicted_peptide_7|55_aa MNLQHQPLPVSHSQGGQALRALQDNPRRLLLLLLLLEPSQGVLCWQAGFAHRLAE >gi568815587f:124963368_125186018|GENSCAN_predicted_CDS_7|168_bp atgaatctgcagcaccagcccctgccagtctcccacagccagggtggccaagccctcagg gctctgcaggacaatcctcggcgtctgctgctgctgctgctgctgctggaaccttctcag ggtgtcctctgctggcaggcaggcttcgcacacaggttggcagaatga >gi568815587f:124963368_125186018|GENSCAN_predicted_peptide_8|128_aa MPEKKGGELSQGLAIPELVFSFGGSKDPLKRGDVSMKAYKQIGFRRESQTALTGHGRRTT HDPRVASAARGAATAAAELQGCCFVGLEFNLAWALFQVTGVFNQRCTNSQSMLWPGAACC AQPWQRLQ >gi568815587f:124963368_125186018|GENSCAN_predicted_CDS_8|387_bp atgccggagaagaagggtggagagctgagccagggcctggcgatcccagagctggtgttc tcttttgggggatccaaggatcctttgaagagaggtgatgtttccatgaaagcttataag cagataggattcagaagggagagccagacagcccttacaggacacggaagacgcacgacc cacgacccacgagtggcctcggcggcccggggcgcggcgactgccgccgcggagctccaa ggctgttgttttgtaggcttggaattcaacctggcttgggcccttttccaggtcactggc gtcttcaaccagcgttgcacaaatagccagtcaatgctgtggcctggtgctgcctgctgt gcacagccctggcagaggctgcagtag