GENSCAN 1.0 Date run: 7-Nov-116 Time: 03:49:40 Sequence gi568815595f:11159038_11360498 : 201461 bp : 44.36% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 541 536 6 1.05 1.01 Sngl - 8548 7391 1158 1 0 78 44 328 0.558 23.62 1.00 Prom - 16442 16403 40 -3.46 2.00 Prom + 26114 26153 40 -4.06 2.01 Sngl + 36410 36604 195 1 0 58 48 171 0.437 5.27 2.02 PlyA + 38558 38563 6 1.05 3.05 PlyA - 38869 38864 6 1.05 3.04 Term - 42152 42135 18 2 0 113 54 0 0.140 -2.58 3.03 Intr - 43051 42933 119 2 2 68 93 42 0.180 2.88 3.02 Intr - 55458 55365 94 0 1 94 60 51 0.102 2.44 3.01 Init - 75470 75231 240 2 0 78 59 297 0.361 23.67 3.00 Prom - 80319 80280 40 -4.96 4.03 PlyA - 81230 81225 6 1.05 4.02 Term - 92082 91685 398 1 2 118 50 135 0.888 7.94 4.01 Init - 95534 95474 61 2 1 65 116 33 0.971 5.21 4.00 Prom - 96587 96548 40 -4.46 5.00 Prom + 97860 97899 40 -9.65 5.01 Sngl + 100064 101464 1401 1 0 87 49 1488 0.601 138.83 5.02 PlyA + 101736 101741 6 1.05 6.00 Prom + 104108 104147 40 -0.06 6.01 Init + 113275 113393 119 0 2 83 115 41 0.713 4.04 6.02 Intr + 123156 123237 82 0 1 77 67 76 0.863 4.04 6.03 Term + 123854 124132 279 1 0 56 40 172 0.651 4.65 6.04 PlyA + 125450 125455 6 1.05 7.00 Prom + 136229 136268 40 -6.76 7.01 Init + 139659 139818 160 2 1 79 78 191 0.882 17.22 7.02 Intr + 140325 140379 55 1 1 79 84 10 0.625 -2.26 7.03 Intr + 147906 148023 118 0 1 113 55 75 0.586 7.17 7.04 Intr + 149947 150024 78 0 0 145 92 29 0.925 8.75 7.05 Intr + 156307 156456 150 0 0 76 77 42 0.499 2.26 7.06 Intr + 172022 172137 116 1 2 108 28 15 0.003 -3.25 7.07 Intr + 173935 174056 122 1 2 3 91 108 0.030 2.74 7.08 Intr + 181608 181698 91 1 1 81 71 63 0.946 2.95 7.09 Intr + 183098 183242 145 2 1 98 115 -10 0.966 2.98 7.10 Intr + 188840 188998 159 1 0 89 68 139 0.812 12.18 7.11 Intr + 199381 199575 195 0 0 80 102 232 0.881 23.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 173984 174056 73 1 1 76 91 47 0.901 5.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:11159038_11360498|GENSCAN_predicted_peptide_1|385_aa MNYRYHRPGEARGQQMSRAWRMQGTADVTGLERPGDSRCHGPGEARGQQMSRAWRGQGTA DVTGLERPGDSRCHGPGEARGQQMSRAWRGQGTADVTGLENAGDSRCHGPGECRGQQMSR AWRSQGTADVTGLERPGDSRCHGPGEYRGQQMSRAWGIQGTADVTGLENAGDQQMSLVWR MQGSADVMGLEKPRDSRCHAPGEARGQQMSLAWRSQGKEMLRAWRSQGTADVTAWRIQGT ANVVGLEEPGDSRCHGPGEARGQEMSWAWRSQGTADVMGLEKPGDSRCHGPGESRGQQMS RAWRSQGTGDVTAWRSQGTADVMGLEKPGDSRCHGPGESRGQQMSGPGECRRQQMSRAWR SRKVQCRPHQGRNACLLCSAGIPCA >gi568815595f:11159038_11360498|GENSCAN_predicted_CDS_1|1158_bp atgaactacagatatcacaggcctggagaagccaggggacagcagatgtcacgggcctgg agaatgcaggggacagcagatgtcacgggcctggagaggccaggggacagcagatgtcac gggcctggagaggccaggggacagcagatgtcacgggcctggagaggccaggggacagca gatgtcacgggcctggagaggccaggggacagcagatgtcacgggcctggagaggccagg ggacagcagatgtcacgggcctggagaggccaggggacagcagatgtcacgggcctggag aatgcaggggacagcagatgtcacgggcctggagaatgcaggggacagcagatgtcacgg gcctggagaagccaggggacagcagatgtcacgggcctggagaggccaggggacagcaga tgtcacgggcctggagaatacaggggacagcagatgtcacgggcctggggaatccaggga acagcagatgtcacaggcctggagaatgcaggggaccagcagatgtcactggtctggaga atgcaggggtcagcagatgtcatgggcctggagaagccaagggacagtagatgtcatgcg cctggagaagccagggggcagcagatgtcactagcctggagaagccaggggaaggagatg ttgcgggcctggagaagccaggggacagcagatgtcacagcctggagaatccaggggaca gcaaatgtcgtgggtctggaggagccaggggacagcagatgtcatgggcctggagaagcc aggggacaggagatgtcatgggcctggagaagccaggggacagcagatgtcatgggcctg gagaagccaggggacagcagatgtcacgggcctggagaatctaggggacagcagatgtca cgggcctggagaagccaggggacaggagatgtcacagcctggagaagccaggggacagca gatgtcatgggcctggagaagccaggggacagcagatgtcatgggcctggagaatctagg ggacagcagatgtcagggcctggagaatgcaggagacagcagatgtcacgggcctggaga agccggaaagtacaatgtaggcctcaccagggcaggaatgcttgtctgctctgttctgca ggaatcccgtgtgcctag >gi568815595f:11159038_11360498|GENSCAN_predicted_peptide_2|64_aa MWFLNPPPLEVPSTSAYRVGMTWCGCAEARGSTCIQQNQVTLMTVLDEMSYLDNLFLVSC FLPF >gi568815595f:11159038_11360498|GENSCAN_predicted_CDS_2|195_bp atgtggttcctcaatcccccaccactggaagttccctccacttcagcgtacagagtggga atgacctggtgtggctgtgctgaagctcgagggtctacgtgtattcagcagaaccaagtg actctcatgacagtattggatgaaatgagctatctggataatctcttcctggtgtcctgc ttcttgcccttttga >gi568815595f:11159038_11360498|GENSCAN_predicted_peptide_3|156_aa MASGPCGEQFKSAFSCFHYSTEEIKGSDCVDRFRAMQECLQKYPDLYPQEDEDEEEEREK KPAEQAEETAPTEATATKEESIGQQKATVPRNPTITDWFLLDVISHSELFAEIRNNFIYR KTQPYVLRKNPFSPPLNYPKTSSSSVLVPAPTEGIL >gi568815595f:11159038_11360498|GENSCAN_predicted_CDS_3|471_bp atggccagcggcccctgtggggaacagttcaagtcagccttttcctgcttccactatagc acggaggagatcaaggggtcagactgtgtagaccggttccgggccatgcaggaatgcctg cagaaatacccagacctctatccccaagaggatgaggatgaggaagaggaaagagagaag aagccagcagaacaagcagaagaaacagctcccactgaggccactgcaaccaaagaagag tctataggacagcagaaggccacagtgcccagaaatcccaccattaccgactggtttctt ttggatgtgatatcccattcagagttatttgcagaaattagaaataattttatctaccgc aaaactcaaccgtatgtattaaggaagaatccattttccccacctctgaattaccccaag acctccagctcttcagtcctggtgcctgccccgactgaggggatcctctga >gi568815595f:11159038_11360498|GENSCAN_predicted_peptide_4|152_aa MREEKADRYCGDQVTLCIDQATVPVFSPREKEKLKEIGAKESHEGRWVQPDKKEMLSKPF MREVLSQLHQGTHWGPQAMCDAVLRVYGCVGIYTLARQVADRFRVRRKTNKQPKENGSTT CRASTLIRESRKAYCPVESPNCSLKKHRSTPL >gi568815595f:11159038_11360498|GENSCAN_predicted_CDS_4|459_bp atgagggaagaaaaggcagacagatactgtggcgaccaggtaactctgtgcatagaccaa gctacagtccctgttttctcccccagagaaaaagaaaagttaaaagaaataggagccaaa gaaagtcacgagggaagatgggtacaaccagacaagaaggaaatgctgtctaagcccttc atgcgagaggtattgtcacagctacatcaaggaactcactggggcccccaagctatgtgt gatgcagtcctcagagtttatggatgcgtaggaatttatacccttgctagacaagttgca gatcgtttcagagtgcgcagaaaaaccaacaaacaacctaaagaaaatgggagtaccaca tgccgtgcatccaccctcatcagggagagtcgaaaggcctattgccctgttgagagtccg aactgctccctgaaaaaacataggtctacccccttatga >gi568815595f:11159038_11360498|GENSCAN_predicted_peptide_5|466_aa MASPQLMPLVVVLSTICLVTVGLNLLVLYAVRSERKLHTVGNLYIVSLSVADLIVGAVVM PMNILYLLMSKWSLGRPLCLFWLSMDYVASTASIFSVFILCIDRYRSVQQPLRYLKYRTK TRASATILGAWFLSFLWVIPILGWNHFMQQTSVRREDKCETDFYDVTWFKVMTAIINFYL PTLLMLWFYAKIYKAVRQHCQHRELINRSLPSFSEIKLRPENPKGDAKKPGKESPWEVLK RKPKDAGGGSVLKSPSQTPKEMKSPVVFSQEDDREVDKLYCFPLDIVHMQAAAEGSSRDY VAVNRSHGQLKTDEQGLNTHGASEISEDQMLGDSQSFSRTDSDTTTETAPGKGKLRSGSN TGLDYIKFTWKRLRSHSRQYVSGLHMNRERKAAKQLGFIMAAFILCWIPYFIFFMVIAFC KNCCNEHLHMFTIWLGYINSTLNPLIYPLCNENFKKTFKRILHIRS >gi568815595f:11159038_11360498|GENSCAN_predicted_CDS_5|1401_bp atggccagcccccagctgatgcccctggtggtggtcctgagcactatctgcttggtcaca gtagggctcaacctgctggtgctgtatgccgtacggagtgagcggaagctccacactgtg gggaacctgtacatcgtcagcctctcggtggcggacttgatcgtgggtgccgtcgtcatg cctatgaacatcctctacctgctcatgtccaagtggtcactgggccgtcctctctgcctc ttttggctttccatggactatgtggccagcacagcgtccattttcagtgtcttcatcctg tgcattgatcgctaccgctctgtccagcagcccctcaggtaccttaagtatcgtaccaag acccgagcctcggccaccattctgggggcctggtttctctcttttctgtgggttattccc attctaggctggaatcacttcatgcagcagacctcggtgcgccgagaggacaagtgtgag acagacttctatgatgtcacctggttcaaggtcatgactgccatcatcaacttctacctg cccaccttgctcatgctctggttctatgccaagatctacaaggccgtacgacaacactgc cagcaccgggagctcatcaataggtccctcccttccttctcagaaattaagctgaggcca gagaaccccaagggggatgccaagaaaccagggaaggagtctccctgggaggttctgaaa aggaagccaaaagatgctggtggtggatctgtcttgaagtcaccatcccaaacccccaag gagatgaaatccccagttgtcttcagccaagaggatgatagagaagtagacaaactctac tgctttccacttgatattgtgcacatgcaggctgcggcagaggggagtagcagggactat gtagccgtcaaccggagccatggccagctcaagacagatgagcagggcctgaacacacat ggggccagcgagatatcagaggatcagatgttaggtgatagccaatccttctctcgaacg gactcagataccaccacagagacagcaccaggcaaaggcaaattgaggagtgggtctaac acaggcctggattacatcaagtttacttggaagaggctccgctcgcattcaagacagtat gtatctgggttgcacatgaaccgcgaaaggaaggccgccaaacagttgggttttatcatg gcagccttcatcctctgctggatcccttatttcatcttcttcatggtcattgccttctgc aagaactgttgcaatgaacatttgcacatgttcaccatctggctgggctacatcaactcc acactgaaccccctcatctaccccttgtgcaatgagaacttcaagaagacattcaagaga attctgcatattcgctcctaa >gi568815595f:11159038_11360498|GENSCAN_predicted_peptide_6|159_aa MPLLLCARAPLPSGKRGQDRVASSGRAPQRELWLPEVERRLLKPHNAGIQCIRSQKLKLM DTRLHVKQLRTYAVCQRDLVMWFGAYFNDCQSLQPCSWENENRSLLCVLHFGPEKALQGQ IAVILALLLSEWPHMGSEKSDTECSLPFRTCSFGLLDDL >gi568815595f:11159038_11360498|GENSCAN_predicted_CDS_6|480_bp atgcctctgctcctttgcgcacgcgcgccgcttcccagtggcaagcgcgggcaggaccgc gttgcgtcatcggggcgcgcgcctcagagagagctgtggttgccggaagttgagcggcgg cttctcaaaccccataatgctggtatccagtgcatccgttctcaaaaactgaagctgatg gacaccagattgcatgtgaagcagctaagaacatatgctgtttgtcagcgggacttggtc atgtggtttggtgcttattttaatgactgccagagcttgcagccctgcagctgggaaaat gagaatagaagcctgctctgtgtgctccattttggcccagagaaggcacttcagggccaa attgcagttattctggcacttctgctaagtgaatggccccacatgggctcagagaaatcg gacactgagtgctcgctgccctttagaacttgttcttttggacttctcgatgacctgtag >gi568815595f:11159038_11360498|GENSCAN_predicted_peptide_7|463_aa MAAATGDPGLSKLQFAPFSSALDVGFWHELTQKKLNEYRLDEAPKDIKGYYYNGDSAGLP ARLTLEFSAFDMSAPTPARCCPAIGTLYNTNTLESFKTADKKLLLEQAANEIWESIKSGT ALENPVLLNKFLLLTFAIEALECAYDNLCQTEGVTALPYFLIKYDENMVLVSLLKHYSDF FQGQRTKSFFLLRVFSSYFLPGTMQGTGGADKNLIAPAIKKFCSAVSSSFQSVEVVCFRD RTMQGARDVAHSIIFEVKLPEMAFSPDCPKAVGWEKNQKGGMGPRMVNLSECMDPKRLAE SSVDLNLKLMCWRLVPTLDLDKVVSVKCLLLGAGTLGCNVARTLMGWGVRHITFVDNAKI SYSNPVRQPLYEFEDCLGGGKPKALAAADRLQKIFPGVNARGFNMSIPMPGHPVNFSSVT LEQARRDVEQLEQLIESHDVVFLLMDTRESRWLPAVIAASKRK >gi568815595f:11159038_11360498|GENSCAN_predicted_CDS_7|1389_bp atggcggcagctacgggggatcctggactctctaaactgcagtttgccccttttagtagt gccttggatgttgggttttggcatgagttgacccagaagaagctgaacgagtatcggctg gatgaagctcccaaggacattaagggttattactacaatggtgactctgctgggctgcca gctcgcttaacattggagttcagtgcttttgacatgagtgctcccaccccagcccgttgc tgcccagctattggaacactgtataacaccaacacactcgagtctttcaagactgcagat aagaagctccttttggaacaagcagcaaatgagatatgggaatccataaaatcaggcact gctcttgaaaaccctgtactcctcaacaagttcctcctcttgacatttgcaattgaagca ctagagtgtgcatatgataatctttgtcaaacagaaggagtcacagctcttccttacttc ttaatcaagtatgatgagaacatggtgctggtttccttgcttaaacactacagtgatttc ttccaaggtcaaaggacgaagtcttttttccttctacgtgtatttagctcctactttctg ccaggcacaatgcaaggcacagggggtgcagacaagaatttgatagcccctgctatcaag aagttctgtagtgcagtgagtagcagtttccagtctgttgaagttgtttgcttccgtgac cgtaccatgcagggggcgagagacgttgcccacagcatcatcttcgaagtgaagcttcca gaaatggcatttagcccagattgtcctaaagcagttggatgggaaaagaaccagaaagga ggcatgggaccaaggatggtgaacctcagtgaatgtatggaccctaaaaggttagctgag tcatcagtggatctaaatctcaaactgatgtgttggagattggttcctactttagacttg gacaaggttgtgtctgtcaaatgtctgctgcttggagccggcaccttgggttgcaatgta gctaggacgttgatgggttggggcgtgagacacatcacatttgtggacaatgccaagatc tcctactccaatcctgtgaggcagcctctctatgagtttgaagattgcctagggggtggt aagcccaaggctctggcagcagcggaccggctccagaaaatattccccggtgtgaatgcc agaggattcaacatgagcatacctatgcctgggcatccagtgaacttctccagtgtcact ctggagcaagcccgcagagatgtggagcaactggagcagctcatcgaaagccatgatgtc gtcttcctattgatggacaccagggagagccggtggcttcctgccgtcattgctgcaagc aagagaaag