GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:53:10 Sequence gi568815597f:103426071_103651777 : 225707 bp : 35.26% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 7026 6821 206 2 2 103 97 122 0.789 12.52 1.02 Intr - 21119 21052 68 2 2 62 116 55 0.123 2.48 1.01 Init - 22830 22621 210 0 0 51 72 106 0.058 4.13 1.00 Prom - 42555 42516 40 -5.65 2.00 Prom + 55665 55704 40 -4.25 2.01 Init + 60321 60936 616 2 1 71 60 174 0.037 8.44 2.02 Intr + 79551 79612 62 1 2 61 115 44 0.133 1.93 2.03 Intr + 93855 94051 197 2 2 63 86 111 0.153 5.69 2.04 Intr + 99997 100192 196 1 1 75 95 156 0.961 13.50 2.05 Intr + 101625 101672 48 2 0 68 86 46 0.548 0.36 2.06 Intr + 110846 110980 135 1 0 32 34 115 0.427 0.34 2.07 Intr + 111272 111414 143 1 2 30 68 225 0.477 12.93 2.08 Intr + 115280 115405 126 2 0 84 91 31 0.559 1.87 2.09 Intr + 117226 117377 152 1 2 34 91 79 0.414 1.69 2.10 Intr + 118871 119032 162 0 0 32 90 116 0.803 5.23 2.11 Intr + 120178 120272 95 2 2 35 100 21 0.339 -3.24 2.12 Intr + 120907 120965 59 0 2 61 103 17 0.360 -2.74 2.13 Term + 124871 125039 169 2 1 105 28 146 0.799 6.67 2.14 PlyA + 125179 125184 6 1.05 3.04 PlyA - 125334 125329 6 1.05 3.03 Term - 143912 143416 497 0 2 55 42 301 0.536 15.94 3.02 Intr - 144185 143927 259 2 1 76 2 249 0.303 11.21 3.01 Init - 144510 144310 201 0 0 99 2 163 0.700 7.76 3.00 Prom - 144745 144706 40 -8.45 4.00 Prom + 145486 145525 40 -9.75 4.01 Init + 145533 145700 168 2 0 78 94 75 0.926 5.12 4.02 Intr + 146040 146186 147 2 0 60 98 37 0.711 1.41 4.03 Intr + 146993 147190 198 1 0 74 99 180 0.999 16.33 4.04 Intr + 147638 147868 231 1 0 70 73 182 0.999 11.95 4.05 Intr + 148190 148323 134 1 2 75 33 130 0.993 4.72 4.06 Intr + 149371 149470 100 1 1 69 111 59 0.989 5.49 4.07 Intr + 151420 151538 119 0 2 16 100 94 0.989 1.74 4.08 Intr + 151650 151775 126 0 0 72 119 102 0.995 10.57 4.09 Intr + 190681 190747 67 1 1 33 109 88 0.046 3.39 4.10 Intr + 191419 191538 120 0 0 28 94 64 0.326 0.77 4.11 Intr + 191884 192030 147 0 0 70 98 37 0.831 2.41 4.12 Intr + 192841 193038 198 0 0 78 91 192 0.999 17.13 4.13 Intr + 193484 193714 231 1 0 70 73 144 0.994 8.15 4.14 Intr + 194481 194614 134 2 2 67 33 125 0.983 3.42 4.15 Intr + 195724 195823 100 1 1 50 111 75 0.983 5.19 4.16 Intr + 197796 197914 119 2 2 49 100 94 0.999 5.04 4.17 Intr + 198026 198151 126 2 0 72 119 85 0.996 8.87 4.18 Term + 199487 199676 190 2 1 79 38 96 0.885 -0.36 4.19 PlyA + 202458 202463 6 1.05 5.03 PlyA - 203050 203045 6 1.05 5.02 Term - 224328 224210 119 1 2 103 33 122 0.929 5.92 5.01 Intr - 225009 224904 106 0 1 32 16 149 0.582 0.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 93882 94051 170 2 2 92 86 93 0.808 8.55 S.002 Term + 153241 153430 190 1 1 89 38 93 0.952 0.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:103426071_103651777|GENSCAN_predicted_peptide_1|162_aa MNKKGKQPIVDMEKVFVVWIEDQTSHNIPVNLNLIQSKAVTLFHSVSTERGEEAAEEKSE TSRGWLMRFKKLPQLPQPSAITTLISQRPSTSRLARGIPTPEYVMRCVKSMAMDANSLCV SGFKQLYTLMLSYTQQQFVTSLGNSSYSFELQPNFLSITALX >gi568815597f:103426071_103651777|GENSCAN_predicted_CDS_1|486_bp atgaataagaaaggaaaacagcctattgttgatatggaaaaagttttcgtggtttggata gaagatcaaaccagccacaacattcctgtaaacctaaacctaatccagagtaaggctgta actctctttcattctgtgtcaactgagagaggtgaggaagctgcagaagaaaagtctgaa actagcagaggttggctcatgaggtttaagaaattgccacagctaccccaaccttcagca atcaccaccctgattagtcagcggccatcaacatcaagattggcacgagggataccaaca ccggaatatgtgatgagatgtgtgaagagcatggcaatggatgcaaactccctttgtgtt tctggctttaagcagctctatactctcatgctgtcctacactcagcagcaatttgtcaca agtttgggaaattcttcctactcttttgaactccagcccaatttcctttctatcactgcc ctagnn >gi568815597f:103426071_103651777|GENSCAN_predicted_peptide_2|719_aa MGKDFMSKTPKAMTIKAKIDKWDLIKLKSFCTAKATTISVNRKPTEWEKIFAICSSDKGL ISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMKKCSSSLAIREIQIKTT MRYHLTAVRMAIIKKSGNSRCWRGCGEVGTLLHSRWDCKLVQPLWKSVWRFLRDLELEIP FDPAIPLLGVYPKDYKSCCYKDTCTRNFNMKEKMRQMNTLKAHLDKYGKRIEEAEMVLTD NEEFNEKVICKHLKKVRELSMQISVGTASLKERMVKAKVLVMGLKCSSTARRKMAAPEQP LAISRGCTSSSSLSPPRGDRTLLVRHLPAELTAEEKEDLLKYFGAQSVRVLSDKGRLKHT AFATFPNEKAAIKILSKSRAEGYFVLLSHEKHLNCFDHVPSQVDEDIATQSADQEGRIYE DYMPLHAPLPPTSPQPPEEPPLPDEDEELSSEESEYESTDDEDRQRMNKLMELANLQPKR PKTIKQRHVRKKRKIKDMLNTPLCPSHSSLHPVLLPSDVFDQPQPVGNKRIEFHISTDMP AAFKKDLEKEQNCEEKNHDLPATEVDASNIGFGKIFPKPNLDITEEIKEDSDEMPSECIS RRELEKGRISREEMETLSVFRSYEPGEPNCRIYVKNLAKHVQEKDLKYIFGRYVDFSSET QRIMFDIRLMKEGRMKGQAFIGLPNEKAAAKALKEANGYVLFGKPMVVVSFKSIYFKTI >gi568815597f:103426071_103651777|GENSCAN_predicted_CDS_2|2160_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatgacaataaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagcaactaccatcagtgtg aacaggaaacctacagaatgggagaaaatttttgcaatctgctcatctgacaaagggcta atatccagaatctacaatgaactcaaacaaatttacaagaaaaaaactaacaaccccatc aaaaagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagccaaa agacacatgaaaaaatgctcatcatcactggccataagagaaatacaaatcaaaaccaca atgagataccatctcacagcagttagaatggcgatcattaaaaagtcaggaaacagcagg tgctggagaggatgtggagaagtaggaacacttttacacagtcggtgggactgtaaacta gttcaaccattgtggaagtcagtgtggcgattcctcagggatctagaactagaaatacca tttgacccagccatcccattactgggtgtatacccaaaggattataaatcatgctgctat aaagacacatgcacacgtaattttaacatgaaagaaaaaatgcgacaaatgaatacactt aaagcacacttggacaagtatgggaagagaatagaggaggcagagatggtgttgactgac aatgaagaatttaatgagaaagtcatatgtaagcatctgaagaaagtcagggagctaagc atgcagatatctgtaggaacagcatccctgaaagagagaatggtaaaggcaaaggtctta gttatgggtctgaagtgttcaagcacagccagaaggaaaatggcagctcccgagcagccg cttgcgatatcaaggggatgcacgagctcctcctcgctttccccgcctcggggcgaccga acccttctggtcaggcacctgccggctgagcttactgctgaggagaaagaggacttgctg aagtacttcggggctcagtctgtgcgggtcctgtcagataaggggcgactgaaacataca gcttttgccacattccctaatgaaaaagcagctataaagatcctttccaaaagtagagca gagggatattttgttctactgagccacgaaaaacacctgaattgtttcgaccatgtgcct tcccaggttgatgaagacattgctacacagtctgcagatcaggaaggaagaatttatgaa gactatatgccattgcatgcacctcttccacccacatctcctcagccacctgaggaacct cctttgccagacgaggatgaggaattatctagtgaagaatcagaatatgaaagcactgat gatgaggaccgacagagaatgaacaaattaatggaactagcaaatcttcagcccaaaaga cctaaaacaataaagcagcgccatgtgagaaaaaagagaaaaataaaggatatgttgaat acacctttgtgtccttcacacagcagtttacatccagtgctgttaccttcagatgtattt gaccaaccacaacctgtaggtaacaaaagaattgaattccatatatctaccgacatgcca gctgcatttaagaaagatttagaaaaggaacaaaattgtgaggaaaaaaatcatgattta cctgctactgaagttgatgcatccaatataggatttggaaaaatcttccccaaacctaat ttggacatcacagaggagattaaagaagactctgatgaaatgccttcagaatgtatttct agaagggaattggaaaagggcagaatttctagagaagaaatggaaacactttcagttttc agaagttatgaaccgggtgaaccaaactgtagaatttatgtaaagaatttagctaaacat gttcaagaaaaggaccttaaatatatttttggaagatatgttgacttttcatcagaaaca cagcggatcatgtttgatatacgtttgatgaaagaaggtcgtatgaaaggacaagctttc attggacttcctaatgaaaaagcagcagcaaaagccttaaaggaagctaatggatatgtg ctttttggaaaacccatggtggttgtatcctttaaatccatctatttcaaaactatataa >gi568815597f:103426071_103651777|GENSCAN_predicted_peptide_3|318_aa MEPLIHTEYLCSGDAIILIFMVLGARVVISFCILSVMPGYMLPPDSTVLAYRSLRMSTSH FMMELKMGGGGCGSGHLLLKVQGNIAQFLLDVVSDLPLGSGAEAIANLGKDLHEVVGQVL ASQVQMQNGVREGVALIDGHHVGDPISKAHDNASVEGQPSLDGHVHGQGVEGIKHDLSHL LSVGLEVQGVLGQQHQVLLQGHVQSLVEGVMPDLLYVIPVGHDTMLNGGTSGSGYCACSQ PCSPHRSPSGPCPPSCPGAGGTDDGRKHGWGLSSPAKPALHMLEPLSITSTAISSSIATG GGVGQWSGRTTWCMGWQR >gi568815597f:103426071_103651777|GENSCAN_predicted_CDS_3|957_bp atggagccgctgatccacacggagtacttgtgctctggggatgcgatcatcttgatcttc atggtgctaggtgcccgggtggtgatctccttctgcatcctgtcggtgatgcctgggtac atgttgccaccagatagcaccgtgttggcctacaggtctttgcggatgtccacgtcacac ttcatgatggagttgaagatgggaggaggaggatgtggcagtggccatctactgctcaaa gtccagggcaacatagcacagtttctccttgacgtagtgtctgatctcccgctcggcagt ggtgctgaagctatagccaaccttggtaaggatcttcatgaggtagttggtcaggtcctg gccagccaggtccagatgcagaatggtgtcagggagggcgtagcccttatagatgggcac catgtgggtgaccccatctccaaagcccatgacaatgccagtgtagagggacagcctagc ctggatggccacgtacatggccagggtgttgaaggtatcaaacatgatctgagtcatctt ctctctgttggccttgaggttcagggggtcctcggtcagcagcaccaggtgctcctccaa ggccatgtgcagtcccttgtagaaggtgtgatgccagatcttctctatgtcatcccagtt ggtcatgataccatgcttaatgggggtacttcagggtcaggatactgtgcttgctctcag ccatgtagcccacataggagtccctctggcccatgcccaccatcatgccctggtgccggg ggcaccgatgatggaaggaaacatggctgggggttgtcgtccccagcaaagccagctttg cacatgctggagccattgtcaatcaccagtacggcaatctcttcttccattgcgactggt ggaggagtgggacagtggagcggcaggacaacatggtgcatgggctggcagaggtga >gi568815597f:103426071_103651777|GENSCAN_predicted_peptide_4|884_aa MKFFLLLFTIGFCWAQYSPNTQQGRTSIVHLFEWRWVDIALECERYLAPKGFGGVQVSPP NENVAIHNPFRPWWERYQPVSYKLCTRSGNEDEFRNMVTRCNNVGVRIYVDAVINHMSGN AVSAGTSSTCGSYFNPGSRDFPAVPYSGWDFNDGKCKTGSGDIENYNDATQVRDCRLVGL LDLALEKDYVRSKIAEYMNHLIDIGVAGFRLDASKHMWPGDIKAILDKLHNLNSNWFPAG SKPFIYQEVIDLGGEPIKSSDYFGNGRVTEFKYGAKLGTVIRKWNGEKMSYLKLYKMAVG FMLAHPYGFTRVMSSYRWPRQFQNGNDVNDWVGPPNNNGVIKEVTINPDTTCGNDWVCEH RWRQIRNMVNFRNVVDGQPFTNWYDNGSNQVAFGRGNRGFIVFNNDDCLYYTWLGHFMAK NMLVEDQSGSYSPNTQQGRTSIVHLFEWRWVDIALECERYLAPKGFGGVQVSPPNENVAI YNPFRPWWERYQPVSYKLCTRSGNEDEFRNMVTRCNNVGVRIYVDAVINHMCGNAVSAGT SSTCGSYFNPGSRDFPAVPYSGWDFNDGKCKTGSGDIENYNDATQVRDCRLTGLLDLALE KDYVRSKIAEYMNHLIDIGVAGFRLDASKHMWPGDIKAILDKLHNLNSNWFPAGSKPFIY QEVIDLGGEPIKSSDYFGNGRVTEFKYGAKLGTVIRKWNGEKMSYLKLYKMAVGFMLAHP YGFTRVMSSYRWPRQFQNGNDVNDWVGPPNNNGVIKEVTINPDTTCGNDWVCEHRWRQIR NMVIFRNVVDGQPFTNWYDNGSNQVAFGRGNRGFIVFNNDDWSFSLTLQTGLPAGTYCDV ISGDKINGNCTGIKIYVSDDGKAHFSISNSAEDPFIAIHAESKL >gi568815597f:103426071_103651777|GENSCAN_predicted_CDS_4|2655_bp atgaagttctttctgttgcttttcaccattgggttctgctgggctcagtattccccaaat acacaacaaggacggacatctattgttcatctgtttgaatggcgatgggttgatattgct cttgaatgtgagcgatatttagctcccaagggatttggaggggttcaggtctctccacca aatgaaaatgttgcaattcacaaccctttcagaccttggtgggaaagataccaaccagtt agctataaattatgcacaagatctggaaatgaagatgaatttagaaacatggtgactaga tgtaacaatgttggggttcgtatttatgtggatgctgtaattaatcatatgtctggtaat gctgtgagtgcaggaacaagcagtacctgtggaagttacttcaaccctggaagtagggac tttccagcagtcccatattctggatgggattttaatgatggtaaatgtaaaactggaagt ggagatatcgagaactacaatgatgctactcaggtcagagattgtcgtctggttggtctt cttgatcttgcactggagaaagattatgtgcgttccaagattgccgaatatatgaatcat ctcattgacattggtgttgcagggttcagacttgatgcttccaagcacatgtggcctgga gacataaaggcaattttggacaaactgcataatctaaacagtaactggttccctgcagga agtaaacctttcatttaccaggaggtaattgatctgggtggtgagccaattaaaagcagt gactactttggaaatggccgggtgacagaattcaagtatggtgcaaaactcggcacagtt attcgcaagtggaatggagagaagatgtcttacctaaagctgtataaaatggcagttgga tttatgcttgctcatccttatggttttacacgagtaatgtcaagctaccgttggccaaga cagtttcaaaatggaaacgatgttaatgattgggttgggccaccaaataataatggagta attaaagaagttactattaatccagacactacttgtggcaatgactgggtctgtgaacat cgatggcgccaaataaggaacatggttaatttccgcaatgtagtggatggccagcctttt acaaactggtatgataatgggagcaaccaagtggcttttgggagaggaaacagaggattc attgttttcaacaatgatgactgtctttactacacgtggcttggtcacttcatggctaaa aacatgcttgtggaagaccagtctggctcgtattccccaaatacacaacaaggacggaca tctattgttcatctgtttgaatggcgatgggttgatattgctcttgaatgtgagcgatat ttagctccgaagggatttggaggggttcaggtctctccaccaaatgaaaatgttgcaatt tacaaccctttcagaccttggtgggaaagataccaaccagttagctataaattatgcaca agatctggaaatgaagatgaatttagaaacatggtgactagatgtaacaatgttggggtt cgtatttatgtggatgctgtaattaatcatatgtgtggtaacgctgtgagtgcaggaaca agcagtacctgtggaagttacttcaaccctggaagtagggactttccagcagtcccatat tctggatgggatttcaatgatggtaaatgtaaaactggaagtggagatatcgagaactac aatgatgctactcaggtcagagattgtcgtctgactggtcttcttgatcttgcactggag aaggattacgtgcgttctaagattgccgaatatatgaaccatctcattgacattggtgtt gcagggttcagacttgatgcttccaagcacatgtggcctggagacataaaggcaattttg gacaaactgcataatctaaacagtaactggttccctgcaggaagtaaacctttcatttac caggaggtaattgatctgggtggtgagccaattaaaagcagtgactactttggtaatggc cgggtgacagaattcaagtatggtgcaaaactcggcacagttattcgcaagtggaatgga gagaagatgtcttacttaaagctgtacaaaatggcagttggatttatgcttgctcatcct tacggatttacacgagtaatgtcaagctaccgttggccaagacagtttcaaaatggaaac gatgttaatgattgggttgggccaccaaataataatggagtaattaaagaagttactatt aatccagacactacttgtggcaatgactgggtctgtgaacatcgatggcgccaaataagg aacatggttattttccgcaatgtagtggatggccagccttttacaaattggtatgataat gggagcaaccaagtggcttttgggagaggaaacagaggattcattgttttcaacaatgat gactggtcattttctttaactttgcaaactggtcttcctgctggcacatactgtgatgtc atttctggagataaaattaatggcaattgcacaggcattaaaatttacgtttctgatgat ggcaaagctcatttttctattagtaactctgctgaagatccatttattgcaattcatgct gaatctaaattgtaa >gi568815597f:103426071_103651777|GENSCAN_predicted_peptide_5|74_aa HNAKQGPSVPRGIEASGAAHFEDLQVDFTEMPKCRSDIVWIKDWNVAPLWPQWKGPQTIN LTTPTAVNVEGIPA >gi568815597f:103426071_103651777|GENSCAN_predicted_CDS_5|225_bp cacaatgcgaagcaaggcccctctgtacctcggggaatagaagcctctggagcagctcat tttgaagatcttcaagtggacttcacagaaatgcctaaatgtagaagtgacattgtgtgg atcaaggactggaacgtggctccgctgtggccacagtggaaaggaccccagacaattaac ctgaccactcccacagctgtcaatgtagaaggaatcccagcctag