GENSCAN 1.0	Date run: 8-Nov-116	Time: 00:37:33

Sequence gi568815595r:131362726_131602821 : 240096 bp : 39.79% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    631    841  211  1  1  126   55   125 0.970   8.78
 1.02 PlyA +   1774   1779    6                               1.05

 2.00 Prom +  16476  16515   40                              -2.05
 2.01 Init +  19080  19217  138  2  0  103   85   250 0.970  24.39
 2.02 Intr +  19321  19590  270  0  0  127   59   518 0.999  49.62
 2.03 Term +  20437  20616  180  0  0  135   44   140 0.999  10.93
 2.04 PlyA +  21937  21942    6                               1.05

 3.00 Prom +  27328  27367   40                              -4.55
 3.01 Init +  34545  34566   22  2  1   74  106     9 0.007   0.92
 3.02 Intr +  45436  47605 2170  2  1   46   53   740 0.032  52.67
 3.03 Intr +  58900  58993   94  1  1   96  110    35 0.639   5.55
 3.04 Term +  69967  70281  315  0  0   53   42   189 0.754   4.76
 3.05 PlyA +  70441  70446    6                               1.05

 4.08 PlyA -  70603  70598    6                               1.05
 4.07 Term -  79124  79066   59  0  2  129   49    59 0.135   2.97
 4.06 Intr - 100150 100024  127  1  1   91   78   127 0.418  11.33
 4.05 Intr - 108554 108446  109  1  1   56   59    79 0.434   1.17
 4.04 Intr - 125015 124955   61  0  1   56   61    79 0.012  -1.13
 4.03 Intr - 127355 127256  100  2  1   90   91    99 0.476   9.16
 4.02 Intr - 135552 135454   99  0  0   99   58    48 0.390   2.29
 4.01 Init - 137755 137705   51  1  0   83  110    30 0.554   5.91
 4.00 Prom - 138754 138715   40                              -8.45

 5.00 Prom + 138986 139025   40                              -5.55
 5.01 Init + 139447 139495   49  0  1   66   55     6 0.442  -3.74
 5.02 Term + 139927 140363  437  2  2   70   44   487 0.682  36.96
 5.03 PlyA + 140414 140419    6                               1.05

 6.00 Prom + 148187 148226   40                              -8.05
 6.01 Init + 149147 149210   64  1  1   47  111    61 0.164   5.66
 6.02 Intr + 155109 155184   76  1  1  107  107    75 0.702   8.95
 6.03 Term + 155992 156220  229  1  1   65   44   146 0.911   3.02
 6.04 PlyA + 156252 156257    6                               1.05

 7.00 Prom + 157607 157646   40                              -6.65
 7.01 Init + 163734 163884  151  2  1   70   65   115 0.277   7.65
 7.02 Intr + 164520 164608   89  1  2   58   67   128 0.734   6.47
 7.03 Term + 167013 167105   93  2  0   93   39   100 0.780   2.45
 7.04 PlyA + 168040 168045    6                               1.05

 8.03 PlyA - 168169 168164    6                               1.05
 8.02 Term - 172604 172470  135  2  0  121   43   144 0.972  10.24
 8.01 Init - 180023 179832  192  2  0   99   34   289 0.931  21.61
 8.00 Prom - 180274 180235   40                              -7.55

 9.15 PlyA - 180576 180571    6                               1.05
 9.14 Term - 181297 181034  264  1  0  102   38   135 0.681   4.42
 9.13 Intr - 187355 187222  134  0  2   89   99   149 0.993  15.54
 9.12 Intr - 189766 189715   52  1  1   59  103    55 0.127   1.66
 9.11 Intr - 192826 192772   55  0  1   87  113    14 0.089   1.86
 9.10 Intr - 201624 201491  134  0  2  125  110    92 0.995  13.62
 9.09 Intr - 202045 201918  128  2  2   45   85    38 0.752  -1.32
 9.08 Intr - 204418 204321   98  0  2   58   94    80 0.678   4.33
 9.07 Intr - 206345 206057  289  0  1   62   66    88 0.195  -0.62
 9.06 Intr - 208802 208709   94  2  1   14   68   107 0.111  -0.08
 9.05 Intr - 212405 212346   60  2  0  108   98    25 0.631   3.51
 9.04 Intr - 218940 218854   87  0  0  100   95    68 0.731   7.95
 9.03 Intr - 224857 224759   99  1  0   77   88    59 0.571   4.19
 9.02 Intr - 226647 226516  132  0  0   71   74    41 0.420   0.92
 9.01 Init - 236059 235988   72  1  0   47   83    75 0.365   4.02

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:131362726_131602821|GENSCAN_predicted_peptide_1|70_aa
XCWAWCECPCTLHDGVGGLPIFLENCFIGAAWEQLLDTLQDLGVVEAGAFSGLQIPVRLR
QPSVDSGNLS

>gi568815595r:131362726_131602821|GENSCAN_predicted_CDS_1|213_bp
nngtgctgggcctggtgtgagtgcccctgtaccctgcatgatggtgtgggaggcctgccc
atctttctggagaattgctttattggtgctgcctgggagcagttactggataccctccag
gacttgggagtggtggaagctggtgctttctcaggacttcagatcccagttcgtctgagg
caaccctctgtggactcagggaacctgagctga

>gi568815595r:131362726_131602821|GENSCAN_predicted_peptide_2|195_aa
MAGARRLELGEALALGSGWRHACHALLYAPDPGMLFGRIPLRYAILMQMRFDGRLGFPGG
FVDTQDRSLEDGLNRELREELGEAAAAFRVERTDYRSSHVGSGPRVVAHFYAKRLTLEEL
LAVEAGATRAKDHGLEVLGLVRVPLYTLRDGVGGLPTFLENSFIGSAREQLLEALQDLGL
LQSGSISGLKIPAHH

>gi568815595r:131362726_131602821|GENSCAN_predicted_CDS_2|588_bp
atggccggagcccgcaggctggagctaggcgaggccctggcgctggggtcgggctggcgt
catgcgtgccacgctctcctctacgcgccggaccctgggatgctcttcggccgcatcccg
ctgcgctacgccatactgatgcagatgcgcttcgatggacgcctgggcttccccggcgga
ttcgtggacacgcaggacagaagcctagaggacgggctgaaccgcgagctgcgcgaggag
ctgggcgaagcggctgccgctttccgcgtggagcgcactgactaccgcagctcccacgtc
gggtcagggccacgcgttgtggcccacttctatgccaagcgtctgacgctcgaggagctg
ttggctgtggaggccggcgcaacacgcgccaaggaccacgggctggaggtgctgggcctg
gtgcgagtgcccctgtataccctgcgggatggtgtaggaggcctgcctaccttcctggag
aattcctttattggctctgcgcgggagcagttacttgaagctctccaggacttgggactg
ctgcagtctggctctatttcaggccttaagattccagctcatcactag

>gi568815595r:131362726_131602821|GENSCAN_predicted_peptide_3|866_aa
MDRKKRVEIQTTIREYYKHLYTNKLENLEEMDTFLDTYTLPRLNEEEVESLNRPITGSEI
VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP
KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRK
SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPT
ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS
LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELP
FTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAIL
PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLY
YKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWE
NWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKT
PKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELK
QIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVR
MAIIKKSGNNRLMGSSTATWVFSLQWVSENLCHKFWAWGSGQTSKTSQEGQGQTSPDCKD
YNKYLTLQCPDISEHPQALTIQKNMTSPNILNKAPVTNPRVTKICDFSDRKFKIAVLRKL
NEIQDNTEKHSRILSDKLNKEIEIIF

>gi568815595r:131362726_131602821|GENSCAN_predicted_CDS_3|2601_bp
atggacaggaagaaaagggtggaaatacaaactaccatcagagaatactacaaacacctc
tacacaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc
ccaagactaaacgaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt
gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc
gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca
atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca
aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt
gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt
atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaa
tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca
atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat
aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca
gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga
cagggatgccctctctcaccgctcctattcaacatagtgttggaagttctggccagggca
atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc
ctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctc
cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca
caagcattcttatacaccaacaacagacaaacagagagccaaatcatgagtgaactccca
ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag
gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaa
tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg
cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca
gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag
tcaatcctaagccaaaagaacaaagcgggaggcatcacactacctgacttcaaactatac
tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa
tggaacagaacagagccctcagaaataatgccacatatctacaactatctgatctttgac
aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa
aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa
atcaattcaagatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaa
gaaaacctaggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaaca
ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc
ttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaa
attttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa
caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacaga
cacttctcaaaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatca
ctggccatcagagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttaga
atggcaatcattaaaaagtcaggaaacaacaggttgatgggatcttctacggccacttgg
gtgttctcccttcagtgggtatcagagaatctttgccataaattttgggcatggggtagt
ggacagacatcaaagacttctcaagaaggacaggggcaaacaagcccagactgcaaagac
tacaataaatacctaactcttcaatgcccagacatcagtgaacatccacaagcattgacc
atccagaaaaacatgacctcaccaaacatactaaataaggcaccagtgaccaatcctaga
gtgacaaagatatgtgacttttcagacagaaaattcaaaatagctgttttgaggaagctc
aatgaaattcaagataacacagagaaacattccagaatcctgtcagataaacttaacaaa
gagattgaaataattttttaa

>gi568815595r:131362726_131602821|GENSCAN_predicted_peptide_4|201_aa
MPLWTKDGQKHVVTLLQVQDCHVLKYTSKENCNGKMATLSVGGKTVSRFRKATSILEFYR
ELGLPPKQTVKIFNITDNAAIKPGTPLYAAHFRPGQYVDVTAKTIGKGFQGVMKRWGFKG
QPATHGQTKTHRRPGAVATGVKDSKLPAYKDLGKNLPFPTYFPDGDEEELPEDLYDENVC
QPGPPNPSTVIAPTDPVAKAP

>gi568815595r:131362726_131602821|GENSCAN_predicted_CDS_4|606_bp
atgcctttatggaccaaggatggtcaaaagcatgtggtcacattacttcaggtacaagac
tgtcatgtcttaaaatatacgtcaaaggaaaactgtaatggaaaaatggcaaccctgtct
gtaggaggaaaaactgtatcacgttttcgtaaagctacatccatattggaattttaccgg
gaacttggattgccgccgaaacagacagttaaaatctttaatataacagataatgctgca
attaaaccaggcactcctctttatgctgctcactttcgtccaggacagtatgtggatgtc
acagccaaaactattggtaaaggttttcaaggtgtcatgaaaagatggggatttaaaggc
cagcctgctacgcatggtcaaacgaaaacccacaggagacctggagctgttgcaactggt
gtcaaagattctaaactgcctgcatataaggatctcggtaaaaatctaccattccctaca
tattttcctgatggagatgaagaggaactgccagaagatttgtatgatgaaaacgtgtgt
cagcccgggcctcccaaccccagcactgtgattgcccccacagacccagtagcaaaggct
ccatag

>gi568815595r:131362726_131602821|GENSCAN_predicted_peptide_5|161_aa
MTSEQVNIPRKTLVGGGEAQLHQAIPLLQDATVQVGHPHTFLPVPRAQGSTQAVPESTQH
LGADLRQQPPTRHGLAREDSTHDFRAPCRSAFRESPRHRHVDAVAVGKFSQWPPERSPAD
ALCDGKKAAGPTSGAFYNELRALESLPGNYSSASTVVLLWM

>gi568815595r:131362726_131602821|GENSCAN_predicted_CDS_5|486_bp
atgacctctgaacaggtgaacataccaagaaagaccttagtgggcggaggggaggcccaa
ctgcaccaggccattccactgcttcaggacgcaactgtgcaggtaggacaccctcacacc
ttcctacctgttccccgggcccagggcagcacccaggccgtccccgagtcgacccagcac
ctgggcgccgacctgcgtcagcagcctccaacccggcatggattagcccgggaagactcg
actcacgacttccgggcgccctgccgctctgctttcagggagtccccacgccaccgccac
gtggacgcagtagccgtggggaagttttcgcaatggccgccggaacggtcgccggccgat
gctctctgcgacggaaagaaagccgccggacccacttccggtgcgttttacaatgaactc
cgggcactcgagtcgctccccggaaactatagttctgcttccaccgttgtcctactttgg
atgtga

>gi568815595r:131362726_131602821|GENSCAN_predicted_peptide_6|122_aa
MRKGLTEIMGAVPEALRQYSPEHQDKNLSLRGTHSVAEKTKEVNRRSGKDWAADVGPVGG
MDRLQGGERALLAFRGHAVTVVPPFVVLDSNQAFQDRHGNLPICAAELRRDRIPLVTKEP
GR

>gi568815595r:131362726_131602821|GENSCAN_predicted_CDS_6|369_bp
atgagaaagggcttgactgaaataatgggggctgttcctgaagccttgaggcagtacagc
ccagagcatcaagacaaaaatctcagtcttcgaggaactcacagtgtagcagaaaagaca
aaggaggtaaacaggaggagtggcaaggactgggcagctgatgtgggtccagtaggagga
atggacaggctgcaagggggagaaagggcactgctagctttcaggggtcatgcggtgaca
gtggtccctccatttgtggtattagactccaaccaggcctttcaggatcgtcatggcaac
ttacccatctgtgccgctgagctgagacgagacaggattcccttagtaaccaaggagccc
gggagatga

>gi568815595r:131362726_131602821|GENSCAN_predicted_peptide_7|110_aa
MMTVGVEWRRWQVERRRWQWGPRSGASMAGSEELGLREDTLRVLAAFLRRAPLRSFCGEP
LLYARTAAMSVDAMPGVNDWLIASKKTGTSVLQLQPTTGVSLEADSFQSI

>gi568815595r:131362726_131602821|GENSCAN_predicted_CDS_7|333_bp
atgatgacagtaggagttgagtggaggagatggcaagttgagcggaggaggtggcagtgg
ggccccagatcaggtgcctccatggcaggctctgaagagctggggctccgggaagacacg
ctgagggtcctagctgccttccttaggcgtgctccactgcgttcgttctgtggtgagccc
ttgctgtatgcccggactgcagccatgagtgtggacgctatgcctggcgtcaatgactgg
ctgatagccagcaagaaaacagggacttcagttctgcaactgcaaccaacaaccggagtg
agcttggaagcagattctttccagagcatctag

>gi568815595r:131362726_131602821|GENSCAN_predicted_peptide_8|108_aa
MADTREAIVHASHLPMSVIIVGVGNADFSDMQMLDGDDGILRSPKGEPVLRDIVQFVPFR
NFKHASPAALAKSVLAEVPNQVVDYYNGKGIKPKCSSEMYESSRTLAP

>gi568815595r:131362726_131602821|GENSCAN_predicted_CDS_8|327_bp
atggccgacacccgggaggccattgtccatgcctcccacctccccatgtcagtcatcatc
gtgggagtagggaacgctgacttcagtgacatgcagatgctggacggtgatgatgggatt
ctgaggtcacccaagggagagcctgttcttcgagacatcgtccagttcgtgcccttcagg
aacttcaaacacgcatctccagctgccctggcaaagagcgtgctggctgaagtcccaaac
caagttgtggactattacaatggcaaaggaattaaaccaaaatgttcatcagaaatgtat
gaatcttccagaacactagcaccatga

>gi568815595r:131362726_131602821|GENSCAN_predicted_peptide_9|565_aa
MWLDIVELAGKKQKKEKGKSAIGKPDASFHSFCPWTSESRFFGLWTLGLASVASWGLSDL
QPQTEACTCIVWDWDSNGKHDFIGEFTSTFKEMRGAMEGKQVQWECINPKYKAKKKNYKN
SGTVILNLCKIHKMHSFLDYIMGGCQIQFTSIVDLVREIREATAAVGKMMNLALDTSEVL
SESFLKIEKNKNVTLFKKWSRPNSMFTYKVHCIGPNCTQLFKKKRKKAPQECGPKGRLQV
QLTRGCYTAASLRKRREGSQMTCLADEGKGSEEWGCFITPLTLQEFQSDNQHMGQIVSFS
IKRQKHDAFREFGFKGSYLCRVDGIVLPRECRSLFLVNCEIGGQIFYPYSCCLVAIDFTA
SNGDPRNSCSLHYIHPYQPNEYLKALVAVGEICQDYDSDKMFPAFGFGARIPPEYTVSHD
FAINFNEDNPECAGIQGVVEAYQSCLPKLQLYGPTNIAPIIQKVAKSASEETNTKEASAP
EHFPRLPCAAEYSTFPPIPISAESSSSFPDGKGPGRLQARSQDLKQEAMGALQPSASVPG
EELPPLWHMAFRGRDCNPDPLIAGL

>gi568815595r:131362726_131602821|GENSCAN_predicted_CDS_9|1698_bp
atgtggttggacatcgtagaattagctggtaaaaaacaaaagaaggaaaaaggaaaaagt
gccattggaaagccagatgcttccttccactctttctgcccttggacatcagaatccaga
ttctttggcctttggactctaggacttgcatcagtggcttcctgggggctctcagacctt
cagccacagactgaagcctgcacttgcatagtatgggactgggactccaatggcaagcat
gacttcattggagaattcacctcgacattcaaggagatgagaggagcaatggaagggaaa
caggtgcagtgggagtgcatcaatcccaagtacaaagccaagaagaagaattacaagaac
tcaggcactgtgattctgaatctgtgcaagattcacaagatgcattctttcttggactac
atcatgggtggctgccaaatccagtttacaagcatagtggacctggtcagggaaatacgc
gaggcaacagcagctgtaggcaagatgatgaatttggctttggacacgtcagaggtactt
tcagaaagttttctgaaaatagaaaagaataaaaatgtaacattatttaagaaatggtcc
agaccaaacagtatgttcacctacaaggttcattgtataggtccaaattgtacacagcta
tttaaaaagaaaagaaagaaagctccccaagaatgtggtcccaaaggtagactgcaggta
caactaaccagaggttgttacacagcagcttcccttagaaaacgtcgagaaggttcacaa
atgacctgtttggcagatgaagggaagggatctgaggagtggggttgtttcataacaccc
ttaacacttcaggagtttcagtcagacaaccagcacatgggacagattgttagctttagt
atcaagaggcagaagcatgatgccttcagagaatttggttttaagggatcttacctatgc
cgcgtagatggaattgttctgcccagggagtgtagaagtcttttcttagtgaactgtgaa
attggagggcagatattctatccttactcttgctgcctggtagctatagatttcactgcc
tcaaacggggaccccaggaacagctgttccttgcactacatccacccttaccaacccaat
gagtatctgaaagctttggtagctgtgggggagatttgccaagactatgacagtgacaaa
atgttccctgcctttgggtttggcgccaggatacctccagagtacacggtctctcatgac
tttgcaatcaactttaatgaagacaacccagaatgtgcaggaattcaaggagttgtggaa
gcctatcagagctgtcttcctaagctccaactctacggtcccaccaacattgcccccatc
atccagaaggttgccaagtcagcgtcagaggaaactaacaccaaggaggcatcggcccca
gagcactttccaaggcttccctgtgctgctgagtacagcacctttcctccaattcccatc
tctgctgagtcatcttccagctttccagatggaaaggggccaggcaggctgcaggccaga
agtcaggacttaaagcaggaagccatgggagcactgcaaccttctgcaagtgtgcctgga
gaagagctgccacctttgtggcacatggccttcagagggagagactgcaaccctgaccct
cttatagcaggactttag