GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:42:07 Sequence gi568815597r:167446100_167653741 : 207642 bp : 44.95% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12153 12246 94 2 1 80 94 64 0.049 4.77 1.02 Intr + 16129 16247 119 2 2 80 79 29 0.033 1.38 1.03 Intr + 52144 52262 119 0 2 78 68 79 0.051 4.16 1.04 Intr + 55709 55783 75 2 0 125 75 42 0.056 5.23 1.05 Intr + 71303 71471 169 2 1 95 78 46 0.011 4.25 1.06 Intr + 73431 73581 151 2 1 36 66 97 0.019 2.04 1.07 Term + 74043 74434 392 1 2 -21 43 210 0.438 0.75 1.08 PlyA + 75318 75323 6 1.05 2.00 Prom + 76209 76248 40 -3.26 2.01 Init + 86660 86749 90 1 0 97 78 90 0.971 9.39 2.02 Intr + 89332 89434 103 0 1 66 43 40 0.168 -2.95 2.03 Term + 94211 94359 149 0 2 75 49 144 0.663 7.26 2.04 PlyA + 94856 94861 6 1.05 3.05 PlyA - 95718 95713 6 1.05 3.04 Term - 96202 96199 4 0 1 132 48 0 0.152 -3.22 3.03 Intr - 100186 100002 185 1 2 66 89 99 0.516 6.39 3.02 Intr - 102022 101903 120 1 0 53 115 67 0.929 6.59 3.01 Init - 107642 107289 354 2 0 111 99 764 0.986 74.94 3.00 Prom - 117091 117052 40 -2.46 4.07 PlyA - 117306 117301 6 -0.45 4.06 Term - 117716 117569 148 1 1 78 51 64 0.127 -0.93 4.05 Intr - 123381 123293 89 0 2 112 86 17 0.265 2.67 4.04 Intr - 129169 129047 123 1 0 64 100 36 0.025 3.18 4.03 Intr - 141430 141298 133 0 1 60 37 122 0.007 4.95 4.02 Intr - 151266 151188 79 1 1 55 66 34 0.004 -3.49 4.01 Init - 152232 152139 94 0 1 100 86 26 0.251 4.14 4.00 Prom - 162931 162892 40 -3.06 5.00 Prom + 171016 171055 40 -6.56 5.01 Init + 177223 177302 80 0 2 75 74 49 0.248 0.78 5.02 Intr + 184144 184365 222 1 0 136 89 75 0.082 9.64 5.03 Intr + 200637 200732 96 0 0 114 80 44 0.018 5.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 72366 72309 58 0 1 86 91 113 0.952 11.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:167446100_167653741|GENSCAN_predicted_peptide_1|372_aa MLFTVPAVPTCLALVWAGMSGIGMFSHRSFSGHGPSWDLGLLLGYQPASLKPLPKCHWGP VTIEEASVGHTVRSTAKTNNALTSFYMNPHLPNIYFGMLRSLRSQKSRDTGVRMPLQTTV CGAEMSELEQRRGARRLKFSLPHFSGRCHEADPPEQFVLSSFHWFSSSLPGLENMVEAKG LQQSWAPTIGNEDTMKKVSPANHRIPLSDGNSIPIIRLGTYSEPKLTPKGADDLLKATQQ VGAGQLELIQNNPGLKPKPVSNEVECHPYYTQPKLLKFRQQHDIVMIAYSPLGTTRNPTW TNMSSLPSLKDALLKRCNKITAQVILRFNIQQGVVVIPKSFNPEKIKENLQAFDFSLTES RTLKPGVKMSAS >gi568815597r:167446100_167653741|GENSCAN_predicted_CDS_1|1119_bp atgctcttcaccgtccctgctgtccccacgtgcctagccctggtgtgggcaggaatgagt gggattgggatgttttctcacaggtctttctcaggacatggaccaagttgggatttaggg ttgctcttgggttaccagccagcgtccctgaagcccctcccaaaatgtcactggggccct gtgaccatagaagaggcctctgtgggccacacagtaagatcaactgctaaaaccaacaat gccttgacatccttctacatgaatccacatcttcctaacatctattttggaatgctcagg tccctcaggagtcagaagagcagggacactggggtccgtatgccacttcagaccactgta tgtggagccgagatgtcagagctggagcagagaagaggagcacgcagattgaaattcagc ctgccccacttctcaggaagatgccacgaggctgatccccctgagcagtttgtcctgtcc tcttttcactggttttcttcctcattacctgggctagaaaacatggtggaagctaaaggc ttgcagcaaagctgggctcccacgatcgggaatgaggacactatgaagaaagtgagtcct gcaaatcaccgcatacctctaagtgatggaaacagcattcccatcatcagacttggtacc tactcagagcctaaattgacccctaagggggccgatgacttactcaaggccacacagcaa gtaggtgcaggccagctggagctcatccagaacaacccaggactcaaacccaagccagtc agcaacgaggttgagtgccacccatattacacccaaccaaaacttttgaaatttcgccaa caacatgacattgtcatgattgcatatagccctttggggaccactaggaatccaacctgg acaaatatgtcttctctgccttcgttaaaggatgcacttctaaaaaggtgcaataagatc acagctcaagttattttgcgtttcaacatccagcaaggggtggttgtcattcctaaaagc tttaatcctgaaaagatcaaggaaaaccttcaggcctttgacttttctctcactgaatca aggacattgaagcctggagtgaaaatgtccgcttcatag >gi568815597r:167446100_167653741|GENSCAN_predicted_peptide_2|113_aa MGNGVTVLVERVGDEEEERGGGVDKPSYGELGPLCRIQPAKACRKTTVQSRNGLETIEDK SSCVAGGVQGDQKPDVEAHGVEGMINPGKPSSSSSSSSSSSSKSGLCFSSCPC >gi568815597r:167446100_167653741|GENSCAN_predicted_CDS_2|342_bp atggggaatggggtcactgtcttggtggaaagggtgggagatgaagaggaggaaagagga ggtggtgtggacaagccctcctatggggagctggggcctttgtgcaggatacaacctgca aaagcgtgtcgtaaaaccacagtccagagcaggaatggactagaaaccattgaggataaa tcttcttgcgtggcaggtggggtgcaaggagaccagaagcctgatgtagaagcacatggt gtggagggaatgataaacccaggaaaacctagcagcagcagcagcagcagcagcagcagc agcagcaaatcaggcctctgcttctcctcttgcccctgctga >gi568815597r:167446100_167653741|GENSCAN_predicted_peptide_3|220_aa MAGLSRGSARALLAALLASTLLALLVSPARGRGGRDHGDWDEASRLPPLPPREDAARVAR FVTHVSDWGALATISTLEAVRGRPFADVLSLSDGPPGAGSGVPYFYLSPLQLSVSNLQEN PYATLTMTLAQTNFCKKHGFDPQSPLCVHIMLSGTVTKVNETEMDIAKHSLFIRHPEMKT WPSSHNWFFAKLNITNIWVLDYFGGPKIVTPEEYYNVTVQ >gi568815597r:167446100_167653741|GENSCAN_predicted_CDS_3|663_bp atggccgggctatcccgcgggtccgcgcgcgcactgctcgccgccctgctggcgtcgacg ctgttggcgctgctcgtgtcgcccgcgcggggtcgcggcggccgggaccacggggactgg gacgaggcctcccggctgccgccgctaccaccccgcgaggacgcggcgcgcgtggcccgc ttcgtgacgcacgtctccgactggggcgctctggccaccatctccacgctggaggcggtg cgcggccggcccttcgccgacgtcctctcgctcagcgacgggcccccgggcgcgggcagc ggcgtgccctatttctacctgagcccgctgcagctctccgtgagcaacctgcaggagaat ccatatgctacactgaccatgactttggcacagaccaacttctgcaagaaacatggattt gatccacaaagtcccctttgtgttcacataatgctgtcaggaactgtgaccaaggtgaat gaaacagaaatggatattgcaaagcattcgttattcattcgacaccctgagatgaaaacc tggccttccagccataattggttctttgctaagttgaatataaccaatatctgggtcctg gactactttggtggaccaaaaatcgtgacaccagaagaatattataatgtcacagttcag tga >gi568815597r:167446100_167653741|GENSCAN_predicted_peptide_4|221_aa MAERDQHRVQAVASEGESPKPWQLPCGIEFAVNKSHGSDGFISGFCFCFFLSSLAAIMAV AASVLAAVVIITSSPHTASSFSLPLETACFYMASIMLMGTAQDMLFWENGSQISLGDLSK GPGKRGHHPSLQLHDHLEFNGLKTSFGGRGIYSTLIQDTSGQSTHNLEHSNLGLQMSHFH GELLHRILQLCTGLQPPGIIETGPPLLPNITANICTFSSCC >gi568815597r:167446100_167653741|GENSCAN_predicted_CDS_4|666_bp atggctgaaagggaccaacatagagttcaggctgtggcttcagagggtgaaagccctaag ccttggcagcttccatgtggtattgagtttgcagtgaataagtctcatggatctgatggg tttatcagcggtttctgcttttgcttcttcctctcctctcttgccgccatcatggctgtg gctgccagtgtgcttgctgctgtggtcatcatcacttcatctcctcatactgccagcagc ttcagcctccccctggagacagcctgcttctacatggcttctataatgcttatggggaca gcacaggatatgctgttttgggaaaatggtagtcagatctcccttggagacctatctaag ggtcctggtaaaaggggccatcatccgagtctccagttgcatgaccacttggagtttaat ggcctgaagaccagctttggtgggagagggatctactctacactgatccaagatacaagt ggacagagcacccacaatctggaacacagcaacctaggccttcagatgagccactttcat ggggaactgttgcaccggatactgcagctctgcactggcctacaaccaccgggcatcatc gaaactggacctccactattgccaaacattactgccaacatatgcactttcagttcatgt tgttga >gi568815597r:167446100_167653741|GENSCAN_predicted_peptide_5|133_aa MRLSLGLSLSMASALRPQLVCGSLWWGAPAAGAVPQPSAAGRAPPPTRPVPAAARNWPRP GAEGTAGIVSSGPGERVRLPQSRHLKDMEVKDPGGRRGAERALSQAQGVDASREIGSSEL LGHRPKPGVVLAS >gi568815597r:167446100_167653741|GENSCAN_predicted_CDS_5|399_bp atgaggctctctctgggcctgtctctctccatggccagtgccctcaggccccagcttgtc tgtggatctctgtggtggggggcccccgcggccggggcagtcccgcagccgagcgcagcc gggcgcgcgccaccgcccactcgccctgtgcccgccgcagcccgaaactggccacggccg ggagcggaggggacagcggggatcgtgagctccggcccgggcgagcgggtgcgtctgccg cagagtcggcacctgaaggacatggaggtaaaggaccccggagggagacgcggggctgag cgagctctttctcaagctcagggagtagatgcctcaagagaaataggatcctcggaactt ctggggcatcgcccaaaacctggggtagtgttagccagn