GENSCAN 1.0	Date run: 4-Nov-116	Time: 12:41:06

Sequence gi568815593f:176922739_177164596 : 241858 bp : 46.47% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 PlyA -   2739   2734    6                               1.05
 1.08 Term -  17265  17078  188  1  2   93   42   107 0.960   4.25
 1.07 Intr -  20750  20597  154  2  1  103  103   117 0.975  14.35
 1.06 Intr -  42171  42022  150  0  0   72   68    79 0.819   4.66
 1.05 Intr -  46517  45817  701  0  2   44  110   322 0.618  21.20
 1.04 Intr -  48128  47982  147  0  0  112  -23   111 0.529   2.61
 1.03 Intr -  52742  52658   85  2  1   97   60    32 0.919   0.69
 1.02 Intr -  59885  59731  155  0  2   58   88   162 0.546  12.99
 1.01 Init -  70278  70230   49  0  1   86   58    53 0.753   1.22
 1.00 Prom -  71955  71916   40                               0.74

 2.03 PlyA -  76119  76114    6                               1.05
 2.02 Term -  83217  83011  207  0  0   85   48   110 0.599   4.04
 2.01 Init -  84091  83801  291  1  0   52  102   130 0.739   5.95
 2.00 Prom -  92144  92105   40                              -5.76

 3.00 Prom +  97185  97224   40                              -4.16
 3.01 Init + 100001 100159  159  1  0   95   80   203 0.973  18.98
 3.02 Intr + 100422 100496   75  2  0   90  119    50 0.969   8.01
 3.03 Term + 100969 101127  159  0  0  119   42    91 0.899   5.54
 3.04 PlyA + 108019 108024    6                               1.05

 4.00 Prom + 108115 108154   40                              -5.36
 4.01 Init + 117142 117211   70  0  1   74   98    36 0.919   2.41
 4.02 Intr + 118388 118491  104  0  2  105   85    80 0.993   9.29
 4.03 Intr + 119040 119132   93  2  0   88   90    55 0.980   5.86
 4.04 Intr + 121582 121795  214  0  1   51   79   210 0.802  14.69
 4.05 Intr + 127965 128198  234  1  0   84   94   184 0.605  16.26
 4.06 Intr + 139320 139413   94  1  1   65  100   101 0.706   8.02
 4.07 Intr + 142768 142871  104  1  2   91   51    73 0.165   3.72
 4.08 Intr + 166812 166955  144  1  0   64   52   141 0.164   8.35
 4.09 Intr + 167652 167915  264  1  0  124   83   181 0.921  18.68
 4.10 Intr + 168007 168087   81  2  0   46   86    54 0.576   0.61
 4.11 Intr + 168200 168366  167  0  2   52  100   268 0.941  24.08
 4.12 Intr + 168947 169070  124  1  1   77   80   178 0.963  16.06
 4.13 Intr + 169583 169773  191  0  2   77   93   398 0.999  38.50
 4.14 Intr + 169908 170046  139  2  1   71   99   219 0.778  21.34
 4.15 Intr + 170400 170593  194  1  2   -7   84   228 0.846  12.11
 4.16 Intr + 170668 170813  146  0  2   20   85    63 0.972  -1.72
 4.17 Intr + 170916 171037  122  0  2   91  113   133 0.875  16.24
 4.18 Intr + 172592 172702  111  0  0   96   81   228 0.776  23.25
 4.19 Intr + 172795 172985  191  2  2  125   87   249 0.988  27.80
 4.20 Intr + 173319 173441  123  2  0   76   64   218 0.841  18.98
 4.21 Term + 173549 173623   75  1  0  120   47    81 0.999   5.04
 4.22 PlyA + 173652 173657    6                              -5.32

 5.06 PlyA - 173726 173721    6                              -8.24
 5.05 Term - 173953 173809  145  0  1   54   48    83 0.696  -1.72
 5.04 Intr - 174400 174105  296  1  2  -44    5   903 0.682  64.61
 5.03 Intr - 175336 175267   70  0  1  105   48    96 0.604   6.48
 5.02 Intr - 194341 194180  162  0  0   60   40   162 0.133   7.69
 5.01 Init - 209655 209339  317  0  2   31  -38   345 0.026  11.01
 5.00 Prom - 212208 212169   40                             -11.63

 6.00 Prom + 212281 212320   40                              -8.66
 6.01 Init + 212366 213292  927  1  0   81   52   555 0.661  45.09

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 156644 156766  123  1  0   97   48    75 0.820   2.78
S.002 Sngl - 209655 209122  534  0  0   31   42   332 0.967  16.97

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:176922739_177164596|GENSCAN_predicted_peptide_1|542_aa
MGFRHVGHAGLQLLISDKRMPRRKKKVKEVSESRNLEKKDVETTSSVSVKRKRRLEDAFI
VISDSDGEEPKEENGLQKTKTKQSNRAKCLAKRKIAQMTEEEQFALALKMSEQEAREVNS
QEEEEEELLRKAIAESLNVNMPCCKSSHISQGNEAEEREEPWDHTEKTEEEPVSGSSGSW
DQSSQPVFENVNVKSFDRCTGHSAEHTQCGKPQESTGRGSAFLKAVQGSGDTSRHCLPTL
ADAKGLQDTGGTVNYFWGIPFCPDGVDPNQYTKVILCQLEVYQKSLKMAQRQLLNKKGFG
EPVLPRPPSLIQNECGQGEQASEKNECISEDMGDEDKEERQESRASDWHSKTKDFQESSI
KSLKEKLLLEEEPTTSHGQERIYGPFGVSSPVSCGLMEVTCQIGCKPTANGNGRGEGQSS
VKADKADNLVGNKEDAEKEVAISTFSSSNQVSCPLCDQCFPPTKIERHAMYCNGLMEEDT
GSVTPSNAVQLDLHSGYRRLSDMIDVNVLFDIIESPCILGIICLYSFDANTPDLHVKMSA
IS

>gi568815593f:176922739_177164596|GENSCAN_predicted_CDS_1|1629_bp
atggggtttcgccatgttggccatgctggtctccaactcctgatctcagacaaaaggatg
ccacggagaaagaaaaaagttaaagaagtctccgaatctcggaacctggagaagaaggat
gtggaaactaccagttctgtcagtgtgaagaggaagcgtagacttgaggatgcattcatt
gtgatatccgatagtgatggagaggaaccaaaggaggaaaatgggttgcagaaaacgaag
acaaaacagtcgaatagagcaaagtgtttggccaaaagaaaaatcgcacagatgacagaa
gaagaacagtttgctctggctctcaaaatgagtgagcaggaagctagggaggtgaacagc
caggaggaggaagaagaggagctcttgaggaaagccattgctgaaagcctgaatgtaaat
atgccatgttgcaaaagctcacatatcagtcagggaaacgaggctgaggaaagagaggag
ccttgggaccacactgaaaaaactgaagaggagccggtctctggcagctcaggaagctgg
gaccagtcaagccagccagtgtttgagaatgtgaacgttaaatcttttgacagatgtact
ggccactcggctgagcacacacagtgtgggaagccacaggaaagtactgggaggggttct
gcttttctcaaagctgtccagggtagcggggacacatctaggcactgtctacctacccta
gcagatgccaaaggtctccaggacactgggggcactgtgaactatttctggggtattcca
ttctgccctgatggagtagaccctaaccagtataccaaggtcattctctgccagttggag
gtttatcaaaagagcctgaaaatggctcagaggcagctccttaataaaaaaggttttggg
gaaccagtgttacctagacctccttctctgatccagaatgaatgtggccaaggagagcag
gctagtgagaaaaatgaatgcatctcagaagatatgggagatgaagacaaagaggagagg
caggagtctagggcatctgactggcactcaaaaaccaaggatttccaggaaagctcaatt aaaagcttgaaagagaaacttttgttggaggaagaaccaacaaccagtcatggtcaggaa aggatttatggcccatttggtgtcagtagcccagtcagctgtggcttgatggaggttact tgtcaaattggctgtaagcccactgctaatgggaatgggaggggtgagggacagtcctca gtgaaagcagacaaggcagacaacctagttggtaacaaggaagatgctgagaaggaagta gctatttctaccttctcatccagtaaccaggtatcctgcccgctatgtgaccaatgcttt ccacccacaaagattgaacgacatgccatgtactgcaatggtctgatggaggaagataca ggatcagtgaccccgtctaatgctgttcagttggatttacactctggatatcgcagactt tcagacatgatagatgtaaatgtgctttttgatataatagaaagtccctgtattctcggc atcatatgtctttatagctttgatgcgaataccccagacctccatgtgaagatgtctgct atatcctga >gi568815593f:176922739_177164596|GENSCAN_predicted_peptide_2|165_aa MAPPRRGLVKSPRAAPAPAPRACLWHRTRAPLPLPPRSQRVAARDPRQPGEGLQSGLRQR VSTRRLRTAPPTPPCLLFFLSGSAARYSPGGALPDAKGRRVSRCLGSAAGFSSATAGGLV PCRSVNGARVPGSSGAGFESQQLDSSVLSFNHYIAPRYRNDTRKK >gi568815593f:176922739_177164596|GENSCAN_predicted_CDS_2|498_bp atggctcccccgcggcgggggttggttaagtctccgcgcgctgcgcctgcgcccgccccg agagcgtgtctctggcatcggacgcgcgcgcccctccccctccccccgcgctcccaacgt gtggcggctcgcgacccccggcaacccggagaaggtctacagagcggcctgcgccagcga gtgagtacccgccgcctgcgcacagctccgcccacccctccctgcctccttttcttcctc agcgggtccgcggcccgctactctccgggaggggcgcttcccgacgccaagggccggcgc gtttccaggtgcttaggcagcgccgcaggcttctcgagcgccacagccggcgggctggtg ccctgccgctcagttaacggggcgcgagtcccgggcagtagtggagctggatttgaatcc cagcagctggactccagtgttctttcatttaatcactacatcgcaccgagatatagaaat gacacccggaaaaagtga >gi568815593f:176922739_177164596|GENSCAN_predicted_peptide_3|130_aa MEYPAPATVQAADGGAAGPYSSSELLEGQEPDGVRFDRERARRLWEAVSGAQPRSFSHIR GLSSLYSSCRRPTVFAIPSQLLDANPDVQELYLGFSANLVRYLEGNQISPVICLLAEQTC LRKFNMLFVN >gi568815593f:176922739_177164596|GENSCAN_predicted_CDS_3|393_bp atggagtatcccgcgccggccacggtgcaggccgcggacggcggagcggccgggccttac agcagctcggagttgctggagggccaggagccggacggggtgcgctttgaccgcgagagg gcgcgccgcctgtgggaagccgtgtccggtgcccagccgcgcagtttttcccacattcga ggactgtcatccctatactcgtcctgtcggagacctacagtctttgccatcccgagccag 
ctgcttgatgctaacccagatgtacaagaactctatctgggcttctctgctaaccttgtc aggtatctggagggaaatcagatttctccagtgatttgtttgcttgctgagcagacttgt cttcgaaagtttaatatgctatttgtgaactga >gi568815593f:176922739_177164596|GENSCAN_predicted_peptide_4|994_aa MIELMPWPGAVAHACNPSTLEGQVEHMIQKNQCLFTNTQCKVCCALLISESQKLAHYQSK KHANKVKRYLAIHGMETLKGETKKLDSDQWGTIEDFLVGQWYDQTCVLSSNQKSSRSKDK NQCCPICNMTFSSPVVAQSHYLGKTHAKNLKLKQQSTKVEALSKRLTNPFLVASTLALHQ NREMIDPDKFCSLCHATFNDPVMAQQHYVGKKHRKQETKLKLMARYGRLADPAVTDFPAG KGYPCKTCKIVLNSIEQYQAHVSGFKHKNHGPLAQFAVVLAFPIIHRVISGSQGKKERAD AGFYRRQLVGSPAWVPESCEKEMRLLLALLGVLLSVPGPPVLSLEASEEVELEPCLAPSL EQQEQELTVALGQPVRLCCGRAERGGHWYKEGSRLAPAGRVRGWRGRLEIASFLPEDAGR YLCLARGSMIVLQNLTLITGDSLTSSNDDEDPKSHRDPSNRHSYPQQAPYWTHPQRMEKK LHAVPAGNTVKFRCPAAGNPTPTIRWLKDGQAFHGENRIGGIRLRHQHWSLVMESVVPSD RGTYTCLVENAVGSIRYNYLLDVLERSPHRPILQAGLPANTTAVVGSDVELLCKVYSDAQ PHIQWLKHIVINGSSFGADGFPYVQVLKTADINSSEVEVLYLRNVSAEDAGEYTCLAGNS IGLSYQSAWLTVLPEEDPTWTAAAPEARYTDIILYASGSLALAVLLLLAGLYRGQALHGR HPRPPATVQKLSRFPLARQFSLESGSSGKSSSSLVRGVRLSSSGPALLAGLVSLDLPLDP LWEFPRDRLVLGKPLGEGCFGQVVRAEAFGMDPARPDQASTVAVKMLKDNASDKDLADLV SEMEVMKLIGRHKNIINLLGVCTQEGPLYVIVECAAKGNLREFLRARRPPGPDLSPDGPR SSEGPLSFPVLVSCAYQVARGMQYLESRKCIHRDLAARNVLVTEDNVMKIADFGLARGVH HIDYYKKTSNGRLPVKWMAPEALFDRVYTHQSDV >gi568815593f:176922739_177164596|GENSCAN_predicted_CDS_4|2985_bp atgatagaactcatgccttggccgggcgcggtggctcacgcctgtaatcccagcactttg gaaggccaagtggagcacatgatccagaagaaccaatgtctcttcaccaacacccagtgt aaggtttgctgcgccttgcttatttctgagtcccagaagctggcacattaccagagcaaa aaacatgccaacaaagtgaagagatacctagcaatccatggaatggagacattaaagggg gaaacgaagaagctagactcagatcagtggggaaccattgaagattttctggtggggcag tggtatgatcagacctgtgttctttcttccaaccagaagagcagcagaagcaaagacaag aaccagtgctgccccatctgtaacatgaccttttcctcccctgtcgtggcccagtcgcac tacctggggaagacccacgcaaagaacttaaagctgaagcagcagtccactaaggtggaa gctctgtcaaaacgccttacaaatcctttccttgtggcctccaccttagccttgcaccag aatagagagatgatagacccagacaagttctgcagcctctgccatgcaactttcaacgac cctgtcatggctcaacaacattatgtgggcaagaaacacagaaaacaggagaccaagctc 
aaactaatggcacgctatgggcggctggcggaccctgctgtcactgactttccagctgga aagggctacccctgcaaaacatgtaagatagtgctgaactccatagaacagtaccaagct catgtcagcggcttcaaacacaagaaccacggaccacttgcccagtttgctgtggtgcta gccttccccatcatccaccgggtgatttctgggtcccagggaaagaaagagagagctgat gcaggtttctacagaaggcagttggtgggaagtccagcttgggtccctgagagctgtgag aaggagatgcggctgctgctggccctgttgggggtcctgctgagtgtgcctgggcctcca gtcttgtccctggaggcctctgaggaagtggagcttgagccctgcctggctcccagcctg gagcagcaagagcaggagctgacagtagcccttgggcagcctgtgcgtctgtgctgtggg cgggctgagcgtggtggccactggtacaaggagggcagtcgcctggcacctgctggccgt gtacggggctggaggggccgcctagagattgccagcttcctacctgaggatgctggccgc tacctctgcctggcacgaggctccatgatcgtcctgcagaatctcaccttgattacaggt gactccttgacctccagcaacgatgatgaggaccccaagtcccatagggacccctcgaat aggcacagttacccccagcaagcaccctactggacacacccccagcgcatggagaagaaa ctgcatgcagtacctgcggggaacaccgtcaagttccgctgtccagctgcaggcaacccc acgcccaccatccgctggcttaaggatggacaggcctttcatggggagaaccgcattgga ggcattcggctgcgccatcagcactggagtctcgtgatggagagcgtggtgccctcggac cgcggcacatacacctgcctggtagagaacgctgtgggcagcatccgctataactacctg ctagatgtgctggagcggtccccgcaccggcccatcctgcaggccgggctcccggccaac accacagccgtggtgggcagcgacgtggagctgctgtgcaaggtgtacagcgatgcccag ccccacatccagtggctgaagcacatcgtcatcaacggcagcagcttcggagccgacggt ttcccctatgtgcaagtcctaaagactgcagacatcaatagctcagaggtggaggtcctg tacctgcggaacgtgtcagccgaggacgcaggcgagtacacctgcctcgcaggcaattcc atcggcctctcctaccagtctgcctggctcacggtgctgccagaggaggaccccacatgg accgcagcagcgcccgaggccaggtatacggacatcatcctgtacgcgtcgggctccctg gccttggctgtgctcctgctgctggccgggctgtatcgagggcaggcgctccacggccgg cacccccgcccgcccgccactgtgcagaagctctcccgcttccctctggcccgacagttc tccctggagtcaggctcttccggcaagtcaagctcatccctggtacgaggcgtgcgtctc tcctccagcggccccgccttgctcgccggcctcgtgagtctagatctacctctcgaccca ctatgggagttcccccgggacaggctggtgcttgggaagcccctaggcgagggctgcttt ggccaggtagtacgtgcagaggcctttggcatggaccctgcccggcctgaccaagccagc actgtggccgtcaagatgctcaaagacaacgcctctgacaaggacctggccgacctggtc tcggagatggaggtgatgaagctgatcggccgacacaagaacatcatcaacctgcttggt 
gtctgcacccaggaagggcccctgtacgtgatcgtggagtgcgccgccaagggaaacctg cgggagttcctgcgggcccggcgccccccaggccccgacctcagccccgacggtcctcgg agcagtgaggggccgctctccttcccagtcctggtctcctgcgcctaccaggtggcccga ggcatgcagtatctggagtcccggaagtgtatccaccgggacctggctgcccgcaatgtg ctggtgactgaggacaatgtgatgaagattgctgactttgggctggcccgcggcgtccac cacattgactactataagaaaaccagcaacggccgcctgcctgtgaagtggatggcgccc gaggccttgtttgaccgggtgtacacacaccagagtgacgtgtga >gi568815593f:176922739_177164596|GENSCAN_predicted_peptide_5|329_aa MCVRAGLGAGTGRAAPRTADAAVLGTQADGPLVPPPPRAGRLLRSPVPRGVRASPLRAGH LDPSGAAPGHGTARLAGTSPLAQPSAWGLTACILHRARSTESSTTASGFRSRPHAGRIPG RTPPIGPFGQVAATGQSAAAGAVGRAKGSAAVEPQKRSSLLPASLGSGANAPVKTLGQQQ QGEAEEEEEEEEEEEQEEEEEEKEEEEKEEEEEQEEKEEEEEEEEEEEEEEEEEEKEEEE EQEEKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEDEELFSENSSSTGMPGYGEPPRV KISHSRIPKDHLEVGPGLSVAQRPPDDRG >gi568815593f:176922739_177164596|GENSCAN_predicted_CDS_5|990_bp atgtgcgtgcgcgccgggctgggggccgggacgggacgggccgcgcctcgcaccgcggac gctgcagttctcggcacccaggcagatggccctctcgtgccaccgccgcccagagcgggc cgcctgctccgcagcccggttccccggggagtgcgcgcctcgcccctgagggccgggcat ctggaccccagcggagctgcgccgggacacgggaccgcccggctggccgggacaagcccg ctggcgcagccctcggcctggggcctcacggcctgcatcctgcaccgggcccgcagcacc gaatctagcacgaccgcgtcgggcttccgctcccggccacacgcgggccgcattcctggg agaacgcctcctattggtccgttcggtcaggtggctgccacgggccaatcagcggcggct ggtgccgtgggccgcgcgaagggctctgcggcggtggagcctcagaaaagatcatctttg cttccagcttctctgggctcaggggccaatgctcccgtcaagacgctggggcagcagcag cagggggaggctgaggaggaggaagaggaggaggaggaagaggagcaggaggaggaggag gaagagaaggaggaggaagagaaggaggaggaagaggagcaggaggagaaggaggaagag gaggaggaggaagaggaggaggaggaagaggaggaggaggaagagaaggaggaggaagag gagcaggaggagaaggaggaagaggaggaggaggaagaggaggaggaggaagaggaggag gaagaggaggaggaggaagaggaggaggaggaagaggaagaggaggaggaggacgaggag ttgttcagcgagaacagctcctccaccgggatgccaggatacggggagcccccgagggtg aagatctcccatagcaggatcccaaaagaccacctggaggtagggccagggctcagtgtg gctcagcgccctcccgacgaccggggctag >gi568815593f:176922739_177164596|GENSCAN_predicted_peptide_6|309_aa 
MDQTCELPRRNCLLPFSNPVNLDAPEDKDSPFGNGQSNFSEPLNGCTMQLSTVSGTSQNA YGQDSPSCYIPLRRLQDLASMINVEYLNGSADGSESFQDPEKSDSRAQTPIVCTSLSPGG PTALAMKQEPSCNNSPELQVKVTKTIKNGFLHFENFTCVDDADVDSEMDPEQPVTEDESI EEIFEETQTNATCNYETKSENGVKVAMGSEQDSTPESRHGAVKSPFLPLAPQTETQKNKQ RNEVDGSNEKAALLPAPFSLGDTNITIEEQLNSINLSFQDDPDSSTSTLGNMLELPGTSS SSTSQELPF >gi568815593f:176922739_177164596|GENSCAN_predicted_CDS_6|927_bp atggatcagacctgtgaactacccagaagaaattgtctgctgcccttttccaatccagtg aatttagatgcccctgaagacaaggacagccctttcggtaatggtcaatccaatttttct gagccacttaatgggtgtactatgcagttatcgactgtcagtggaacatcccaaaatgct tatggacaagattctccatcttgttacattccactgcggagactacaggatttggcctcc atgatcaatgtagagtatttaaatgggtctgctgatggatcagaatcctttcaagaccct gaaaaaagtgattcaagagctcagacgccaattgtttgcacttccttgagtcctggtggt cctacagcacttgctatgaaacaggaaccctcttgtaataactcccctgaactccaggta aaagtaacaaagactatcaagaatggctttctgcactttgagaattttacttgtgtggac gatgcagatgtagattctgaaatggacccagaacagccagtcacagaggatgagagtata gaggagatctttgaggaaactcagaccaatgccacctgcaattatgagactaaatcagag aatggtgtaaaagtggccatgggaagtgaacaagacagcacaccagagagtagacacggt gcagtcaaatcgccattcttgccattagctcctcagactgaaacacagaaaaataagcaa agaaatgaagtggacggcagcaatgaaaaagcagcccttctcccagcccccttttcacta ggagacacaaacattacaatagaagagcaattaaactcaataaatttatcttttcaggat gatccagattccagtaccagtacattaggaaacatgctagaattacctggaacttcatca tcatctacttcacaggaattgccattt