GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:32:53 Sequence gi568815583f:48021045_48242348 : 221304 bp : 38.00% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1938 1977 40 -2.95 1.01 Init + 7270 7384 115 0 1 69 68 113 0.056 7.82 1.02 Intr + 7502 7957 456 0 0 105 80 104 0.045 3.56 1.03 Intr + 10407 10590 184 1 1 35 90 89 0.050 1.72 1.04 Intr + 14486 14498 13 2 1 95 94 18 0.052 -3.03 1.05 Intr + 16407 16451 45 2 0 62 72 84 0.027 1.89 1.06 Intr + 20914 21027 114 0 0 31 84 85 0.062 2.12 1.07 Term + 31986 32111 126 2 0 123 37 77 0.209 3.40 1.08 PlyA + 32385 32390 6 1.05 2.00 Prom + 34858 34897 40 -4.25 2.01 Init + 35035 35121 87 0 0 83 51 55 0.561 1.99 2.02 Intr + 38665 38760 96 0 0 84 60 118 0.867 7.89 2.03 Term + 42023 42142 120 1 0 105 47 76 0.681 2.69 2.04 PlyA + 43478 43483 6 1.05 3.00 Prom + 44389 44428 40 -4.95 3.01 Init + 45911 46046 136 1 1 84 98 25 0.301 3.49 3.02 Intr + 52710 52850 141 1 0 55 97 40 0.166 1.00 3.03 Term + 52954 53105 152 2 2 106 48 74 0.180 2.29 3.04 PlyA + 55052 55057 6 1.05 4.08 PlyA - 55362 55357 6 1.05 4.07 Term - 64712 64577 136 1 1 1 54 143 0.069 -1.29 4.06 Intr - 66970 66884 87 0 0 107 80 54 0.292 4.67 4.05 Intr - 69043 68940 104 1 2 109 13 68 0.216 -0.55 4.04 Intr - 72171 71989 183 0 0 34 41 164 0.201 5.36 4.03 Intr - 75516 75352 165 0 0 18 95 81 0.261 1.04 4.02 Intr - 75664 75548 117 1 0 37 99 114 0.785 7.14 4.01 Init - 76822 76703 120 1 0 61 63 38 0.322 -1.16 4.00 Prom - 81384 81345 40 -5.05 5.00 Prom + 86589 86628 40 -8.15 5.01 Init + 90798 90886 89 2 2 50 80 136 0.462 9.16 5.02 Intr + 96608 96723 116 2 2 73 84 54 0.262 2.67 5.03 Intr + 99972 100121 150 1 0 61 96 32 0.466 0.51 5.04 Intr + 100813 100992 180 2 0 112 101 62 0.988 8.82 5.05 Intr + 105418 105557 140 2 2 4 80 144 0.128 4.46 5.06 Intr + 105620 105746 127 1 1 33 77 129 0.646 5.63 5.07 Intr + 115761 115919 159 1 0 36 84 159 0.179 9.34 5.08 Term + 120069 120181 113 1 2 61 42 60 0.434 -3.56 5.09 PlyA + 120774 120779 6 1.05 6.13 PlyA - 121064 121059 6 1.05 6.12 Term - 122027 121864 164 0 2 108 28 151 0.944 8.22 6.11 Intr - 128039 127988 52 2 1 90 103 23 0.919 1.56 6.10 Intr - 128327 128119 209 0 2 54 75 232 0.987 16.37 6.09 Intr - 130527 130429 99 1 0 45 50 113 0.817 2.36 6.08 Intr - 130898 130830 69 0 0 90 92 57 0.955 4.54 6.07 Intr - 132849 132748 102 1 0 104 89 44 0.944 5.33 6.06 Intr - 137012 136949 64 2 1 63 113 77 0.854 5.07 6.05 Intr - 138760 138569 192 1 0 35 101 145 0.985 9.27 6.04 Intr - 146357 146305 53 0 2 39 110 45 0.938 -0.49 6.03 Intr - 147795 147587 209 2 2 84 88 185 0.994 15.90 6.02 Intr - 150148 150025 124 2 1 55 96 87 0.964 4.92 6.01 Init - 157193 157001 193 2 1 97 59 190 0.999 14.48 6.00 Prom - 168280 168241 40 -6.15 7.04 PlyA - 168762 168757 6 1.05 7.03 Term - 170919 170734 186 0 0 90 43 117 0.449 3.91 7.02 Intr - 171140 171004 137 0 2 111 5 79 0.614 1.27 7.01 Init - 174585 174309 277 0 1 56 94 130 0.394 7.59 7.00 Prom - 178150 178111 40 -6.05 8.00 Prom + 179677 179716 40 -5.35 8.01 Init + 186676 187095 420 0 0 47 121 417 0.999 37.33 8.02 Intr + 199590 199721 132 2 0 41 85 231 0.607 18.02 8.03 Intr + 199877 199952 76 1 1 112 99 81 0.935 9.77 8.04 Intr + 205432 205527 96 2 0 82 83 46 0.489 2.56 8.05 Intr + 206035 206130 96 2 0 53 99 109 0.927 7.56 8.06 Intr + 208145 208284 140 0 2 106 86 37 0.984 4.56 8.07 Intr + 209349 209459 111 2 0 73 94 94 0.992 8.16 8.08 Intr + 211683 211794 112 2 1 101 83 60 0.993 5.83 8.09 Intr + 213833 213960 128 0 2 76 94 59 0.511 4.78 8.10 Intr + 220471 220555 85 0 1 62 97 56 0.027 2.37 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:48021045_48242348|GENSCAN_predicted_peptide_1|350_aa MMVTNQMTSYYSPEKVKEQREFSIRQTSRLCVSSYEEAGREKLRAERRCFTFKHSLFQFE FDKWALLIILHLNPDSFSALLSLCKVHCFMPHQPWGVCSILTIVTTQYTPHILSHESQNL KSHQMPSCRFPIDLHKATVAMLSRSWCPKSPWLLPLWAPGRISCHPALLGLLPIGTPFRS ALAVTGEGAMEIHRLAAVITVLCQGLQKISSLDGHQGNAAYLELFDLFSGLDIDFSFSGI AKAREHEGGRESHFEKSNNLSDREDKLIKPQKVSGTQDKLVAQSPRNCPYRKEDRYVYKE NSAEHSYSKVLNRSDEAHTYSGRPPVILSPPIQMLISYRNTHTDTPRNYV >gi568815583f:48021045_48242348|GENSCAN_predicted_CDS_1|1053_bp atgatggtcacgaaccagatgacctcctattattcgccagaaaaagtgaaagagcagaga gaattttcaataaggcaaaccagcaggctttgtgtcagctcctatgaggaagcaggtaga gaaaagctacgagcagaaagaagatgttttacattcaaacacagtctttttcaatttgag tttgataaatgggctttgctaattatcctccatttgaatccagattcattttctgctctt ctcagcttgtgtaaggtgcattgcttcatgccacatcagccctggggtgtctgtagcatc ctaacaatagtgacaacacagtacaccccccacatactctcccatgaatcacagaacctc aaatcccaccaaatgccctcctgcaggtttcccatagatttgcacaaggccacagtagcc atgctttcccgctcctggtgcccaaaatctccctggctgcttccactgtgggctcctggg aggatcagctgccaccctgctcttcttggcctcctgcccattggcacccctttccgttct gctctggctgtcactggggaaggagcaatggaaatccacaggctggctgcagtcattaca gttctttgccagggcctgcagaagatatcttctttagatgggcaccagggaaatgctgcc tatctggagctgtttgatttgttctctgggttagatattgatttttctttcagtggaata gcaaaggcaagagagcatgaaggagggagagagagccattttgaaaagtccaataaccta agtgatcgggaagacaagctgatcaagccacagaaagtgagtggcacacaagacaagtta gttgcccagtccccaaggaactgcccctatcgcaaagaggacagatatgtctacaaagaa aattcagcagagcattcatacagcaaggtcctcaatagatcggatgaagcccacacatac tctggaaggccacctgttatactcagtccaccaattcaaatgctaatctcttacagaaac acccacacagacacacccagaaattatgtttaa >gi568815583f:48021045_48242348|GENSCAN_predicted_peptide_2|100_aa MDEAGNHHSQKTNTRTENQTPHILSHKWERVNVDHWPVSSLISCPSECELIFSPEKAKSE RHFSVLELSASQMSNGINSKFSIFKGKNNNDDKKKQQSLL >gi568815583f:48021045_48242348|GENSCAN_predicted_CDS_2|303_bp atggatgaagctggaaatcatcattctcagaaaactaacacaagaacagaaaaccaaaca ccacacattctcagtcataagtgggagcgcgtcaatgtggaccactggcctgtgtcatcc ctgatctcctgcccatcagagtgtgagcttatattttcaccagagaaagccaaaagtgaa aggcacttcagtgttttggagctctcagcatctcaaatgtccaatgggatcaattcgaag ttttctatttttaaaggaaaaaacaacaacgacgacaagaagaaacaacaaagcttgtta tga >gi568815583f:48021045_48242348|GENSCAN_predicted_peptide_3|142_aa MVERAKWQQSHSGELGQTQGKSLNKTWGEEGLPCDAGPVGIRGCEGRAPSHSQGICPHGP NNFHHTPPPTLGIIFPHEIWREETFKLHQSPLASEPGEKAELPPRSTFASDVLSASCGLS AYTTPTETSQHILGKSREHGLY >gi568815583f:48021045_48242348|GENSCAN_predicted_CDS_3|429_bp atggtggagagagcaaagtggcaacagagccattcgggggagctgggccaaacacaggga aagagcttgaacaagacgtggggggaggagggtctgccatgtgatgcaggcccagtaggt atcagaggttgtgaggggagagcaccaagccattcacaagggatctgcccccatggccca aacaacttccaccacaccccacctccaacactggggatcatatttccacatgagatttgg agggaagaaacattcaaactacatcaaagcccactagcctccgaaccaggtgagaaggca gaactgcctcccaggtctacatttgcctctgatgttttaagcgcctcatgtgggctgtca gcttacacaactcccacggaaacttctcagcacattttgggaaaatccagggagcacggt ctatactga >gi568815583f:48021045_48242348|GENSCAN_predicted_peptide_4|303_aa MYFDNFRSDIEPISCVTVIYKFTANSRLHLQYLEGENTFKWNKRRWQYRKRNSTSEGLEE GQIPACLHNGQCKGMVKVQKKPGPCMKGALYETWQGCEEERNVWYANQEAKEVPKNGKEG HWCGPHLTSFESVKCITRMPVLEKSSVGPDAVKVGAGQPTPAVGEMLLPRNRLPRPRTKA DVKYESMINAFIKKQVSGAFLWVDSGQEAHSEITSHGSLLALSVHSGQRGFMTGPYAMEL TVLGVESGCVNMPMPLSLGFLLSIQLMRQNSTCRTSGIPTCNIPEALSVQALFVLWILEG ASI >gi568815583f:48021045_48242348|GENSCAN_predicted_CDS_4|912_bp atgtattttgataacttcaggagtgatattgaacctatatcctgtgttacggttatttac aagtttacagccaattctagactccacctccagtacttagaaggagagaacacttttaag tggaacaagagaagatggcagtataggaagaggaatagcacgagtgaaggactggaagag ggacagatccctgcgtgtttgcacaatggacagtgtaagggtatggtgaaagtgcagaag aaacctgggccttgcatgaaaggggccttgtatgaaacgtggcagggttgtgaagaggag agaaatgtgtggtatgcaaaccaggaggccaaagaagtgccaaagaatggcaaggagggc cattggtgtggaccgcatctcacatcatttgaatcagtaaagtgcatcacgaggatgcct gttttggagaagagctctgtggggccagatgcagtaaaagtgggtgccggacagcctact ccagcagtgggagagatgctgctgcccaggaacaggctccccagacccagaacaaaggct gatgtgaaatatgaaagcatgatcaatgcattcatcaagaaacaggtttcaggtgccttc ctgtgggtggactctgggcaggaggcacattctgaaatcacatcacatgggtcattactg gccctatccgttcacagtggccagcgggggttcatgacaggcccatatgccatggagtta actgtcctgggagttgagagtgggtgcgtaaatatgccaatgcctttgtcccttggcttt ttgctgagcatccagctaatgcgtcagaactccacttgtagaacttctggcatccccacc tgcaatattccagaggccttgtccgttcaggcactttttgttctatggatcctggaaggc gcttccatctga >gi568815583f:48021045_48242348|GENSCAN_predicted_peptide_5|357_aa MKAPSRADAHTDLLFQDPEGKLPMQQQQLRCSLLSALKHWTPSSSALGLGLASLLLSLQM ACCGTLRSSARCSKSTAEMQTKGGQTWARRALLLGILWATAHLPLSGTSLPQRLPRATGN STQCVISPSSEFPEGFFTRQERRDGGIIIYFLIIVYMFMAISIVCDEYFLPSLEIISEFV QGISRLEESLGFWAGKQNGDSGKRVKDVIRRGALSKPKLELKELENVTRMDTLRGVYERQ EERQRHSERGIQMTKEKEGAFVVSDHSSMERSEQQPLMGWEDEGQPFIRRQSRTDSGIFY EDSGYSQLSISLHGLSQVSEGETLEIPDTVMGLTLLAAGTSIPDTIASVLVARKGKN >gi568815583f:48021045_48242348|GENSCAN_predicted_CDS_5|1074_bp atgaaagctcctagtcgagctgatgcccatactgacctcctgttccaggatcctgaggga aagctgcccatgcaacagcagcaactgaggtgctcgctgctttctgccctcaaacattgg actccaagttcttcagctttgggacttggactggcttccttgctcctcagcttgcagatg gcctgttgtgggaccttgcgatcatctgcacgctgcagtaagagcacagcagaaatgcag acaaaagggggccaaacatgggcgagaagggctctgttgctcggcatcctgtgggccact gcacatctgcctctctcagggacctccctgccccaacgtctcccaagggccacaggaaat agcacccaatgtgttatttctccatcatcggagtttcccgaagggtttttcacgagacag gagcgcagagatggaggcatcataatctatttcctaattatcgtttacatgttcatggcc atatctattgtctgtgatgaatacttcctaccctccctggaaatcatcagtgaatttgtc cagggcatttcacgtttggaggaaagccttggcttctgggctgggaaacagaatggagat tctgggaaaagagttaaagatgtgatcagaagaggagcattatccaagcccaagttagag ctaaaggagctagagaatgttacccggatggataccctgaggggtgtgtatgagagacag gaagagagacagagacacagtgagagaggaatccaaatgactaaggaaaaggagggagcc tttgtcgtctctgatcattcatctatggagagaagtgaacaacagccactgatgggctgg gaagatgaaggtcaaccattcattcgtcggcaatcaagaactgatagtggaatattttat gaagattctggctactctcagctctctataagtttacatggccttagtcaggtttctgaa ggggaaacactagaaattcccgatacagtaatgggccttactttattagcagcaggaaca agcataccagacacaattgcaagtgtgttggttgcaagaaaaggtaagaactag >gi568815583f:48021045_48242348|GENSCAN_predicted_peptide_6|509_aa MADANKAEVPGATGGDSPHLQPAEPPGEPRREPHPAEAEKQQPQHSSSSNGVKMYCLFLR EPGRGPLGIEKKGLFFYLRYSETREKAFVPFGCLLPFSKVSLCLDRENDESAKEEKSDLK EKSTGSKKANRFHPYSKDKNSGAGEKKGPNRNRVFISNIPYDMKWQAIKDLMREKVGEVT YVELFKDAEGKSRDPDGENARRALQRTGGSFPGGHVPDMGSGLMNLPPSILNNPNIPPEV ISNLQAGRLGSTIFVANDDKSVPHEEYRSHDGKTPQLPRGLGGIGMGLGPGGQPISASQL NIGGVMGNLGPGGIGFGGLEAMNSMGGFGGVGRMGELYRGAMTSSMERDFGRGDIGINRG FGDSFGRLGGGMGSMNSVTGGMGMGLDRMSSSFDRMGPGIGAILERSIDMDRGFLSGPMG SGMRERIGSKGNQIFVRNLPFDLTWQKLKEKFSQCGHVMFAEIKMENGKSKGCGTVRFDS PESAEKACRIMNGIKISGREIDVRLDRNA >gi568815583f:48021045_48242348|GENSCAN_predicted_CDS_6|1530_bp atggcggacgccaacaaggccgaggtgcccggggccactggtggcgacagcccgcacctg cagcccgcagagccgccgggcgagccgcggcgagagccgcaccccgcggaggcggagaag cagcagccgcagcacagcagcagctccaatggcgttaaaatgtattgtcttttcctccgg gaaccggggcgaggccctttgggaattgaaaagaaaggtttgttcttctatctgcggtac tcagaaaccagagaaaaggcctttgtgccttttggctgcttgctgccattttcaaaagtg tcactctgtttagacagggagaatgatgaatcagcaaaagaagagaaatctgacttaaag gaaaaatctacaggaagtaagaaggccaatagatttcatccttattcaaaagacaagaat tcgggcgctggagaaaagaagggtccaaatcgtaacagagttttcattagcaacatccca tatgacatgaaatggcaagctattaaagatctaatgagagagaaagttggtgaggttaca tacgtggagctctttaaggatgcggaaggaaaatcaagggatcctgatggagaaaatgct cgtagggcattgcagcgaacaggaggatcatttccaggaggacacgtccctgatatggga tcagggttgatgaatttaccaccttccatactcaataatccaaacattcctcctgaagtc atcagtaatttgcaggccggtagacttggttccacaatttttgttgccaatgatgacaag tctgttcctcatgaagagtaccgttcacatgatggtaaaacaccacaattaccacgtggt cttggaggcattgggatgggacttggtccgggtggacagcctattagtgccagccagttg aacataggtggagtaatgggaaatttaggtccaggtggaatagggtttggtggtctggaa gcaatgaatagcatgggaggatttggaggagttggccgaatgggagagctgtaccgtggt gcgatgactagtagcatggagcgagattttggacgtggtgatattggaataaatcgaggc tttggagattcctttggtagacttggtggtggaatgggtagcatgaacagtgtgactgga ggaatggggatgggactggaccggatgagttccagctttgatagaatgggaccaggtata ggagctatactggaaaggagcatcgatatggatcgaggatttttatcgggtccaatggga agcggaatgagagagagaataggctccaaaggcaaccagatatttgtcagaaatctacct tttgacttgacttggcagaaactaaaagagaaattcagtcagtgtggtcatgtaatgttt gcagaaataaaaatggagaatggaaagtcaaaaggctgtggaacagtcagatttgactcc ccagaatcagctgaaaaagcctgcagaataatgaatggcataaaaatcagtggcagagaa attgatgttcgcttggatcgtaatgcataa >gi568815583f:48021045_48242348|GENSCAN_predicted_peptide_7|199_aa MIQICEEEKSMTINMKAEKYWIFSEHSEQFGTGKGEYGKQSVGKVGWSQIVKYVDCQAKS LKSICLKAEWLDDETEIDIILRERASRERNGSGLFLRTKPKKHGNFLQNIMGEASAICTY TTLFPRLAAASAVALALGKYKRMRTCFSSGPAFGTLSVNAEGRGRKLLRRQRCLTFHACC HGDEVQPVDLSVGTFVCVR >gi568815583f:48021045_48242348|GENSCAN_predicted_CDS_7|600_bp atgattcaaatctgtgaagaagaaaaaagcatgacaataaatatgaaggcagaaaaatac tggattttttcagagcacagtgagcaatttggtacaggcaagggggagtatgggaaacaa agcgttggaaaggtaggttggagccaaattgtgaaatatgttgactgtcaggctaagagt ttgaaatctatttgcttaaaggctgagtggcttgatgatgagactgaaatagacataatt ttgagggagagagcatcacgggaaagaaatggttcaggtctctttctacgcactaaaccg aagaagcatggaaacttcctacaaaacattatgggagaagcatcagcaatctgcacgtac acaacacttttcccccgcctggcagccgcatcagcagtggcgctagcattagggaaatac aagagaatgaggacatgcttctcatctggacctgcatttggaacgctgtcagttaatgcg gagggaagggggagaaagttacttcgcagacagcgatgtctcaccttccatgcttgttgc catggagatgaagtccagccagtagacttaagtgttggcacgtttgtctgtgttcgctga >gi568815583f:48021045_48242348|GENSCAN_predicted_peptide_8|466_aa MSLNNSSNVFLDSVPSNTNRFQVSVINENHESSAAADDNTDPPHYEETSFGDEAQKRLRI SFRPGNQECYDNFLQSGETAKTDASFHAYDSHTNTYYLQTFGHNTMDAVPKIEYYRNTGS ISGPKVNRPSLLEIHEQLAKNVAVTPSSADRVANGDGIPGDEQAENKEDDQAGVVKFGWV KGVLVRCMLNIWGVMLFIRLSWIVGEAGIGLGVLIILLSTMVTSITGLSTSAIATNGFVR GGLGVIIIGLSVVVTTLTGISMSAICTNGVVRGGGAYYLISRSLGPEFGGSIGLIFAFAN AVAVAMYVVGFAETVVDLLKESDSMMVDPTNDIRIIGSITVVILLGISVAGMEWEAKAQV ILLVILLIAIANFFIGTVIPSNNEKKSRGFFNYQASIFAENFGPRFTKGEGFFSVFAIFF PAATGILAGANISGDLEDPQDAIPRGTMLAIFITTVAYLGVAICVX >gi568815583f:48021045_48242348|GENSCAN_predicted_CDS_8|1398_bp atgtcactgaacaactcttccaatgtatttctggattcagtgcccagtaataccaatcgc tttcaagttagtgtcataaatgagaaccatgagagcagtgcagctgcagatgacaatact gacccaccacattatgaagaaacctcttttggggatgaagctcagaaaagactcagaatc agctttaggcctgggaatcaggagtgctatgacaatttcctccaaagtggagaaactgct aaaacagatgccagttttcacgcttatgattctcacacaaacacatactatctacaaact tttggccacaacaccatggatgccgttcccaagatagagtactatcgtaacaccggcagc atcagtgggcccaaggtcaaccgacccagcctgcttgagattcacgagcaactcgcaaag aatgtggcagtcaccccaagttcagctgacagagttgctaacggtgatgggatacctgga gatgaacaagctgaaaataaggaagatgatcaagctggtgttgtgaagtttggatgggtg aaaggtgtgctggtaagatgcatgctgaacatctggggagtcatgctcttcattcgcctc tcctggattgttggagaagctggaattggtcttggagttctcataattcttctttccacc atggtaacttctattactgggttgtcaacttctgcgatagcaactaacgggtttgttcgt ggaggtcttggagtcatcatcattggcctaagtgtggtagtaacgacactcacaggtatt tctatgtctgctatttgcacgaatggagtagtaagaggaggtggggcctactatcttatt tccagaagtttagggcccgagttcggtgggtcaataggcctgatctttgcttttgctaat gcagtggctgttgctatgtatgtggtgggatttgctgagactgtagtagatcttcttaag gagagtgattcgatgatggtggatccaaccaatgacatccggattataggctccatcaca gtggtgattcttctaggaatttcagtagctggaatggaatgggaggcaaaggcccaagtc attcttctggtcattcttctaattgctattgcaaacttcttcattggaactgtcattcca tccaacaatgagaaaaagtccagaggtttctttaattaccaagcatcaatatttgcagaa aactttgggccacgcttcacaaagggtgaaggcttcttctctgtctttgccatttttttc ccagcagctactgggattcttgctggtgccaatatctcaggagatttggaggatccccaa gatgccatccccagaggaaccatgctggccattttcatcaccactgttgcctacttaggg gttgcaatttgtgtagnn