GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:30:42 Sequence gi568815586r:94871443_95103680 : 232238 bp : 41.74% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 274 269 6 1.05 1.01 Sngl - 1747 1436 312 1 0 73 47 260 0.639 14.09 1.00 Prom - 10145 10106 40 -6.55 2.00 Prom + 12744 12783 40 -7.35 2.01 Sngl + 12875 13363 489 1 0 22 38 246 0.934 8.81 2.02 PlyA + 13548 13553 6 1.05 3.00 Prom + 23280 23319 40 -4.35 3.01 Init + 30687 30845 159 2 0 60 82 138 0.049 10.31 3.02 Term + 42869 42931 63 1 0 112 39 42 0.021 -1.39 3.03 PlyA + 43256 43261 6 1.05 4.03 PlyA - 43735 43730 6 1.05 4.02 Term - 51229 50908 322 0 1 80 38 207 0.476 8.21 4.01 Init - 52254 52211 44 0 2 85 55 83 0.803 4.64 4.00 Prom - 53014 52975 40 -5.05 5.07 PlyA - 54981 54976 6 1.05 5.06 Term - 55482 55318 165 0 0 31 44 387 0.867 25.53 5.05 Intr - 56778 56668 111 0 0 52 37 102 0.093 1.16 5.04 Intr - 77420 77127 294 2 0 2 -1 291 0.145 7.88 5.03 Intr - 77537 77451 87 2 0 50 80 95 0.632 4.15 5.02 Intr - 79657 79537 121 0 1 55 83 82 0.683 3.98 5.01 Init - 81635 81577 59 2 2 97 72 40 0.772 4.13 5.00 Prom - 91408 91369 40 -5.15 6.07 PlyA - 93095 93090 6 1.05 6.06 Term - 100178 99998 181 1 1 96 43 139 0.217 6.30 6.05 Intr - 109001 108974 28 0 1 63 94 27 0.029 -2.94 6.04 Intr - 117828 117621 208 0 1 50 56 151 0.036 5.83 6.03 Intr - 122815 122742 74 2 2 76 6 56 0.008 -5.59 6.02 Intr - 131379 131297 83 2 2 81 97 63 0.958 4.86 6.01 Init - 132238 132153 86 1 2 104 96 103 0.998 13.04 6.00 Prom - 133619 133580 40 -5.65 7.02 PlyA - 133808 133803 6 1.05 7.01 Sngl - 141982 141647 336 1 0 49 37 202 0.856 6.98 7.00 Prom - 149696 149657 40 -7.75 8.13 PlyA - 149761 149756 6 1.05 8.12 Term - 150961 150787 175 0 1 133 48 106 0.994 7.35 8.11 Intr - 153813 153708 106 1 1 66 70 87 0.856 3.05 8.10 Intr - 160046 159907 140 1 2 68 93 49 0.463 2.59 8.09 Intr - 169155 169034 122 0 2 59 96 51 0.379 1.37 8.08 Intr - 177791 177626 166 1 1 61 30 130 0.848 3.54 8.07 Intr - 180501 180320 182 0 2 45 116 160 0.936 12.24 8.06 Intr - 186201 186111 91 2 1 75 93 95 0.990 7.78 8.05 Intr - 186436 186289 148 2 1 23 98 173 0.925 10.17 8.04 Intr - 187047 186868 180 1 0 95 40 81 0.825 2.92 8.03 Intr - 188542 188464 79 1 1 104 92 40 0.997 4.21 8.02 Intr - 191296 191066 231 1 0 32 99 127 0.947 5.25 8.01 Init - 195942 195889 54 0 0 63 87 60 0.664 4.73 8.00 Prom - 196764 196725 40 -6.65 9.00 Prom + 198405 198444 40 -5.65 9.01 Init + 198845 198852 8 1 2 103 80 0 0.740 1.25 9.02 Intr + 201184 201395 212 1 2 56 41 167 0.436 6.43 9.03 Intr + 201825 202108 284 1 2 61 24 254 0.399 12.51 9.04 Intr + 202159 202318 160 0 1 49 91 206 0.823 15.64 9.05 Intr + 213388 213527 140 2 2 103 121 -42 0.003 -0.14 9.06 Intr + 221208 221290 83 2 2 29 94 44 0.011 -3.38 9.07 Intr + 225085 225256 172 1 1 60 110 101 0.163 8.52 9.08 Intr + 227776 227843 68 0 2 62 77 40 0.051 -2.92 9.09 Term + 228986 229475 490 2 1 34 48 387 0.175 22.44 9.10 PlyA + 230052 230057 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 90042 90178 137 1 2 76 48 71 0.848 -0.90 S.002 Term - 110464 110351 114 1 0 42 42 140 0.856 2.39 S.003 Term + 225403 225591 189 0 0 102 35 84 0.841 0.97 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:94871443_95103680|GENSCAN_predicted_peptide_1|103_aa MPALRHLEGASLLQARPADGAQLLAWGALGGCGCCFTGRCQGARAHPHPPAKAACGKTKN KQAQLQWLFHLQGQLGGLVLLVVSPGESAGPSAARSYLSGEVN >gi568815586r:94871443_95103680|GENSCAN_predicted_CDS_1|312_bp atgcctgcactccggcatttggaaggcgcgagcctcctccaggcacggcccgccgacggc gcgcagctcctggcctggggcgccctcgggggctgcggctgctgctttaccggcaggtgc cagggagcccgcgctcacccccaccctccagccaaggctgcctgtggaaaaacgaaaaat aaacaggctcagctgcagtggctcttccacctccagggacagcttgggggactggtgctg cttgttgttagccccggagagagcgcaggccctagtgcagctcgttcttacttgtctggg gaagttaattga >gi568815586r:94871443_95103680|GENSCAN_predicted_peptide_2|162_aa MRQKINKDIQDLNSVLDQVDLIDIYRTLHRKSIEYTLFSAPHHTYSQIDHIIGSKTLRSK CKRMEIINILSDHSAIKLELRIKKLTQNCTTTWKLNNLLLNDYWVNNEIKAEINKFFETN ENKDTMYQNLWDTAKAVFRGKFIALNAHSPTGESRKDLKLTP >gi568815586r:94871443_95103680|GENSCAN_predicted_CDS_2|489_bp atgagacagaaaattaacaaggatattcaggatttgaactcagttctggaccaagtggac ctaatagacatctacagaactctccatcgcaaatcaatagaatatacattattctcagca ccacatcacacttattctcaaattgaccacataattggaagtaaaacactccgtagcaaa tgcaaaagaatggaaatcattaacattctctcagaccacagtgcaatcaaattagaactc aggattaagaaactcactcaaaactgcacaactacatggaaactgaacaacctgctcctg aatgactactgggtaaataacgaaattaaggcagaaataaataagttctttgaaaccaat gagaacaaagacacaatgtaccagaatctctgggacacagctaaagcagtgtttcgtggg aaatttatagcactaaatgcccacagccccacaggagaaagcaggaaagatctaaaattg acaccctaa >gi568815586r:94871443_95103680|GENSCAN_predicted_peptide_3|73_aa MSPALELDHLLQCLWPGTLRMIKAADATALTAPSGEQSIAKEAQAVRWMNRSQDFTAIFL LFVVYDPSSPHPK >gi568815586r:94871443_95103680|GENSCAN_predicted_CDS_3|222_bp atgtctccagccctagagctggaccatttacttcagtgtctctggccaggcacactgagg atgataaaggctgctgatgccacggcactgactgcaccctcaggagagcaaagcattgcc aaagaagcccaagctgttcggtggatgaacagatcccaggacttcactgccattttcttg ctttttgtcgtttatgatccaagctcccctcatcccaaatag >gi568815586r:94871443_95103680|GENSCAN_predicted_peptide_4|121_aa MSRVLVAHGPHPVASLSLSNPSSASEYPAERAQLLSAEVLLRGPGRQDSSLVLCYPHMVL GSLLHPLPQETPGVPADPNIEALLVLPSPKHVASFVTEENILNFLGGVGLQFVVHSKFLA L >gi568815586r:94871443_95103680|GENSCAN_predicted_CDS_4|366_bp atgagcagagtcctcgtggcccatgggccacatcccgttgcaagcttgtctctttcaaat ccttcctcagcctctgagtatccagctgaacgtgctcagctgctttctgctgaggtcctt ctcagaggacctggtagacaggactcaagccttgtcttgtgttacccccatatggtcttg ggaagtctccttcatcctttgcctcaggaaactccaggagtgcccgctgatcccaatata gaagccctgcttgttcttccatctccaaagcacgtagccagctttgtaacggaggaaaac attctgaattttcttggtggagttggacttcagttcgtggtgcactctaagtttcttgct ctttag >gi568815586r:94871443_95103680|GENSCAN_predicted_peptide_5|278_aa MHSNTLQEGEHTDGQMQELGHSPQAPLTSPLHTQTTHIINDDAREDPGFVETKVYTICGE EGKCDAKLTKCEAMRKILPEPSKDLESAQFHSRSAPGDAAVSRGPLCEAAVIRGPLCEAA IVWGSGKLKEHQSELDLKSKLLSLFRSKVLPLACVLRATPLFSLALLCMAGSQQPPEGER GDKGEARELKIIFNASQQKTEVLSPTSHKEMNSANNRVNETAVLKKKKKEEEEEEEEEEE EEEEEEEEEEEEEGRRRRRSSPSCYSITPELSCKLGHR >gi568815586r:94871443_95103680|GENSCAN_predicted_CDS_5|837_bp atgcactcaaacaccttacaggagggggagcacacagatgggcagatgcaggagctggga catagccctcaagcacctctcacttcacctctccacacccaaacaactcacataataaat gatgatgccagggaagatccaggttttgtggaaactaaggtttatacaatttgtggggag gaggggaagtgtgatgccaagttgactaaatgtgaggccatgaggaaaatattaccagag ccctccaaggaccttgagtctgcccaatttcacagtaggtctgccccgggtgatgcagcc gtcagtcgggggcccctgtgtgaagcggccgtcatccggggacccctgtgtgaagcagcc attgtctggggatctggcaaattgaaggagcaccagtctgagttggatctgaaatctaaa ttgttgtccttattcagaagcaaagtgctgcccctggcctgtgttctcagagccactcct ctcttttcccttgctctgctctgtatggcaggaagccaacagccacctgaaggagaacgt ggggacaagggagaagccagggaactgaagataatcttcaatgccagccagcaaaaaacg gaggtcctcagtccaacatcccacaaggaaatgaattctgccaacaaccgtgtgaatgag actgcagtcttgaagaagaagaagaaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaaggaagaagaagaagaagaagc agcccatcttgttattctataacccctgaactaagttgtaaacttggccatcgttga >gi568815586r:94871443_95103680|GENSCAN_predicted_peptide_6|219_aa MELVQVLKRGLQQITGHGGLRGYLRVFFRTNDAKVGTLVGEDKYGNKYYEDNKQFFGRHR WVVYTTEMNGKNTFWDVDGSMGKAEKHPLYNCGNEKCKEDNMKLGLMGIKSNGRVRWLFY TAVRESFFEKVTFEERRRSGEKAFQAEGTTEAFVEKQGGRHRWLHSMTDDPPTTKPLTAR KFIWTNHKFNVTGTPEQYVPYSTTRKKIQEWIPPSTPYK >gi568815586r:94871443_95103680|GENSCAN_predicted_CDS_6|660_bp atggagttagtgcaggtcctgaaacgcgggctgcagcagatcaccggccacggcggtctc cgaggctatctacgggtttttttcaggacaaatgatgcgaaggttggtacattagtgggg gaagacaaatatggaaacaaatactatgaagacaacaagcaattttttggccgtcaccga tgggttgtatatactactgaaatgaatggcaaaaacacattctgggatgtggatggaagc atggggaaggcagaaaaacatcctttatataattgtggtaatgaaaaatgcaaggaagat aatatgaagctgggcctcatgggaatcaagagcaatggaagggtcaggtggctgttttat acagcagttagagaaagcttctttgagaaagtgacatttgaggaaagacgtagaagcggg gagaaagcgttccaggcagagggaaccacagaagcctttgtagagaaacagggtggcagg catcgttggcttcacagtatgactgatgatcctccaacaacaaaaccacttactgctcgt aaattcatttggacgaaccataaattcaacgtgactggcaccccagaacaatatgtacct tattctaccactagaaagaagattcaggagtggatcccaccttcaacaccttacaagtaa >gi568815586r:94871443_95103680|GENSCAN_predicted_peptide_7|111_aa MLKKSIRTDGLDLDKEQISELVHKCKEMVQNVKHKDENGKYAEKTKYSGNVKKIIGDQSR RVNVHLIPRFRKEVTENVGEKLFKTISTAKNVTSPNSKMMLNSNKHQLQKP >gi568815586r:94871443_95103680|GENSCAN_predicted_CDS_7|336_bp atgttgaaaaaatctatcaggactgatggactggacctagacaaggaacaaattagtgaa ctggtacataaatgcaaagaaatggttcagaatgtaaagcacaaagatgaaaatggaaaa tatgcagaaaaaacaaaatattcaggtaacgttaaaaagataataggggatcaatccaga agggtcaatgttcacctaatacccagatttcggaaagaagtaacagaaaatgtaggggaa aaattatttaaaacaatttctactgctaaaaatgtcacatctccaaattcaaagatgatg ctgaattccaacaaacatcaactacaaaagccctag >gi568815586r:94871443_95103680|GENSCAN_predicted_peptide_8|557_aa MATIEEIAHQIIEQQMGEIVTEQQTGQKIQIVTALDHNTQGKQFILTNHDGSTPSKVILA RQDSTPGKVFLTTPDAAGVNQLFFTTPDLSAQHLQLLTDNSPDQGPNKVFDLCVVCGDKA SGRHYGAVTCEGCKGFFKRSIRKNLVYSCRGSKDCIINKHHRNRCQYCRLQRCIAFGMKQ DSVQCERKPIEVSREKSSNCAASTEKIYIRKDLRSPLTATPTFVTDSESTRSTGLLDSGM FMNIHPSGVKTESAVLMTSDKAESCQGDLSTLANVVTSLANLGKTKDLSQNSNEMSMIES LSNDDTSLCEFQEMQTNGDVSRAFDTLAKALNPGESTACQSSVAGMEGSVHLITGDSSIN YTEKEGPLLSDSHVAFRLTMPSPMPEYLNVHYIGESASRLLFLSMHWALSIPSFQALGQE NSISLVKAYWNELFTLGLAQCWQVMNVATILATFVNCLHNSLQQDHPSLENMEQIEKFQE KAYVEFQDYITKTYPDDTYRLSRLLLRLPALRLMNATITEELFFKGLIGNIRIDSVIPHI LKMEPADYNSQIIGHSI >gi568815586r:94871443_95103680|GENSCAN_predicted_CDS_8|1674_bp atggcaaccatagaagaaattgcacatcaaattattgaacaacagatgggagagattgtt acagagcagcaaactgggcagaaaatccagattgtgacagcacttgatcataatacccaa ggcaagcagttcattctgacaaatcacgacggctctactccaagcaaagtcattctggcc aggcaagattccactccgggaaaagttttccttacaactccagatgcagcaggtgtcaac cagttattttttaccactcctgatctgtctgcacaacacctgcagctcctaacagataat tctccagaccaaggaccaaataaggtttttgatctttgcgtagtatgtggagacaaagca tcaggacgtcattatggagcagtaacttgtgaaggctgcaaaggattttttaaaagaagc atccgaaaaaatttagtatattcatgtcgaggatcaaaggattgtattattaataagcac caccgaaaccgctgtcaatactgcaggttacagagatgtattgcgtttggaatgaagcaa gactctgtccaatgtgaaagaaaacccattgaagtatcacgagaaaaatcttccaactgt gccgcttcaacagaaaaaatctatatccgaaaggaccttcgtagcccattaactgcaact ccaacttttgtaacagatagtgaaagtacaaggtcaacaggactgttagattcaggaatg ttcatgaatattcatccatctggagtaaaaactgagtcagctgtgctgatgacatcagat aaggctgaatcatgtcagggagatttaagtacattggccaatgtggttacatcattagcg aatcttggaaaaactaaagatctttctcaaaatagtaatgaaatgtctatgattgaaagc ttaagcaatgatgatacctctttgtgtgaatttcaagaaatgcagaccaacggtgatgtt tcaagggcatttgacactcttgcaaaagcattgaatcctggagagagcacagcctgccag agctcagtagcgggcatggaaggaagtgtacacctaatcactggagattcaagcataaat tacaccgaaaaagaggggccacttctcagcgattcacatgtagctttcaggctcaccatg ccttctcctatgcctgagtacctgaatgtgcactacattggggagtctgcctccagactg ctgttcttatcaatgcactgggcactttcgattccttctttccaggctctagggcaagaa aacagcatatcactggtgaaagcttactggaatgaactttttactcttggtcttgcccag tgctggcaagtgatgaatgtagcaactatattagcaacatttgtcaattgtcttcacaat agtcttcaacaagatcatccaagcctagaaaacatggaacagatagagaaatttcaggaa aaggcttatgtggaattccaagattatataaccaaaacatatccagatgacacctacagg ttatccagactactactcagattgccagctttaagactgatgaatgctaccatcactgaa gaattgtttttcaaaggtctcattggcaatatacgaattgacagtgttatcccacatatt ttgaaaatggagcctgcagattataactctcaaataattggtcacagcatttga >gi568815586r:94871443_95103680|GENSCAN_predicted_peptide_9|538_aa MPRLTYLLKIMGQPGNIRTAQKWYLPELEELFLNKRAIYRKLGHAPCSENPGAVPFLLKR SRPTGSDDPGQICRRRPGDALGPRPLAVGVKGTPWPPPPTRSLVSPPSVSYRRFCALLTP ASGADATVPRLPLVDWGALREERLKKADGMWDRDSRRRELSVFGYFWGRLRISNGEKRRE EVVTFPGWGGGIGGPSSGSVSPLPPFGLEAECPARATRSQPGTLYARHCGTQFLACSHLL NFTTSSLNQGRHVIILLSFYGCRKQLRKPGDSKDHNGDGCYHRQPFHRPGSGVVSSITAK TRCHLTVTTSANSAKCLANVTMSHPQSFPMEYTLDLTPLAPNYNIITHLHHLQHNAWPKV EAPYSLLNEGMNCRWRLLTKAQTGLPRPAGCLLPWLHCLAHHVTTAAIHKAPNSEANRTV GCICSHDFLGDSCGILFSHPWDSKPVCTMELGRAAKLAPEFTKSNVKLTALATDSAEDHL AWSKDINAYNSDERTEKCPFPNIDGKDQDLAVLLGMLDPPELEEKGMGVRAYGVYFCS >gi568815586r:94871443_95103680|GENSCAN_predicted_CDS_9|1617_bp atgcccaggctgacctacttactgaagattatgggacaacctggcaacattcgtactgca caaaaatggtatctgcccgagttggaagaattatttcttaacaaaagagctatctatcgg aagcttggacatgccccctgcagtgagaaccctggggccgttccatttcttctgaaaaga tccaggcccacaggcagcgacgaccctggccagatttgccgacgccggccgggggatgcg ctgggaccgcgtccgttggcggttggggtgaagggcaccccctggcccccgcccccgacg cggtcgctggtgtccccacccagcgtttcttaccggcgcttttgcgccctgctgactccg gcgtcgggcgccgacgcgacagtcccgcggctgccactcgtggattggggggcgctccgg gaagagaggttgaagaaagccgacgggatgtgggatcgagattcacggcggagagagctt tctgtgtttgggtatttctgggggcgcctgcgcattagcaacggggagaagcggcgagag gaggtcgtgacgttcccagggtggggtggagggatcggagggccgagcagcggctctgtg agtccactgcccccttttggcttggaggcagagtgccccgccagagctacgcgttcccag ccaggtaccctatatgccaggcactgtggcacacaatttctggcatgttcacatctactg aatttcacaacatcctcactaaaccaaggaaggcacgtgattatattactttcattttat ggatgcagaaaacagctcagaaagcctggagacagtaaagaccacaatggagatgggtgt tatcatcgccaacctttccacaggcccggcagtggtgtcgtctccagcattactgccaaa acaagatgccatttaacagttactacgtcagccaactctgctaaatgcttagcaaatgtc accatgtctcaccctcagagcttccccatggaatacacattggatcttacaccccttgcc cccaactacaacatcatcacccaccttcaccatttacagcacaatgcctggcccaaggta gaggctccatattccttactgaacgaagggatgaactgcaggtggagactactaacaaaa gcacaaacaggactgcccagaccagctggctgcttgctgccgtggctacactgtctcgcc caccatgtcactactgctgccatccacaaggctcccaactcggaggctaatcgcactgtt ggctgtatctgttcccacgactttctaggagactcctgtggcattcttttttcccaccct tgggactctaaaccagtgtgcactatggagcttggcagagctgcaaagctggcaccagaa ttcaccaagagcaatgtgaagttgactgcccttgcaacagacagtgctgaggaccatctt gcctggagcaaggatatcaatgcttacaatagtgatgagcgaacagaaaaatgccctttt cccaacattgatggtaaggatcaggaccttgccgtcttgttgggcatgctggatcctcct gagttggaagaaaagggaatgggtgtcagagcttatggtgtttatttttgctcctga