GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:28:02 Sequence gi568815596f:206666694_206891799 : 225106 bp : 39.61% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.22 PlyA - 985 980 6 1.05 1.21 Term - 3868 3794 75 1 0 107 49 50 0.420 -0.14 1.20 Intr - 12482 12323 160 1 1 13 109 149 0.180 8.57 1.19 Intr - 19941 19765 177 2 0 91 -12 139 0.048 2.31 1.18 Intr - 20345 20069 277 0 1 -10 15 287 0.492 7.25 1.17 Intr - 20701 20385 317 0 2 -54 45 245 0.040 0.78 1.16 Intr - 26633 26482 152 2 2 61 109 139 0.566 11.34 1.15 Intr - 28217 28073 145 1 1 73 92 54 0.611 3.66 1.14 Intr - 33197 33034 164 2 2 78 92 61 0.562 3.35 1.13 Intr - 33523 33452 72 1 0 66 84 64 0.508 2.48 1.12 Intr - 38339 38150 190 1 1 95 99 99 0.959 10.37 1.11 Intr - 40810 40609 202 2 1 80 94 172 0.993 14.42 1.10 Intr - 43905 43831 75 1 0 109 84 4 0.584 0.67 1.09 Intr - 51598 51568 31 1 1 81 113 35 0.311 2.09 1.08 Intr - 52360 52261 100 0 1 87 65 45 0.171 1.29 1.07 Intr - 70641 70364 278 0 2 64 6 211 0.008 5.89 1.06 Intr - 79733 79594 140 0 2 92 95 149 0.911 15.26 1.05 Intr - 82490 82327 164 1 2 123 94 74 0.999 10.20 1.04 Intr - 84382 84241 142 2 1 87 92 22 0.981 0.99 1.03 Intr - 88812 88316 497 2 2 42 84 446 0.857 31.31 1.02 Intr - 90347 90205 143 2 2 59 75 148 0.952 8.83 1.01 Init - 90738 90544 195 0 0 20 76 126 0.284 3.58 1.00 Prom - 92776 92737 40 -6.65 2.00 Prom + 94951 94990 40 -3.55 2.01 Init + 98562 98684 123 2 0 97 50 68 0.070 4.44 2.02 Intr + 100238 100777 540 1 0 71 98 350 0.058 26.47 2.03 Intr + 103398 103501 104 2 2 47 94 86 0.941 3.15 2.04 Intr + 104489 104597 109 2 1 66 82 92 0.989 5.77 2.05 Intr + 105201 105324 124 2 1 86 82 72 0.957 5.64 2.06 Intr + 105488 105627 140 0 2 32 87 77 0.801 1.26 2.07 Intr + 111979 112053 75 0 0 49 105 70 0.603 3.59 2.08 Intr + 115601 115673 73 1 1 109 63 14 0.515 -0.94 2.09 Intr + 116407 116600 194 2 2 54 86 161 0.906 10.79 2.10 Intr + 120044 120206 163 1 1 21 91 85 0.038 0.73 2.11 Intr + 121244 121411 168 0 0 120 57 98 0.566 8.90 2.12 Term + 130811 130932 122 0 2 105 44 79 0.341 2.86 2.13 PlyA + 132433 132438 6 1.05 3.05 PlyA - 133041 133036 6 1.05 3.04 Term - 142888 142415 474 1 0 83 33 192 0.854 7.10 3.03 Intr - 147063 146953 111 0 0 81 115 45 0.640 6.16 3.02 Intr - 159460 159272 189 1 0 55 121 127 0.958 11.56 3.01 Init - 161385 161365 21 0 0 72 91 15 0.198 -0.85 3.00 Prom - 162998 162959 40 -3.25 4.00 Prom + 166412 166451 40 -7.95 4.01 Init + 166825 167027 203 0 2 53 86 89 0.676 3.60 4.02 Intr + 167449 167846 398 1 2 56 58 245 0.769 11.60 4.03 Intr + 181201 181282 82 2 1 66 19 106 0.015 -0.72 4.04 Intr + 189024 189144 121 0 1 13 79 172 0.348 8.38 4.05 Intr + 193262 193404 143 1 2 57 59 66 0.094 -1.17 4.06 Term + 197071 197296 226 1 1 51 42 129 0.033 -0.13 4.07 PlyA + 199763 199768 6 1.05 5.04 PlyA - 201961 201956 6 1.05 5.03 Term - 203987 203930 58 1 1 60 42 61 0.045 -5.12 5.02 Intr - 207968 207847 122 2 2 51 111 103 0.185 7.27 5.01 Intr - 224160 224110 51 0 0 58 94 43 0.126 0.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 120040 120206 167 1 2 90 91 93 0.875 8.48 S.002 Init + 195591 195657 67 2 1 96 80 19 0.874 3.19 S.003 Term + 197028 197296 269 1 2 95 42 141 0.852 4.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:206666694_206891799|GENSCAN_predicted_peptide_1|1231_aa MKLNSQADLIYDFTWDIIQQDWLKDVCEKNKWSHKNSPIIWRELLDRGGKGLLLGGYNEF LEHAQLYYDVTSSMTTELMMVIAQENLGAHIEKEQEEEALKTCINPLQVWITSASAPACY NLIPILTSGEVFGMHTEISITLFDNKQAEEHLKSLVVETQDLASPVLRSVSICTKVEEAF RQAHVIVVLDDSTNKEVFTLEDCLRSRVPLCRLYGYLIEKNAHESVRVIVGGRTFVNLKT VLLMRYAPRIAHNIIAVALGVEGEAKAILARKLKTAPSYIKDVIIWGNISGNNYVDLRKT RVYRYESAIWGPLHYSRPVLNLIFDSEWVKREFVAILKNLTTTGRQFGGILAAHSIATTL KYWYHGSPPGEIVSLGILSEGQFGIPKGIVFSMPVKFENGTWVVLTDLKDVEISEQIMTR MTSDLIQGPQPKECDHREQELDGKSNGALRRAGHMAIAPQHTGLLMIKLSEIRNCDSDQE AQAKFPAPYPLNQLCDSLITHNQIEKLDQRTQPRTHHLSNRIHVALKSYFQTDILSFLML QTKSCSDMLQQSSEICEMDPDKQDALNSIENSIYRTAFKLQSVQTLCQLDLIDSSLIQQV LLRPSFWEARKHSLSVQQLSQALQELFQKAREENPGQVHPRAPELTLSLLTTMYNSHKET VTSKQVLFKDTLQVFNLKIVPPTCLALFQLYAENSRGGYDSGPRMTRRVLRKLLTDLQQI PTFVGESRALCPVESATRSCFQGVLSPAIKEEKFLSWVQSEPPILLWLPTCHRLSAAERV THPARCTLCRTFPITGLSDVSCASILTGRYRCLKCLNFDICQMCFLSGLHSKSHQKSHPV IEHCIQQMSAMQNTKLLFRTLRNNLLQGRCRKKEAARRQQLLDQVNPKGVPHHAQARSVG CCGEFKTGTSLDIKMEEQIVSHAGEMLSSVVVVSCKDSVPHQVVSLTMEGIAIPQLSAKS VGSFEAFYNSIKPIQIINSAIERMKPGKFPYGKTNLFRISSVYIQYTLRCDMRQSLLTKD LTKAGEFTIHSAPQEGKLTPSPVDFTDYTCNLTEYQRESLAPQISHSRTPQLNELCHRAA AYRRAGGEKAWGCNWLFTCRMLKTNFKVEFEATIDMRLHANHLVTENFLLQLCRTQPRRR EAQKTGVALWISCCNSVSIQYKSSKGTADVRPTGPMETAATGASKPIPVPERMSRGQQTE DSNSCYQTMVSTRNELPRTHESINALHRERR >gi568815596f:206666694_206891799|GENSCAN_predicted_CDS_1|3696_bp atgaaattgaattctcaggcagatttaatatatgacttcacttgggatatcattcaacag gattggctaaaagatgtgtgtgaaaagaataagtggagtcacaagaattcccctatcatc tggagagagctgttggatcgtggaggaaagggtttgcttttgggaggatataatgagttc ctggagcatgctcagctttactatgatgtcacctctagcatgacgactgaactgatgatg gtaattgctcaagagaacctgggggcacatatagaaaaagagcaggaggaagaagccctg aaaacttgcatcaaccccttgcaggtctggatcaccagtgcctctgctcctgcctgctac aacctaattcccatattgacgagtggcgaagtgtttgggatgcatacagaaattagcata actctatttgacaacaagcaggcggaagaacatctcaaaagccttgtggtggagacccaa gacctggcatctcccgtcctgcgcagtgtctccatctgcacgaaggtggaggaggccttc cgccaggcccacgtcattgtggtgctggatgacagcaccaacaaggaggtgttcactctg gaggactgcctccgaagcagggtgcctctctgcaggctctatgggtacctgatagagaaa aatgctcatgagtctgtcagagtcatcgtgggagggagaacctttgtaaacctgaagaca gttttactcatgagatatgccccacgcattgcacacaacattattgctgtggcgctgggg gtggaaggtgaagcgaaagccatactggccagaaaactgaagacagctccttcatacatt aaagacgtgatcatttggggtaatatcagtggaaataattacgttgatctgagaaaaaca agggtgtacagatatgagagtgccatttggggacctcttcattattcacgccctgtttta aacttgatttttgacagtgagtgggtaaaaagagaatttgtggcaattcttaaaaacttg accaccacaggaagacaatttggaggcattttggctgcacacagtatagccactacactg aaatactggtaccatggctcaccacctggggagattgtatctttaggaatattgagtgaa ggccagtttggtattccgaaagggatcgtcttttctatgcctgtgaaatttgagaatgga acttgggtggttcttacagatctcaaagatgttgaaataagtgaacaaataatgacccga atgacaagtgatctaattcagggtcctcaacccaaggaatgtgaccacagagaacaagag ctggatgggaaatcaaatggtgctttacgcagagctgggcacatggcgattgctcctcaa catacagggcttcttatgataaaactaagcgagattagaaactgtgattcagaccaagaa gcccaagctaaatttcccgctccataccccttgaatcagttatgtgactcacttattacc cataaccagatagaaaagctagaccaaagaacacaaccacgaacacatcacctttcaaac agaattcatgttgctctcaagagctattttcagactgatatcttatccttcttaatgctt caaactaaatcctgttcagacatgcttcaacaatctagtgaaatttgtgaaatggatcca gataaacaagatgctcttaatagtattgagaattccatttatagaacagccttcaaatta caatcagtgcaaactctgtgccagttggacttgattgacagctccctgattcagcaggtc ctactgcgtccaagtttctgggaagctcgcaagcactccctttctgtgcagcaactttct caggcactccaagagctgtttcagaaggccagggaggaaaacccaggacaagtgcatccc agagctccggaactcactctgagccttctcacgacaatgtacaacagccacaaagagact gttacttcaaagcaagtactcttcaaagatacattgcaagtattcaatttaaagattgtt cctcctacatgtctagctctttttcaactctatgcagaaaatagcaggggaggctatgat tctgggccacgcatgactcgaagggttttgagaaaactactaacagatctacagcagatc ccaactttcgtgggagagagtcgtgctctgtgccctgtggaaagtgccacccgcagctgt ttccaaggggtgttgagcccagcaatcaaagaagaaaaattcctgtcttgggtccaatct gagcctcccatcctcctgtggctcccgacctgccaccggttatcagctgctgaaagggtc actcaccctgctcggtgcactctctgcaggactttcccaatcacgggactcagtgacgta agctgtgcttctattttaactggcagataccgctgtctgaagtgtctcaactttgacatc tgccagatgtgtttcttatctggtcttcacagcaagtcccatcagaagtctcatcctgtc attgagcactgcattcagcagatgtcagcaatgcagaatacaaaacttctcttcaggacc ctcagaaacaaccttcttcaggggcgctgtaggaagaaagaagcagcgagaaggcagcag ctgctggaccaggtgaatccaaagggtgtgcctcaccatgcgcaggccagaagtgttggc tgctgtggagaattcaaaacagggacctccctggacatcaagatggaagagcaaatagtt tctcatgctggagaaatgctctccagtgtggtggtcgtatcttgtaaggattcagtccca catcaagtagtgtctttgacaatggaaggaattgcaatcccccagctcagtgccaaaagt gtgggcagttttgaagctttttataattccattaaacctatccagattatcaacagtgcc atagaaaggatgaagccagggaaatttccctatggcaaaacaaatttatttcgcatttct tctgtatacattcagtatacactacgctgtgacatgaggcagtctctgttgaccaaggac ttgacaaaggccggtgaatttaccatccactctgctcctcaggaggggaagttgactcca agtcctgtggacttcacagattatacctgcaaccttacagaatatcaaagagagagcctt gctccccagatttctcattcgagaacaccccaactcaatgaattgtgtcatcgtgcagct gcttacaggagagctggtggtgaaaaggcatggggctgcaactggctgtttacctgccgc atgctgaagaccaacttcaaagtggaatttgaggccaccatcgacatgcgtcttcatgct aaccacctggtcacagagaacttcctattgcagctctgcaggactcagcccaggaggagg gaagcacagaaaacaggggtggccttgtggatatcctgctgtaatagcgtctccattcag tacaagagttctaaaggaacagcagatgtaaggccaactggaccaatggaaacagcagcc acaggtgcctccaagcctatacctgtacctgaaagaatgtctcggggtcaacaaactgag gacagcaactcatgttaccagaccatggtcagcactaggaatgaactgccgaggactcat gaatccattaatgctctccatagagaaagaagatag >gi568815596f:206666694_206891799|GENSCAN_predicted_peptide_2|644_aa MTNLAMVERDSEAGTAASRFPGNHAAKGKAQAHYKVWRPAEDAFIFKSDVGFQTKGISTL TALRIERLLYAKRLFFDSKQSLVPVDKSDDELKKVNLNHEVSNEDVLTKETKPNRISSRK LSEECNSLSDVLDAFSKAPTFPSSNYFTAMWTIAKRLSDDQKRFEKRLMFSHPAFNQLCE HMMREAKIMQYKYLLFSLHAIVKLGIPQNTILVQTLLRVTQERINECDEICLSVLSTVLE AMEPCKNVHVLRTGFRILVDQQVWKIEDVFTLQVVMKCIGKDAPIALKRKLEMKALRELD RFSVLNSQHMFEVLAAMNHRSLILLDECSKVVLDNIHGCPLRIMINILQSCKDLQYHNLD LFKGLADYVAATFDIWKFRKQGTPAPRPRTGTSPWPVRNWATQQECNSSWFCSCLEYDNI LIGFWTFHKGAVNQLSGVRPTKKQLPTVLSTEWTDSAHTAFPVSSLIVTVRVPVLVMSYP VSGAALTTGCTGDWFVEVMASALTGYLHTISSENLLDAVYSFCLMNYFPLAPFNQLLQKD IISELLTSDDMKNAYKLHTLDTCLKLDDTVYLRDIALSLPQLPRELPSSHTNAKVAEVLS SLLGGYCNGGSTSSRPQAKPWMHLLFAQLPQGRALHRSQCLKTG >gi568815596f:206666694_206891799|GENSCAN_predicted_CDS_2|1935_bp atgacgaatttggccatggtcgagagagactcagaggcagggaccgcggcttcgcggttt cctggcaaccacgcagccaagggcaaggcgcaggcgcactacaaagtctggcgcccagca gaggatgcattcatttttaaatcagatgttggctttcaaacaaagggcataagcactcta acagcccttagaattgaaagactactttatgctaaaagactgttttttgactcaaagcag tctcttgtccctgttgataaatctgatgatgaattgaagaaagtaaaccttaatcatgaa gtctccaatgaagatgttcttaccaaggaaacaaaaccaaaccgtatcagcagtagaaaa ctgtctgaggaatgtaattccctgagtgatgtgttagatgcattttcaaaagcgcccaca tttcctagtagcaactatttcacagcaatgtggacaattgccaaaagactgtctgatgac cagaagcgctttgaaaaacgactgatgtttagccaccctgcatttaatcagctctgtgaa catatgatgagagaagccaagatcatgcagtataagtacctactgttcagtcttcacgcc atagtgaagcttggaatccctcagaacactattttggtgcagactttgctgagggtgacc caggaacgtatcaatgagtgtgatgagatatgcctttcagttttgtcaactgttttagag gcaatggaaccatgcaagaatgttcatgttctacgaacgggattcagaatactagttgat cagcaagtttggaaaatagaagatgtcttcacattacaagttgtgatgaagtgtattgga aaagatgcaccgattgctcttaagaggaaactggagatgaaagccttgagggaattagac agattttctgttttgaatagccaacacatgtttgaagtactagctgccatgaatcaccga tctcttatactcctggatgaatgcagtaaggtggtcctagataatatccatgggtgtcct ttaagaataatgatcaacatattgcagtcctgcaaagacctccagtaccataatttggat ctcttcaagggacttgcagattatgtggctgcaactttcgacatctggaagttcagaaaa caggggaccccagcccccaggccacggactggtaccagtccgtggcctgttaggaactgg gccacacagcaggagtgcaactcttcttggttttgttcttgtctggagtacgacaatatc ttaattggtttctggacttttcataaaggggcagttaatcaattaagtggtgtcaggcca actaaaaagcagcttcccactgtactgtccactgaatggacagactcagctcatactgca tttcctgtgtcttccctaattgtcaccgtcagagtccctgtgcttgtcatgtcttaccca gtctccggtgcagccttgacaacaggctgcaccggagactggttcgtggaagttatggct agtgctctgactggttatcttcacactatttcttctgaaaacttattggatgcagtatat tcattttgcttgatgaattactttcccctggctccttttaatcagcttctgcaaaaagac atcatcagtgagctgctgacatcagatgacatgaagaatgcttacaagctgcatactttg gatacttgtctaaaacttgatgatactgtctatctgagggacatagccttgtcactccca cagctgccgcgggagctgccatcgtcacatacaaatgcaaaggtggcagaggtgctgagc agccttctgggaggctattgtaacgggggttccacgtcatccaggccacaggccaagcct tggatgcacctgctctttgcccagctgccccagggcagagctctgcacagatcacagtgc cttaagacaggatga >gi568815596f:206666694_206891799|GENSCAN_predicted_peptide_3|264_aa MVMGNLQQILIARFQGQSCGTTQENPHHTSWNRYDLIIDWLSGEEDAQILGLGQKTGEGV ALILEYASRKIFNTKNLQNTKILYFQNPGSRGKVLIHGLGISGLEIKHALKRLKPVITRL LQHGLLKPINSPYNSPILPVLKPDKPYKLVQDLRLSNQIVLPIHPVVPNPYTLLSSIPPS TTHYSVLDLKHAFFTIPLHPSLSSLSLGLTLTPIKLSKLPRLYCRKAHRQPHYFNQAQIS SSSVTYLGIILIKTHVLSLPIVSD >gi568815596f:206666694_206891799|GENSCAN_predicted_CDS_3|795_bp atggtgatggggaacttgcagcaaatcttaattgccaggttccaaggacagagctgtgga acaacacaagaaaatccccatcatacaagttggaaccgatacgacttgataattgattgg cttagtggtgaggaagatgcccagattcttggcttaggccagaagacaggagaaggagtg gcattaatacttgaatatgcaagtaggaagatattcaatactaagaatctccaaaatacg aagattctttattttcagaatccaggatcaagaggaaaggtgctaattcatggattgggt atctcagggttagaaataaagcatgctttaaaaagattaaagcctgttatcactcgcctg ctacagcatggccttttaaagcctataaactctccttacaattcccccattttacctgtc ctaaaaccagacaagccttacaagttagttcaggatctgcgccttagcaaccaaattgtt ttgcctatccaccccgtggtgccaaacccatatactctcctatcctcaatacctccctct acaacccattattctgttctggatctcaaacatgctttctttactattcctttgcatccc agcctctcttcgctttcacttggactgaccctgacacccatcaagctcagcaaattacct aggctgtactgccgcaaagctcacagacagccccattacttcaatcaagcccaaatttct tcctcatctgttacctatctcggcataattctcataaaaacacacgtgctctccctgcca atcgtgtccgactaa >gi568815596f:206666694_206891799|GENSCAN_predicted_peptide_4|390_aa MLQARFSMKQRFACRKLIGGTLAIDTCKEWRKKIWQRKKSGCEAIIEKDFGQFQEELWNS SAYLSELGHSVLAVHWSIIASTLSPMSKFSPPTSVFCWDPVVYIAARELNSMIASPIVNH ELQKLLNTELPSDAAAPCTGPFLMGLLIERFLGPLVDQSHLVAFSALSATPVIATLLMLL IFPVTCATVVLYYLVWAIAFSAAVADGSRTRGLCAEKHHEAAEAMRARREAPVTALPESG PAAQDLVLLVYGKAGRWCLVLADDAAGQIALQVFFCISVIPHSAKDAMSYCVLWVEYLSP PKLMLKFKPQCGSIESFNFDQCLYPIGGQVPEGKEKNSLEHMAGKDHLCVPAEKTNDSSS LPSRCVSDPCRSADHRQSCALANLLLFKAN >gi568815596f:206666694_206891799|GENSCAN_predicted_CDS_4|1173_bp atgttgcaggccaggttctccatgaaacagagatttgcttgcagaaagttgattgggggc actcttgcgatcgatacctgtaaggagtggaggaaaaaaatatggcagaggaaaaagtca ggctgtgaagcaatcattgaaaaggactttggtcaattccaagaagaactttggaatagc tctgcatatttgtctgaattaggccactctgtccttgctgttcattggtcaataattgct tctaccttgtccccgatgtctaagttctcccctcccacatctgtattctgttgggatcca gtggtctacattgctgccagggagctcaattctatgatagcatctcctattgtcaaccat gaactacagaagttgctcaacactgagcttcccagtgatgctgctgccccttgcactggg ccattccttatgggtttgttgatagaacgctttctgggccctcttgtggatcaaagtcat ctggtggccttttcagctttatctgctacacccgtgattgccactcttctgatgctttta atctttcctgtcacctgtgccacagtagttctatactacttagtgtgggccattgctttc tcagcagcagtagcagatggtagcaggactcgaggcttatgtgcagagaaacatcatgag gctgcagaagccatgagagcgaggcgagaggctcccgtaactgcgctgcctgagtctggg cccgcggcacaggacctggtgctgttggtttatggtaaagcaggaaggtggtgcctcgtg ctggcagatgatgctgcgggccagattgctttgcaggtcttcttctgcatcagtgtcatc cctcactcggccaaagatgccatgtcctactgtgtgctatgggttgaatatttgtcccct ccaaaactcatgttgaaatttaagccccaatgtggtagtattgagagcttcaactttgat caatgcctctatccaatagggggtcaagtacctgaagggaaggaaaagaattctctggaa catatggcaggaaaggatcatttgtgtgtgcctgctgaaaagacaaatgactcgtcctcc ctgccctctcgctgtgtcagtgatccttgccgctcagctgatcatagacaatcttgtgct ctagctaatctgctgcttttcaaagcaaactga >gi568815596f:206666694_206891799|GENSCAN_predicted_peptide_5|76_aa FWSLKVQGQGSGTIQFQAKERGSRGSKSVRNIERAAVCGAVPILKFLQGFQMLQEPGRRD LADVGQKKMGNKASNV >gi568815596f:206666694_206891799|GENSCAN_predicted_CDS_5|231_bp ttctggagccttaaagtccaaggtcaaggttctggcacaattcagtttcaggcaaaggaa cgaggctctagaggaagcaagtctgttaggaacatagagcgagcagcagtgtgtggtgct gtccctatcctcaagttccttcagggctttcagatgctacaggagccaggcaggagggac ctggcagatgtggggcaaaagaagatgggaaataaagctagtaacgtttaa