GENSCAN 1.0	Date run: 5-Nov-116	Time: 01:00:33

Sequence gi568815582f:56508991_56683176 : 174186 bp : 47.41% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -   1044    967   78  0  0   85   94   117 0.863  11.75
 1.04 Intr -   1931   1869   63  2  0   77   87    39 0.661   1.61
 1.03 Intr -  10883  10756  128  0  2   47   86   238 0.557  19.90
 1.02 Intr -  14507  14431   77  1  2   82  105    27 0.229   2.96
 1.01 Init -  26146  26115   32  1  2   65  115    17 0.139   1.31
 1.00 Prom -  29154  29115   40                              -3.76

 2.00 Prom +  30949  30988   40                              -4.86
 2.01 Init +  52296  52333   38  2  2  112   99     7 0.978   3.72
 2.02 Term +  54396  54522  127  0  1  106   45   115 0.932   6.76
 2.03 PlyA +  55241  55246    6                              -0.45

 3.00 Prom +  56048  56087   40                               1.64
 3.01 Init +  58697  58826  130  1  1   93   70    76 0.732   4.86
 3.02 Term +  62477  62604  128  0  2   65   54    62 0.345  -1.06
 3.03 PlyA +  64188  64193    6                               1.05

 4.00 Prom +  66472  66511   40                              -2.26
 4.01 Init +  80197  80324  128  0  2   60   27   243 0.903  13.06
 4.02 Intr +  80824  80945  122  1  2   66   70    48 0.414   0.94
 4.03 Term +  81850  81959  110  2  2   68   54    88 0.403   2.07
 4.04 PlyA +  82076  82081    6                               1.05

 5.00 Prom +  85686  85725   40                              -3.16
 5.01 Init +  99666  99693   28  2  1   92  111    24 0.848   5.12
 5.02 Intr + 100002 100067   66  1  0  154  106    28 0.987  10.08
 5.03 Intr + 106212 106424  213  1  0   25   92   115 0.007   4.29
 5.04 Intr + 108100 108331  232  2  1   33   46   126 0.008  -0.27
 5.05 Intr + 108398 108600  203  2  2   40   95    84 0.729   3.23
 5.06 Term + 109599 109690   92  1  2  134   41   100 0.746   7.78
 5.07 PlyA + 109808 109813    6                               1.05

 6.00 Prom + 114930 114969   40                              -2.86
 6.01 Init + 116862 116889   28  2  1   76   95    19 0.803   1.21
 6.02 Intr + 117476 117584  109  0  1  130   47    83 0.926   7.74
 6.03 Term + 117890 118010  121  2  1  130   36   105 0.954   7.45
 6.04 PlyA + 118100 118105    6                               1.05

 7.00 Prom + 119750 119789   40                              -7.36
 7.01 Init + 122987 123518  532  1  1   95   62   115 0.641   4.62
 7.02 Intr + 124350 124415   66  1  0  145   70    40 0.980   6.78
 7.03 Term + 124761 124852   92  1  2  131   41   102 0.981   7.68
 7.04 PlyA + 124970 124975    6                               1.05

 8.00 Prom + 125964 126003   40                              -9.16
 8.01 Init + 126828 126855   28  2  1   76   99    31 0.947   2.80
 8.02 Intr + 127449 127557  109  1  1  139   53    80 0.783   8.94
 8.03 Intr + 129126 129294  169  0  1  -28   47   150 0.391  -0.75
 8.04 Intr + 129703 129776   74  0  2   66   95    86 0.860   5.30
 8.05 Intr + 130274 130339   66  2  0  157   70     2 0.787   3.32
 8.06 Term + 130869 130989  121  0  1  136   43    74 0.889   5.65
 8.07 PlyA + 131078 131083    6                               1.05

 9.00 Prom + 132760 132799   40                              -3.06
 9.01 Init + 134380 134539  160  0  1   82   66   104 0.865   7.59
 9.02 Intr + 134947 135018   72  2  0   54   75    72 0.717   1.88
 9.03 Intr + 135121 135186   66  2  0  158   70    44 0.973   8.48
 9.04 Intr + 139836 139901   66  1  0  130   70    56 0.731   6.88
 9.05 Term + 140433 140524   92  1  2  133   44    48 0.733   2.78
 9.06 PlyA + 140646 140651    6                               1.05

10.00 Prom + 142860 142899   40                              -1.46
10.01 Init + 142964 142991   28  1  1   46  110    24 0.642   0.20
10.02 Intr + 143581 143646   66  2  0  137   65    39 0.959   5.38
10.03 Term + 143984 144075   92  0  2  137   41    74 0.983   5.48
10.04 PlyA + 144194 144199    6                               1.05

11.00 Prom + 146613 146652   40                              -5.36
11.01 Init + 149069 149096   28  1  1   59   95    31 0.654   0.91
11.02 Intr + 149685 149773   89  1  2  125    2    71 0.473   1.89
11.03 Intr + 151432 151562  131  0  2   55   93   141 0.759  10.79
11.04 Term + 154181 154307  127  2  1   97   52   142 0.930   9.26
11.05 PlyA + 155155 155160    6                               1.05

12.07 PlyA - 156075 156070    6                               1.05
12.06 Term - 157980 157851  130  2  1  142   46   111 0.961   9.95
12.05 Intr - 158390 158279  112  0  1  111   53    81 0.552   6.34
12.04 Intr - 161421 161318  104  2  2  110  115    18 0.009   6.52
12.03 Intr - 166507 166378  130  2  1   50    3   115 0.008  -1.05
12.02 Intr - 173217 173103  115  0  1  126   89    14 0.658   5.32
12.01 Intr - 173640 173351  290  1  2   77   60   110 0.728   4.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr + 108095 108331  237  2  0   83   46   129 0.837   4.93
S.002 Term + 135720 135775   56  1  2  128   48    49 0.908   2.62
S.003 Init - 159003 158976   28  0  1   69   95    36 0.973   2.28
S.004 Intr + 161516 161624  109  0  1  104   53    82 0.935   5.64
S.005 Term + 161908 162037  130  1  1  129   47    65 0.930   4.15

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_1|126_aa
MLSAGEHAFCRPRSSGVPAVTDVRMTRVSSEEPMIPAAAIMLLPVFTLKLRHKISPRMVA
IGRYDGTHPCLAAATQTGKVTGDNVNSLALCDFDGDGKKELLVGSEDFDIRVFKEDEIVA
EMTETE
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_1|378_bp
atgctttctgcaggtgagcacgccttctgcaggcctagatcctcgggagtcccagctgtc
actgatgtgaggatgacacgtgtctcctctgaagaaccaatgattcctgccgccgccatc
atgctgctgcctgtgttcaccctgaaactgcgccacaaaatcagcccccgaatggtggcc
atagggcgctacgacgggactcacccgtgcctggcggccgccacccaaacgggcaaggtt
actggagacaatgttaattccttggccttgtgtgactttgatggtgatggaaagaaagag
cttcttgttggatctgaggattttgatatccgagtttttaaggaagatgagattgtggca
gaaatgacagaaacagag
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_2|54_aa
MEADKTVGGSSRSACHSDLLMTMDPNPFLLVRSSQSKIRGSMGQTEAQVVLMGE
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_2|165_bp
atggaggcagacaagactgtgggaggatctagcaggagtgcctgccacagtgacctgctg
atgaccatggatccaaatccctttctcctcgttagaagctctcagtccaaaatccgtggt
tccatgggtcagactgaggcccaggtggtccttatgggtgaatga
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_3|85_aa
MPYVPHSISILTFVVALSCLLGGICMCGDNCKCTTCNCKTYWKNGDAEAPGKSMAESHKT
SRGQNWELSSGLPDPEAMLLSMRYG
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_3|258_bp
atgccttatgttccccactccatctccatcctcacgtttgtggtggctctgtcctgtctt
ctaggaggaatctgcatgtgtggagacaactgcaaatgcacaacctgcaactgtaaaaca
tattggaagaatggagatgctgaggctccaggaaagtcaatggctgaaagtcacaagact
tccagggggcagaactgggaactgagctcaggtctgcctgacccagaggccatgctcttg
tctatgcgatatggatga
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_4|119_aa
MVRARFHAVAAAAASTSPAGPTRGVGWQCAHSGGEWVVHADAGRGDPSSSTLTLPVASPS
LGGSCTCADSCKCEGCKCTSCKKSCCSCCPAECEKCAKDCVCKGGEAAEAEAEKCSCCQ
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_4|360_bp
atggtacgtgcgcgcttccacgcagtggcggctgctgcggcgagcacgtcccctgcggga
cccacgcggggagtgggctggcagtgcgcgcacagcggcggcgagtgggtcgtgcacgcg
gatgcggggcgtggggacccgagttcgtccacattaacccttcctgtggcgtcgccctct
ctaggtggctcctgcacctgcgcggactcctgcaagtgcgagggatgcaaatgcacctcc
tgcaagaagagctgctgctcctgctgccctgcggagtgtgagaagtgtgccaaggactgt
gtgtgcaaaggcggagaggcagctgaggcagaagcagagaagtgcagctgctgccagtga
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_5|277_aa
MDPNCSCAAGDSCTCAGSCKCKECKCTSCKKMLSFGPCHIGAKLLPLHIGFPIQWQQFPL
NFYLQKRVTTELFQDARAPLTNLFLTDVVKVVSFGHSQVEWAGLPRDEAGVAHRLREWRN
TADGVVGVGISRPPEAGFPERSGTEMASNHKEMGNMGKLRGQDGTDAADDRAHRPERASD
PSTYPQEAQTQRGGCKRGAGPLRPVPSPPAKGAAGSRLQRAFQLPDCLFASPVISWLEMD
PNCSCATGCCSCCPMGCAKCAQGCVCKGASEKCSCCA
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_5|834_bp
atggatcccaactgctcctgcgccgccggtgactcctgcacctgcgccggctcctgcaaa
tgcaaagagtgcaaatgcacctcctgcaagaaaatgctgtcctttggcccctgccacatt
ggggccaaattattgccattgcatataggattcccaattcagtggcagcagtttccactg
aatttctacctgcagaaaagagtgaccacagagctcttccaggatgctagggcgcccctc
acaaatttatttctcacagatgtggtgaaagtggtgtcctttggacactcccaggttgaa
tgggcaggcctacccagggacgaagctggggttgcacaccggctccgggaatggcgaaat
accgcagatggggtggtgggggtgggaatatcgcgaccaccagaggctgggttcccggaa
cgctcggggacggagatggccagcaaccacaaggaaatggggaatatggggaagttgcgc
gggcaggacggcacggacgccgcggacgaccgggcgcacaggcctgagcgagcgagcgat
ccctccacgtacccacaggaggcccagactcagcggggcgggtgcaagcgcggggcgggg
cctctgcgtccggtcccatctccgcctgcaaaaggagcagctggctccaggctccaacgt
gccttccagctgcctgactgcctcttcgcctctcccgtcatttcttggctcgaaatggac
cccaactgctcctgcgccactggctgctgctcctgctgccccatgggctgtgccaagtgt
gcccagggctgcgtctgcaaaggggcgtcggagaagtgcagctgctgtgcctga
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_6|85_aa
MDPNCSCATGGSCTCAGSCKCKECKCTSCKKSECGAISRNLGLWLRLLFLLPRGLCQVCP
GLRLQRGIGEVQLLCLMWEQLFSQM
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_6|258_bp
atggaccccaactgctcttgcgccactggtggctcctgcacgtgcgccggctcctgcaag
tgcaaagagtgcaaatgcacctcctgcaagaagagtgagtgcggggccatctccaggaat
ctggggctgtggctcaggctgctgttcctgctgccccgtgggctgtgccaagtgtgccca
gggctgcgtctgcaaaggggcatcggagaagtgcagctgctgtgcctgatgtgggaacag
ctcttctcccagatgtaa
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_7|229_aa
MEIYLQRRTWTNVPPHPLRRREWTGERGRPVFPVLLCTEEPVRGTAVWTGTGKAGKEEKR
KPHWWRVLCTRLAPYHTPPAPHTTDPGTGAGGLHRDSGTGPVENGGARGWGGDARDAKAG
VPESAGRRVEGKGNFGETGKGGRDLGDTAYHPAHSPSRANPSQRGGRAAHSERSSGDGVS
CACTGSCTCKECKCTSCKKSCCSCCPVGCAKCAHGCVCKGTLENCSCCA
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_7|690_bp
atggagatttatctgcaaaggaggacctggacaaatgttcccccacatcctctcaggcga
agagaatggacgggagagagaggccgaccagtgttccccgtgttgctgtgtacggaggag
ccagtccgagggaccgcggtgtggacagggacaggcaaggcggggaaggaggagaaacga
aagccacattggtggcgggtgctctgcacacgactcgctccctaccacacgccccccgct
ccgcacacgaccgatccggggactggagcaggagggctgcaccgggactccgggacaggc
ccagttgaaaacggcggggcgagggggtggggtggagacgcccgcgacgccaaggctggg
gtcccggaaagcgcggggaggagggtggaaggcaaaggcaacttcggggaaactgggaaa
ggcggccgggacctcggggacactgcgtaccacccggcgcacagcccctcccgcgcaaac
ccgagccaaaggggcggtcgagcggcgcactcggagcggagctcaggggatggtgtctcc
tgcgcctgcaccggctcctgcacgtgcaaagagtgcaaatgcacctcctgcaagaagagc
tgctgctcctgctgccccgtgggctgtgccaagtgtgcccacggctgtgtctgcaaaggg
acgttggagaactgcagctgctgtgcctga
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_8|188_aa
MDPNYSCTTGGSCTCAGSCKCKECKCTSCKKSECGAISRNLGLWLRDRQGDSEEKRKSHR
WRLLCTQLARYRTLHALHYADPGTGAGGCGCTQTSGQAELKTPLNFLLGISNLTAARNGP
QLLLRHWWLLHLHWLLQMQRVQMHLLQEELLLLLPHELCQVCPGLHLQRGIREVQLLCLM
SGQPCSKI
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_8|567_bp
atggaccccaactactcctgcaccactggtggctcctgcacgtgcgccggctcctgcaag
tgcaaagagtgcaaatgcacctcctgcaagaagagtgagtgcggggccatctccaggaat
ctggggctgtggttaagggacaggcaaggcgacagcgaggagaaacgaaaatcacatcgg
tggcggttgctctgcacacaactcgctcgctaccgcacgctccacgctctgcactacgcc
gatccggggacaggagcaggaggctgtggctgcactcagacttcgggacaggccgagctg
aaaacccctctcaacttcttgcttgggatctccaacctcaccgcggctcgaaatggaccc
caactgctcctgcgccactggtggctcctgcacctgcactggctcctgcaaatgcaaaga
gtgcaaatgcacctcctgcaagaagagctgctgctcctgctgccccatgagctgtgccaa
gtgtgcccagggctgcatctgcaaaggggcatcagagaagtgcagctgctgtgcctgatg
tccggacagccctgctcgaagatatag
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_9|151_aa
MECNLREIREGRDYSRDAAYRQEHSPSKASVSQRAAPRVRNSHVKEDPAPGPPVPPISEA
RGLRLKTARLQVTLKAKGGSCTCASSCKCKEYKCTSCKKTGSCTYASFCKCKEYKCTSCK
KNCCSCYPVGCAKCAQGCICKGASDKCSCCA
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_9|456_bp
atggagtgcaatctccgggaaattcgggagggccgggattatagcagggacgccgcgtac
cgccaggagcacagcccctccaaagcgagcgtgagccaaagggccgcccccagggtgcgc
aacagccacgttaaggaggatcctgcgcccggcccgcctgtgcctccgatttctgaggcg
agaggactgaggctgaaaactgcccggctgcaggtcaccctcaaggccaaaggtggctcc
tgcacctgtgccagctcctgcaaatgcaaagagtacaaatgcacctcctgcaagaagact
gggtcctgcacttatgccagcttctgcaaatgcaaagagtacaaatgcacctcttgcaag
aagaactgctgctcctgctaccctgtgggctgtgccaagtgtgcccaaggctgcatttgc
aaaggggcatcagataagtgcagctgctgtgcctga
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_10|61_aa
MDPNCSCTTGGSCACAGSCKCKECKCTSCKKCCCSCCPVGCAKCAQGCVCKGSSEKCRCC
A
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_10|186_bp
atggatcccaactgctcctgcaccacaggtggctcctgtgcctgcgccggctcctgcaag
tgcaaagagtgcaaatgtacctcctgcaagaagtgctgctgctcttgctgccccgtgggc
tgtgccaagtgtgcccagggctgtgtctgcaaaggctcatcagagaagtgccgctgctgt
gcctga
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_11|124_aa
MDPNCSCAAGVSCTCAGSCKCKECKCTSCKKSECEAISMGSKVTGDVEESVISATQSHEK
DWVQMKDVECADAVVMAYSAVGRWSRYPHPPPCTFVISFASPVSVCSSILVLAFFIGGSW
EPRE
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_11|375_bp
atggaccccaactgctcctgcgccgctggtgtctcctgcacctgcgctggttcctgcaag
tgcaaagagtgcaaatgcacctcctgcaagaagagtgagtgtgaggccatctccatgggc
agcaaagtgactggagatgtggaggaaagcgtcatctcggcaacgcagagtcatgagaag
gactgggtgcagatgaaggatgttgagtgtgctgatgcagtggtgatggcatattctgca
gttggaaggtggtcgcgttatcctcatcccccgccctgcaccttcgtcatcagctttgcc
tctcccgtgtccgtgtgtagctccatcctcgtgctcgccttcttcattggcggaagctgg
gagccgagagagtga
>gi568815582f:56508991_56683176|GENSCAN_predicted_peptide_12|293_aa
XRVCTVAGKRASQGAELGVPYQASRSSWGPFRGKEKQEFPIKRKTRSGTDEKRGGAQKPA
ALFIVREEGRAQRPRLVLAPAPAESAPPAILGRVQQPACRETRNPGGLHTPSPRSTVCSF
AFPRVPGSRPWIGGSVLRDFPGLPATPLQDGSFENEGCPPKRMTLGDMQTLRSIKERSGA
SRQSSFPTQQKKRRPISSPFMLGKPSTLRLYSKAGVSCTCASSCKCKECKCTSCKKSECG
AISRNLGLWLRLLLLLPCGLCQVCPGLHLQRGIGEVQLLRLMSGQPCSQVQIE
>gi568815582f:56508991_56683176|GENSCAN_predicted_CDS_12|882_bp
ngaagagtctgtactgtggctgggaaacgggcatcccaaggcgcggagctaggtgtccct
taccaggcgagcaggagcagttggggtccatttcgaggcaaggagaagcaggagttcccg
atcaagaggaaaacacgcagcgggacagatgaaaagcgtggtggagcccagaagccggcg
gctctctttatagtccgggaagagggccgggcgcagaggccccgcctcgtccttgcaccc
gcccctgctgagtctgcaccgcccgcgatcctgggccgggtgcagcaacccgcgtgccgg
gaaactcggaatcccggcggtctacacacccccagcccccgctccactgtgtgtagcttt
gcatttcccagagtccctggatcacgtccctggatcggcgggagcgttctccgggacttt
ccaggcctgccggccaccccactgcaggatggcagttttgaaaatgagggatgccctcca
aagaggatgactttaggtgacatgcagaccctgcggtcaatcaaggagagaagtggggcc
agccgtcagagctcctttcccactcagcagaagaagagaagacccatctcctctccattc
atgctgggaaaaccaagcaccctccgtttgtattccaaagcaggtgtctcctgcacctgc
gccagctcctgcaagtgcaaagagtgcaaatgcacctcctgcaagaagagtgagtgcggg
gccatctccaggaatctggggctgtggctaaggctgctgctcctgctgccctgtgggctg
tgccaagtgtgcccagggctgcatctgcaaaggggcatcggagaagtgcagctgctgcgc
ctgatgtcgggacagccctgctcccaagtacaaatagagtga