GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:19:36 Sequence gi568815589f:126514450_126796448 : 281999 bp : 50.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4760 4856 97 0 1 104 101 43 0.612 6.27 1.02 Intr + 12370 12442 73 1 1 105 75 30 0.231 2.81 1.03 Intr + 17958 18015 58 2 1 85 91 17 0.094 0.16 1.04 Intr + 42648 42767 120 1 0 -2 86 108 0.003 2.07 1.05 Intr + 50462 50799 338 0 2 72 90 94 0.011 3.24 1.06 Intr + 55354 55391 38 0 2 86 97 26 0.047 0.36 1.07 Intr + 62190 62245 56 0 2 62 106 69 0.102 4.72 1.08 Term + 68037 68128 92 1 2 73 49 92 0.378 1.68 1.09 PlyA + 70668 70673 6 -0.45 2.03 PlyA - 70852 70847 6 1.05 2.02 Term - 71501 71394 108 2 0 53 35 105 0.303 0.21 2.01 Init - 74527 74423 105 1 0 89 38 100 0.304 5.40 2.00 Prom - 83612 83573 40 -4.26 3.00 Prom + 84846 84885 40 -2.06 3.01 Init + 98162 98288 127 1 1 75 105 125 0.921 11.22 3.02 Intr + 99846 100139 294 1 0 7 92 275 0.606 16.78 3.03 Term + 100670 101124 455 0 2 -32 47 544 0.865 33.62 3.04 PlyA + 103580 103585 6 1.05 4.14 PlyA - 103597 103592 6 1.05 4.13 Term - 108954 108839 116 1 2 85 39 122 0.752 5.73 4.12 Intr - 111517 111008 510 2 0 64 52 187 0.467 5.64 4.11 Intr - 117766 117694 73 1 1 50 100 16 0.089 -2.02 4.10 Intr - 129791 129664 128 0 2 68 76 76 0.449 4.80 4.09 Intr - 130847 130734 114 0 0 134 78 -1 0.550 3.92 4.08 Intr - 138325 138150 176 0 2 137 75 -26 0.470 0.58 4.07 Intr - 140634 140508 127 1 1 130 73 -20 0.170 0.44 4.06 Intr - 151847 151739 109 2 1 126 77 38 0.729 6.36 4.05 Intr - 152309 152174 136 1 1 133 75 37 0.595 7.47 4.04 Intr - 152958 152829 130 1 1 58 35 77 0.287 -0.85 4.03 Intr - 157770 157623 148 0 1 93 36 78 0.417 2.91 4.02 Intr - 162382 162231 152 2 2 99 78 39 0.634 3.88 4.01 Init - 168878 168824 55 2 1 32 92 193 0.944 13.55 4.00 Prom - 169412 169373 40 -10.45 5.00 Prom + 169806 169845 40 -2.56 5.01 Init + 170960 170967 8 1 2 67 68 10 0.232 -3.32 5.02 Intr + 176387 176619 233 2 2 102 93 533 0.981 52.62 5.03 Intr + 178672 178874 203 2 2 93 100 280 0.638 28.70 5.04 Intr + 179075 179152 78 1 0 49 116 52 0.927 3.85 5.05 Intr + 179297 179363 67 1 1 91 84 130 0.736 11.38 5.06 Intr + 181390 181554 165 2 0 46 103 246 0.976 21.83 5.07 Term + 181845 182002 158 1 2 110 54 288 0.999 25.70 5.08 PlyA + 186563 186568 6 1.05 6.05 PlyA - 189075 189070 6 -0.45 6.04 Term - 191180 191007 174 2 0 51 42 111 0.048 0.56 6.03 Intr - 192661 192565 97 0 1 77 94 23 0.029 1.81 6.02 Intr - 202873 202753 121 2 1 71 87 71 0.133 4.85 6.01 Init - 209133 209025 109 0 1 37 23 160 0.105 2.78 6.00 Prom - 234260 234221 40 -3.36 7.05 PlyA - 234424 234419 6 1.05 7.04 Term - 241101 240851 251 1 2 63 44 136 0.439 2.57 7.03 Intr - 244303 244146 158 0 2 85 110 0 0.056 1.55 7.02 Intr - 263551 263458 94 2 1 72 99 61 0.364 4.62 7.01 Intr - 265731 265630 102 1 0 93 77 47 0.574 4.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 50039 49970 70 2 1 100 71 70 0.917 7.71 S.002 Term + 228978 229042 65 1 2 67 54 98 0.849 2.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:126514450_126796448|GENSCAN_predicted_peptide_1|290_aa XTADNLSLLLEWRLIPANVAMSLFVKTDMEHNRAAAQDSRGSGRPLDTASFQGIESEGEP ELVFWKMRTDALSARFESDQRIGKGKIHGPSEGPFASEAAPALHVEINHRQQGKIKDGLQ VALGTQGHSEPAVAPDLKTSAQTLPPLPALQMLALGGRVDPQLHLLLQRPWQDVWVKSMV APASHGLILRMMMAWPTVGPHSIGICPPPNPTISTALLGIMESEGPNWRVGHSEVAIPDR SWHRMPTSDDASAVDGSRLLGLSPRMLENVYISRHPVDVYLHSLELTGDP >gi568815589f:126514450_126796448|GENSCAN_predicted_CDS_1|873_bp nccaccgcggacaatttgtctctccttctcgagtggaggcttataccagccaatgtggct atgtctctttttgtaaaaacagatatggagcataacagagctgctgcccaggacagccgg ggctcagggagacctctggacactgcatccttccagggcatagagagtgagggtgagcct gagcttgtcttctggaaaatgcggactgatgccctttctgcaaggtttgagagtgaccag aggatcggcaaaggaaagattcatggaccatcagaagggccctttgcctccgaggccgcc ccagcccttcacgtggagataaatcacaggcagcagggaaagataaaagatggcctgcag gtggctctggggacacagggacattcagagcctgccgtggcccctgacctgaagacatca gcccaaaccctcccacctctacctgcactgcagatgctggccctaggtggccgggtggac ccccagctccacctcctactgcagaggccttggcaagatgtttgggtcaagtcaatggtg gcgcctgcatcccatgggttgatattaagaatgatgatggcatggcccacagttggacct cattcgattggcatatgcccaccccccaaccccaccatctccacagctctactgggaata atggagtctgaaggacccaactggagggttgggcactcagaggtggccataccagatagg tcttggcacaggatgcccaccagtgatgatgcctcagctgttgacggcagcagactcctg ggactttctccacggatgttggagaatgtctacatctccaggcacccggttgatgtttac ctacattctctggagctcacgggggacccctag >gi568815589f:126514450_126796448|GENSCAN_predicted_peptide_2|70_aa MAATNPHGSTWPAVHVELLLPVDFLCSDDLGGNGKQLTTISHLKRRLARLPPPYELSGPF ARLFGLILHP >gi568815589f:126514450_126796448|GENSCAN_predicted_CDS_2|213_bp atggctgccactaatccacatggctccacttggcctgctgtccatgtggagctgctcctc cctgtggacttcctgtgctctgatgaccttggaggaaacggcaagcagcttactaccatc tctcacctgaaacggcggctggctcggctgcccccaccctacgagttgtctggtcccttt gccaggctttttggcctcatcctacacccttaa >gi568815589f:126514450_126796448|GENSCAN_predicted_peptide_3|291_aa MPSAPARLRAAASLREGPGPPRSGSRGVSPPLPRAATAPLLPARRRSTIAGGRRNPCPAG AAPPRFQGRGGGERVDGPAGEQPGRRGPQRAPRPMDIATGPESLERCFPRGQTDCAKMLD GIKMEEHALRPGPATLGVLLARILRRRGEHSAEPRLEARGLSTPEPEDWDGLAGAARPLA RRSGERRRQAVIPGGPSPRGRGLWARCDRDAGAGPGGADGRAFALCATGSDCPHPAVCEG CQRPISDRFLMRVNESSWHEECLQCAACQQALTTSCYFRDRKLYCKQDYQQ >gi568815589f:126514450_126796448|GENSCAN_predicted_CDS_3|876_bp atgccctccgcccccgcccggctccgcgccgctgcctcgctccgggaggggccggggccg ccccgcagcggatcgcggggggtcagcccgccgctaccccgggccgcaaccgcccccctg ctccccgcgcgtcgccgctccacgatcgccgggggccggcgcaacccctgccctgcgggg gccgcgcctccccggttccagggccgcggcggcggagagcgggtggacgggccggcgggc gagcagcccggccggcggggtccgcagcgcgccccgcgtcccatggatatagcaacaggt cccgagtcgctggagaggtgcttccctcgcgggcagacggactgcgccaagatgttggac ggcatcaagatggaggagcacgccctgcgccccgggcccgccactctgggggtgctgctg gcgcggattctccggagaaggggagagcacagcgccgagccaaggctcgaggcccgcggc ctctccacgccggagcccgaggactgggacggactagccggggccgcccggcccctggcg cggcggtccggggagcgcaggcggcaggcggtgatcccgggcggcccgagccctcggggc cgagggctgtgggcccggtgcgaccgggacgccggggctgggccgggcggcgctgacggc cgggctttcgccctgtgcgctacaggctccgactgcccgcatcccgccgtctgcgagggc tgccagcggcccatctccgaccgcttcctgatgcgagtcaacgagtcgtcctggcacgag gagtgtttgcagtgcgcggcgtgtcagcaagccctcaccaccagctgctacttccgggat cggaaactgtactgcaaacaagactaccaacagtaa >gi568815589f:126514450_126796448|GENSCAN_predicted_peptide_4|657_aa MRRPRAGRAAAGAAGGGPAVLPAFTQRQLCASPPYGDLRTPPLVTPKGLKGLCQHPHFAD EQTEAQRGERVLLSALLKVTSQYWASQDGLQQTGPRDLNPVLPTPELETFTPCGVASLSK HWQLPKGLSVGKWVHKSKFTHSLDCYVVAARYSKSACTDVRRMKIKLHHGLLRSLQSLST YTQQDRAALTTQCVISVQHIEHAYVKHGCQLNAFHPSRLDSMGTSFREPACSQHLPVGCP PGGGVHEEDPNTTAPFHRVKGLAPGHNRKQQTQDWVSQLWGMGVWQLQVLHSGDRTWVGL AGTPIMWTPSSAQSSPGPDLSPFFLLCRGQMYSVPIHTHTQSRTGLTRPRGDASERVCQG SSAEFWRKVPIWSGVWERWSLGAAELMGETLQPGKRLQQIPPGGGTGHRGPGETSAGPPA PLPGRKNGSHGLVWLFEHSISSRSPHNLEAGAVTRNSAAASPTPAAPHCAEASATAKAQA SAPAAPAGVLLVRAGSGAQARRRRRRASRQTGLGLCGQSAAGKPCKRWDEGGGGGGGGGE KGATGLAEKRVSVGVEQRNSPSEERGRACSREWPTLETCRSASAATLVPGIQRNLPRRGP QPGISIRPLSSGQSRFNGGEALLQAASLFSKALLLQVHIYTDHAQESLGKNGTYVSQ >gi568815589f:126514450_126796448|GENSCAN_predicted_CDS_4|1974_bp atgcggcggccgcgggccgggcgggcggcggcgggagccgcgggcgggggccccgcagtt ctcccagcatttacccagcgccaactgtgcgcaagtcctccctatggggatctgcggacc ccacccctggtaactccgaaaggtctgaaaggtctctgccaacacccccattttgctgat gagcaaactgaggctcagagaggggagagggtactactgagtgccttgctcaaggtcacc agccagtactgggccagccaggatggcctccagcagacaggcccaagggatttgaacccg gtccttccaactccagagctggagaccttcaccccttgtggtgtggcctccctatcaaag cactggcagctgcccaaagggctgtcagtagggaaatgggttcataaatcaaagttcacc cattccctggactgttacgtcgtggctgcaagatacagcaaatctgcgtgtactgatgtg cgcagaatgaaaatcaaactgcatcacggcctgctcagaagtcttcagagtctctccact tacacgcaacaggacagggcagccctgactacccagtgtgttatcagcgtacagcacata gaacatgcgtacgtaaagcacggctgtcagctgaatgcttttcatccttccaggctggat tcgatgggcacttccttcagggagcctgcctgctcccagcatcttccagtgggctgcccc ccaggaggtggagtgcatgaggaagatcctaacaccaccgccccatttcacagggtaaag ggacttgccccaggtcacaacaggaagcaacagacccaggactgggtgtcacagctgtgg gggatgggggtatggcagctccaggtacttcactctggagacaggacatgggtgggattg gcagggacccccatcatgtggacgccaagcagtgcccagtccagtccaggcccagatctg agtcccttctttctcctgtgccgagggcagatgtactctgtgcccatccacacacacaca cagtcacgcacagggctgactaggcccaggggagatgccagtgaacgggtctgccaggga tcaagtgcagagttctggagaaaggtgcccatctggagtggtgtctgggagaggtggtct ctgggagctgctgagctgatgggggagactctccagcctgggaaacgacttcagcagata ccccccggtgggggcacaggacaccgaggacccggagagaccagcgcgggcccaccagcc ccgttacctggcaggaaaaacggctcccatggactggtatggctctttgaacacagcatc tcatctcgctctccccacaacctggaagccggtgcagtgacccggaacagcgcggccgct tctccgacgcccgcggctccccactgcgccgaggcgagcgccacggctaaagcccaagcc tcggcgcccgctgcgcctgccggagttctgcttgtccgagcaggcagcggggctcaggca cggcggcgacgtcgaagggccagtcggcagacagggctgggtctctgcggccagagcgcc gcaggaaagccctgcaaacggtgggacgagggtggcggcggcggcggcggaggtggcgag aaaggcgccacgggcctggcggagaagagggtttccgtgggggtggaacaaaggaattca ccctcggaagaaaggggccgagcatgcagccgggaatggccaaccctggagacctgcagg agtgcgagcgctgcgacattagtcccagggatccaacgaaatctgccccgccgagggccc cagcctggcatctccattcggcctctctcttctggtcaatcccgctttaatgggggtgag gcactactgcaggctgcctccctgttcagtaaggcgctgctcctgcaagttcacatctac actgaccatgctcaggagtctcttggaaaaaatgggacctacgtgtctcagtag >gi568815589f:126514450_126796448|GENSCAN_predicted_peptide_5|303_aa MKRLFAAKCSGCMEKIAPTEFVMRALECVYHLGCFCCCVCERQLRKGDEFVLKEGQLLCK GDYEKEKDLLSSVSPDESDSGRVVSPTVKSEDEDGDMKPAKGQGSQSKGSGDDGKDPRRP KRPRTILTTQQRRAFKASFEVSSKPCRKVRETLAAETGLSVRVVQVWFQNQRAKMKKLAR RHQQQQEQQNSQRLGQEVLSSRMEGMMASYTPLAPPQQQIVAMEQSPYGSSDPFQQGLTP PQMPGDHMNPYGNDSIFHDIDSDTSLTSLSDCFLGSSDVGSLQARVGNPIDRLYSMQSSY FAS >gi568815589f:126514450_126796448|GENSCAN_predicted_CDS_5|912_bp atgaagaggctcttcgcggccaagtgcagcggctgcatggagaagatcgcccccaccgag ttcgtgatgcgggcgctggagtgcgtgtaccacctgggctgcttctgctgctgcgtgtgt gaacggcagctacgcaagggcgacgaattcgtgctcaaggagggccagctgctgtgcaag ggtgactacgagaaggagaaggacctgctcagctccgtgagccccgacgagtccgactcc ggccgggttgtgtcccccacagtgaagagcgaggatgaagatggggacatgaagccggcc aaggggcagggcagtcagagcaagggcagcggggatgacgggaaggacccgcggaggccc aagcgaccccggaccatcctcaccacgcagcagcgaagagccttcaaggcctccttcgag gtctcgtcgaagccttgccgaaaggtccgagagacactggcagctgagacgggcctcagt gtgcgcgtggtccaggtctggtttcagaaccaaagagcaaagatgaagaagctggcgcgg cggcaccagcagcagcaggagcagcagaactcccagcggctgggccaggaggtcctgtcc agccgcatggagggcatgatggcttcctacacgccgctggccccaccacagcagcagatc gtggccatggaacagagcccctacggcagcagcgaccccttccagcagggcctcacgccg ccccaaatgccaggtgaccacatgaacccctatgggaacgactccatcttccatgacatc gacagcgatacctccttaaccagcctcagcgactgcttcctcggctcctcagacgtgggc tccctgcaggcccgcgtggggaaccccatcgaccggctctactccatgcagagttcctac ttcgcctcctga >gi568815589f:126514450_126796448|GENSCAN_predicted_peptide_6|166_aa MRGGRRVSAPVAGSPAAACVAAERCFWEGRQVPGLGAKRNVVGGQGDNKEKVSNKHVSKR IYVIMKFKGRYYDWTCTKSLDNILLDAICRKNVSKAMPPLKYRLKIRAQGCVGESAVSSV SSAHAVRGSQKPLRRRRPRSCLLWPRSGASSVPAAPQRSAGHVGYT >gi568815589f:126514450_126796448|GENSCAN_predicted_CDS_6|501_bp atgcgtggggggcgccgcgtgagtgcgccggtggctgggagtccggcggccgcgtgtgtg gcagcggagcggtgtttctgggaggggcgccaggttccggggctgggggcaaaaaggaat gtagtaggaggacagggtgataataaggagaaggtcagcaacaaacatgtgagcaaaaga atctacgtcataatgaagttcaagggaaggtactatgactggacgtgcacgaaatctctt gacaatatcctgttggatgcaatttgcagaaaaaatgtctccaaagcaatgcctcctctt aaatacaggctcaaaatccgtgcacagggttgcgtgggggaatccgcggtgtcctcggtg agcagcgcccacgcagtgcggggctcacagaagcctctgcggaggcggcgtcctcgctcc tgccttctttggccacgatcaggagcctcctcagtccccgccgcccctcagcgctccgcc ggccacgtgggctacacctga >gi568815589f:126514450_126796448|GENSCAN_predicted_peptide_7|201_aa XNQEDCKRGLRTGAQTKSTTFLFFVARFDFLKRGAQPGHPGTLWGSENTFLWDGLEVVLL QPVQDRMEERQMTLAASYPEATWNTYISLSKLDLMAIPVAKEPGRKITAFWPIVDGGKEL GEGFDSTQRPFPQRPRAPQSRPCSSKAPAGQHPPELRCEVRNGPPKLLLPLLLPDIPHVQ CVIFYDRSYIRHNQIYGSCIL >gi568815589f:126514450_126796448|GENSCAN_predicted_CDS_7|606_bp nacaaccaagaagattgcaaacgggggctcaggacaggagcccaaaccaaatcaaccaca ttcctgttttttgttgcacgtttcgacttcctgaagagaggtgcccaaccaggtcatcca ggcacactgtggggttctgaaaatacatttctctgggatggcctggaggtggtgctgctg cagccggtccaggacaggatggaggaaaggcagatgacactagctgcttcttacccagaa gccacctggaatacttacatttcattgtcaaaactggatctcatggctatcccagttgca aaagagcctgggagaaaaataacagctttctggcctatagtggatggtggcaaggagctt ggggaaggttttgactcgacacagcggcccttccctcagaggcccagggccccacagagc aggccctgctcctccaaagcacctgccggccagcacccaccggaattgcggtgcgaggtt cggaacgggccccccaagctgctcctgcccctgctgcttcccgacatcccccatgttcaa tgcgtgattttttacgatcgatcttatatccggcacaatcagatttatggctcctgcatc ttatga