GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:07:46 Sequence gi568815576r:28958745_29160844 : 202100 bp : 45.67% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5927 5982 56 0 2 63 98 71 0.275 4.30 1.02 Intr + 28332 28457 126 2 0 91 121 54 0.667 9.88 1.03 Intr + 48312 48420 109 2 1 41 90 10 0.001 -3.64 1.04 Intr + 67102 67209 108 2 0 50 42 85 0.241 0.36 1.05 Intr + 71172 71319 148 1 1 45 95 92 0.672 4.89 1.06 Intr + 72636 72723 88 0 1 78 73 33 0.490 0.77 1.07 Intr + 74888 74950 63 1 0 44 99 76 0.457 3.21 1.08 Intr + 77764 77860 97 0 1 101 71 -12 0.045 -2.02 1.09 Intr + 83676 83704 29 1 2 58 116 44 0.896 2.03 1.10 Intr + 83751 83825 75 2 0 75 115 73 0.995 8.41 1.11 Intr + 84555 84686 132 2 0 99 46 223 0.997 20.04 1.12 Intr + 86036 86146 111 1 0 116 72 108 0.998 12.58 1.13 Intr + 87972 88139 168 2 0 84 80 168 0.937 15.74 1.14 Intr + 89645 89747 103 1 1 83 61 112 0.999 7.75 1.15 Intr + 90453 92204 1752 1 0 115 94 1673 0.950 157.92 1.16 Intr + 94411 94505 95 2 2 119 83 -42 0.868 -1.92 1.17 Term + 94835 94933 99 1 0 151 38 73 0.933 6.73 1.18 PlyA + 94994 94999 6 1.05 2.03 PlyA - 96311 96306 6 1.05 2.02 Term - 100438 99998 441 1 0 83 49 299 0.984 20.76 2.01 Init - 100977 100939 39 0 0 80 91 57 0.295 5.43 2.00 Prom - 103326 103287 40 -6.06 3.00 Prom + 103534 103573 40 -6.06 3.01 Init + 114387 114483 97 2 1 76 117 220 0.754 22.17 3.02 Term + 114613 115430 818 2 2 108 44 312 0.742 22.10 3.03 PlyA + 116171 116176 6 1.05 4.03 PlyA - 116716 116711 6 1.05 4.02 Term - 122093 121937 157 1 1 65 41 172 0.633 7.61 4.01 Init - 141437 141424 14 2 2 112 73 23 0.002 1.41 4.00 Prom - 141673 141634 40 -3.56 5.00 Prom + 141771 141810 40 -3.46 5.01 Init + 152045 152097 53 1 2 89 47 54 0.017 2.03 5.02 Intr + 161095 161171 77 1 2 36 94 62 0.012 0.76 5.03 Intr + 162613 162737 125 2 2 94 59 42 0.627 2.20 5.04 Intr + 166519 166672 154 0 1 71 73 131 0.690 9.55 5.05 Intr + 178598 178930 333 0 0 108 116 367 0.997 37.04 5.06 Intr + 179880 180038 159 1 0 103 69 229 0.969 22.46 5.07 Intr + 181538 181622 85 0 1 93 89 79 0.777 7.38 5.08 Intr + 189267 189385 119 0 2 55 22 85 0.355 -1.29 5.09 Term + 192048 192223 176 1 2 40 49 171 0.389 6.32 5.10 PlyA + 194153 194158 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 133344 133452 109 2 1 55 25 110 0.837 1.78 S.002 Intr + 135514 135676 163 2 1 102 110 55 0.987 8.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:28958745_29160844|GENSCAN_predicted_peptide_1|1119_aa XSDEDELLYSSPLRIELQKMHPLGLCNNNDEEDLYEYGWVGVVKLEQPELDPKPCLTVLG KYPRRELLSDDHFAANAGRRGQNDLNNVCGHAAGLGRVDCKPLTAFQPERGIDGIVNTGQ KVVLMIIVTPRMRVAICIEGEWSFLKFLVADYIRFGLKWDFWKLNQDNDFQVLPKAYLPG IGRIFSPVTRVLLVCDLISYGMKSFCRKTGPLGGSSGDEKTMDGFGSQNQQNLLINRPPN PEIKSVDKFACIVCASLSILSFMQPSLIAKANFEKAKRAVQRGATAVIFDVSENPEAIDQ LNQGSEDPLKRPVVYVKGADAIKLMNIVNKQKVARARIQHRPPRQPTEYFDMGIFLAFFV VVSLVCLILLVKIKLKQRRSQNSMNRLAVQALEKMETRKFNSKSKGRREGSCGALDTLSS SSTSDCAICLEKYIDGEELRVIPCTHRFHRKCVDPWLLQHHTCPHCRHNIIEQKGNPSAV CVETSNLSRGRQQRVTLPVHYPGRVHRTNAIPAYPTRTSMDSHGNPVTLLTMDRHGEQSL YSPQTPAYIRSYPPLHLDHSLAAHRCGLEHRAYSPAHPFRRPKLSGRSFSKAACFSQYET MYQHYYFQGLSYPEQEGQSPPSLAPRGPARAFPPSGSGSLLFPTVVHVAPPSHLESGSTS SFSCYHGHRSVCSGYLADCPGSDSSSSSSSGQCHCSSSDSVVDCTEVSNQGVYGSCSTFR SSLSSDYDPFIYRSRSPCRASEAGGSGSSGRGPALCFEGSPPPEELPAVHSHGAGRGEPW PGPASPSGDQVSTCSLEMNYSSNSSLEHRGPNSSTSEVGLEASPGAAPDLRRTWKGGHEL PSCACCCEPQPSPAGPSAGAAGSSTLFLGPHLYEGSGPAGGEPQSGSSQGLYGLHPDHLP RTDGVKYEGLPCCFYEEKQVARGGGGGSGCYTEDYSVSVQYTLTEEPPPGCYPGARDLSQ RIPIIPEDVDCDLGLPSDCQGTHSLGSWGGTRGPDTPRPHRGLGATREEERALCCQARAL LRPGCPPEEAGAVRANFPSALQDTQESSTTATEAADLRLGNTPAPSEITYTPSVHFCHPI AKLLCLKDRDLTQQTAAAREPELRRNSYLEIGNCMETPN >gi568815576r:28958745_29160844|GENSCAN_predicted_CDS_1|3360_bp ngttctgatgaggatgaacttctgtactccagccccctcagaatagagctgcagaagatg cacccactgggcctatgtaataacaatgacgaagaggacttgtatgaatatggctgggta ggagtggtgaagctggaacagccagaattggacccgaaaccatgcctcactgtcctaggc aagtatccccggagagaattgttatcagatgatcattttgctgctaatgcagggaggagg ggacagaacgacttgaataacgtgtgtggtcatgcagctggtctgggcagagtggattgc aagcccctcactgctttccagcccgaaaggggcattgatgggattgtgaacacaggacag aaagtagtattaatgattattgttacaccccggatgcgtgttgccatatgcattgaaggg gagtggtcctttttgaagtttctagttgctgactatataagatttggcctgaagtgggac ttctggaaactcaaccaagataacgacttccaggttcttccaaaagcttatcttccaggc ataggcaggattttctcaccggttacccgtgtgctcttggtgtgtgacctcatttcctat ggaatgaaaagtttctgtcggaaaacagggcccttgggtggcagcagtggggatgaaaaa acgatggatggatttggaagtcagaatcagcaaaatttgctcatcaacagaccgcctaac ccagaaattaaatctgttgacaaatttgcctgtattgtctgtgccagtctatctatcttg tcatttatgcagccaagcctgattgccaaggccaactttgaaaaggccaagcgagcagta cagcggggagctactgcagtcatctttgatgtgtctgaaaacccagaagctattgatcag ctgaaccagggctctgaagacccgctcaagaggccggtggtgtatgtgaagggtgcagat gccattaagctgatgaacatcgtcaacaagcagaaagtggctcgagcaaggatccagcac cgccctcctcgacaacccactgaatactttgacatggggattttcctggctttcttcgtc gtggtctccttggtctgcctcatcctccttgtcaaaatcaagctgaagcagcgacgcagt cagaattccatgaacaggctggctgtgcaggctctagagaagatggaaaccagaaagttc aactccaagagcaaggggcgccgggaggggagctgtggggccctggacacactcagcagc agctccacgtccgactgtgccatctgtctggagaagtacattgatggagaggagctgcgg gtcatcccctgtactcaccggtttcacaggaagtgcgtggacccctggctgctgcagcac cacacctgcccccactgtcggcacaacatcatagaacaaaagggaaacccaagcgcggtg tgtgtggagaccagcaacctctcacgtggtcggcagcagagggtgaccctgccggtgcat taccccggccgcgtgcacaggaccaacgccatcccagcctaccctacgaggacaagcatg gactcccacggcaaccccgtcaccttgctgaccatggaccggcacggggagcagagcctc tattccccgcagacccccgcctacatccgcagctacccacccctccacctggaccacagc ctggccgctcaccgctgcggcctggagcaccgggcctactccccagcccaccccttccgc aggcccaagttgagtggccgcagcttctccaaggcagcttgcttctcccagtatgagacc atgtaccagcactactacttccagggcctcagctacccggagcaggaggggcagtcccca cctagcctcgcaccccggggcccggcccgtgcctttcctccgagcggcagtggcagcctg ctcttccccaccgtggtgcacgtggccccgccctcccacctggagagcggcagcacgtcc agcttcagctgctatcacggccaccgctcggtgtgcagtggctacctggccgactgccca ggcagcgacagcagcagcagcagcagctccggccagtgccactgttcctccagtgactct gtggtagactgcactgaggtcagcaaccagggcgtgtacgggagctgctccaccttccgc agctccctcagcagcgactatgaccccttcatctaccgcagccggagcccctgtcgtgcc agtgaggcggggggctcgggcagctcgggccggggacctgccctgtgcttcgagggctcc ccgcctcccgaggagctcccggcggtgcacagtcatggtgctgggcggggcgagccttgg ccgggccctgcctctccctcgggggatcaggtgtccacctgcagcctggagatgaactac agcagcaactcctccctggagcacagggggcccaatagctctacctcagaagtggggctc gaggcttctcctggggccgcccctgacctcaggaggacctggaaggggggccacgagttg ccgtcgtgtgcctgctgctgcgagccccagccctccccagccgggcctagcgccggagca gctggcagcagcaccttgttcctggggccccacctctacgagggctctggcccggcgggt ggggagccccagtcaggaagctcccagggcttgtacggccttcaccccgaccatttgccc aggacagatggggtgaaatacgagggtctgccctgctgcttctatgaagagaagcaggtg gcccgcgggggcggagggggcagcggctgctacactgaggactactcggtgagtgtgcag tacacgctcaccgaggaaccaccgcccggctgctaccccggggcccgggacctgagccag cgcatccccatcattccagaggatgtggactgtgatctgggcctgccctcggactgccaa gggacccacagcctcggctcctggggtgggacgcgaggcccggataccccacggccccac aggggcctgggagcaacccgggaagaggagcgggctctgtgctgccaggctagggcccta ctgcggcctggctgccctccggaggaggcgggtgctgtcagggccaacttccctagtgcc ctccaggacactcaggagtccagcaccactgccactgaggctgcagatcttaggttaggt aacactccagccccctctgagatcacctacacaccttctgttcacttctgtcatcctatt gctaaattactttgtctcaaggaccgagatctcactcagcagacagcagcagcccgggag cctgagctcaggaggaactcttacctggaaattgggaactgtatggagactccaaactga >gi568815576r:28958745_29160844|GENSCAN_predicted_peptide_2|159_aa MENLAYEGLIGIQSKESSKEKKLTVRQDLEDRYAEHVAATQALPQDSGTAAWKGRVLLPE TQKRQQLSEDTLTIHGLPTEGYQALYHAVVEPMLWNPSGTPKRYSLELGKAIKQKLWEAL CSQGAISEGAQRDRFPGRKQPGVHEEPVLKKWPKLKSKK >gi568815576r:28958745_29160844|GENSCAN_predicted_CDS_2|480_bp atggagaacttggcctacgaaggcctgattggaatccagagtaaagaaagttcaaaggag aaaaaactaacagtccgccaagatcttgaggacagatatgctgaacatgtggctgccacc caagcgctaccccaggacagtgggacagcagcctggaagggccgagtgttgcttcctgaa acccaaaagagacagcagttgtcggaggacacgctaaccatccatggtctccccacagag ggttaccaggctctgtaccacgctgtggtggagccaatgctgtggaatccttcagggacc cccaagaggtacagcctggagctgggcaaggccattaaacaaaagctctgggaggctctt tgcagtcagggtgccatctctgaaggtgctcagagggaccggttccctggcaggaagcag ccaggtgtccacgaggagcctgtactcaagaaatggcccaagttaaagagcaaaaaatag >gi568815576r:28958745_29160844|GENSCAN_predicted_peptide_3|304_aa MAPPAARLALLSAAALTLAARPAPSPGLGPGPGDAPQAGMVPSWDPGYPQARHRRPRARY CPPAAGPGAPQRRPSAQEALSALSLPRPSATPPGLGHLLCLGIWDPLPESLSDPNQAGTP PLPGTSWDPVPSPQRLPPGRDVLCSPVPPRIPSQNPQRLPTGQEAPCSPVPPGIPSPSPQ RPLPGRDDPCSPVPPGIPPQVLHRRTLHRDDSPATRGYTPLSETSSDIPPLGQGPLPEPH CDAPASPSPLLPPTPVEPGASPRRAERRREEPALSPFSRVLGCTRLADIITSASRGTGWT CSPC >gi568815576r:28958745_29160844|GENSCAN_predicted_CDS_3|915_bp atggcgccgccagccgcccgcctcgccctgctctccgccgcggcgctcacgctggcggcc cggcccgcgcctagccccggcctcggccccggacccggcgacgcccctcaggccgggatg gtcccttcctgggacccgggctacccccaggcccgtcatcgacgcccccgggcccggtac tgtcccccggctgcaggacccggtgctcctcagcgacgcccctcagcccaggaggccctc tccgccttgagtcttccaagaccctctgcgacgccccccgggctgggacatcttctctgt ctcgggatctgggacccgctgcccgagtccctcagcgaccccaaccaggccgggacgccc cctctccccggtacctcctgggatcccgtcccaagtccccagcgacttcccccgggccgg gacgtcctctgctccccggtacctcctaggatcccgtcccaaaatccccagcgactcccc acgggccaggaggccccctgctccccggtacctcctgggatcccgtccccaagtccccag cgacccctcccgggccgggacgacccctgctccccagtacctcctgggatcccgccccaa gtccttcatcgacgcaccttgcaccgggacgactcccccgctacaagaggctatacgccc ctctccgagacctccagcgacatccctcccctgggccaaggtcccctccctgagcctcac tgcgacgcccccgcgtcccccagtcctctcctcccgcctacaccggtggaacccggcgcc tccccgcgcagagcagagcggaggcgggaggagccggcgctcagccccttttcccgagtc ctcggctgcacccgcttggcggacattataacttctgcctcgcgaggaacgggatggact tgttcgccctgctag >gi568815576r:28958745_29160844|GENSCAN_predicted_peptide_4|56_aa MAPLQMFSSSIPGLYRLDASSTSLVVTAKMSPSIAKCPLGAKSPPVEITELEEMVK >gi568815576r:28958745_29160844|GENSCAN_predicted_CDS_4|171_bp atggcgccgttgcagatgtttagcagcagcatccctggcctctaccggctagatgccagt agtacctctttagttgtgacagccaaaatgtctccaagtattgcaaaatgtcccctgggg gcaaaatctcctccagttgaaatcactgagttagaagaaatggtcaaataa >gi568815576r:28958745_29160844|GENSCAN_predicted_peptide_5|426_aa MAAVRFGTEVADEQHRVQICEYVTLDGIGDFADMMKDFVIDYPVPGNLGCYKDHGNPPPL TGTSKTSNKLTIQTCISFCRSQRFKFAGMESGYACFCGNNPDYWKYGEAASTECNSVCFG DHTQPCGGDGRIILFDTLVGACGGNYSAMSSVVYSPDFPDTYATGRVCYWTIRVPGASHI HFSFPLFDIRDSADMVELLDGYTHRVLARFHGRSRPPLSFNVSLDFVILYFFSDRINQAQ GFAVLYQAVKEELPQERPAVNQTVAEVITEQANLSVSAARSSKVLYVITTSPSHPPQTVP GWTVYGLATLLILTVTAIVAKILLHVTFKIEVGVKNEEVSGMVLRYLTWVPERREGPLFD IRNVKGARLRTVYQTLCQVTNRHYRIHLCRNPTRELRVREGVQPGAAKVRTQPGTTGSKA GGPYSC >gi568815576r:28958745_29160844|GENSCAN_predicted_CDS_5|1281_bp atggcagctgtcagatttggaacggaagttgcagatgaacagcacagggtgcaaatctgt gaatacgttaccttagatggcataggagattttgcagatatgatgaaagattttgtgata gattatccagtgcctggaaaccttggctgctacaaggatcatggaaacccacctcctcta actggcaccagtaaaacgtccaacaaactcaccatacaaacttgcatcagtttttgtcgg agtcagaggttcaagtttgctgggatggagtcaggctatgcttgcttctgtggaaacaat cctgattactggaagtacggggaggcagccagtaccgaatgcaacagcgtctgcttcggg gatcacacccaaccctgtggtggcgatggcaggatcatcctctttgatactctcgtgggc gcctgcggtgggaactactcagccatgtcttctgtggtctattcccctgacttccccgac acctatgccacggggagggtctgctactggaccatccgggttccgggggcctcccacatc cacttcagcttccccctatttgacatcagggactcggcggacatggtggagcttctggat ggctacacccaccgtgtcctagcccgcttccacgggaggagccgcccacctctgtccttc aacgtctctctggacttcgtcatcttgtatttcttctctgatcgcatcaatcaggcccag ggatttgctgttttataccaagccgtcaaggaagaactgccacaggagaggcccgctgtc aaccagacggtggccgaggtgatcacggagcaggccaacctcagtgtcagcgctgcccgg tcctccaaagtcctctatgtcatcaccaccagccccagccacccacctcagactgtccca ggatggacagtctatggtctggcaactctcctcatcctcacagtcacagccattgtagca aagatacttctgcacgtcacattcaaaatagaggtgggggttaagaatgaggaggtgtct gggatggtgctccggtacctgacatgggtgcctgagcggagagaggggccactcttcgac ataaggaatgtaaaaggagccaggttacgtactgtgtaccagacgctgtgccaagtgacc aacagacattaccgaattcatctttgccgcaacccaacaagggaactgagggtcagagaa ggggtccagccaggggccgccaaggttcggactcagcctgggaccactggctccaaagct gggggtccttactcctgttga