GENSCAN 1.0    Date run: 6-Nov-116    Time: 01:49:00

Sequence gi568815586f:116811038_117128169 : 317132 bp : 46.81% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    920   1015   96  0  0   72   93    91 0.916   7.98
 1.02 Intr +   5353   5471  119  2  2   27   64    39 0.007  -4.42
 1.03 Intr +   7787   7906  120  1  0   59  110    30 0.031   2.99
 1.04 Intr +  22755  22904  150  2  0  119  100   217 0.999  26.36
 1.05 Intr +  24923  24988   66  1  0   96  109    69 0.995   8.90
 1.06 Intr +  25144  25245  102  0  0   72   99   118 0.992  11.67
 1.07 Intr +  38277  38402  126  2  0   84   43   138 0.520   9.78
 1.08 Intr +  41544  41667  124  2  1   11   89   145 0.426   7.06
 1.09 Intr +  48262  48293   32  2  2  128   87    -9 0.352   0.85
 1.10 Intr +  48695  49019  325  1  1   59   54   160 0.669   5.05
 1.11 Intr +  51575  51662   88  0  1   57   97    44 0.386   1.23
 1.12 Term +  68921  69014   94  2  1  127   49    50 0.033   2.30
 1.13 PlyA +  69157  69162    6                               1.05

 2.02 PlyA -  69633  69628    6                               1.05
 2.01 Sngl -  70270  69995  276  1  0   70   49   254 0.405  13.70
 2.00 Prom -  71914  71875   40                              -2.96

 3.00 Prom +  86583  86622   40                              -5.86
 3.01 Init + 100001 100318  318  1  0   79  105   427 0.753  40.84
 3.02 Intr + 116986 117090  105  0  0  129  110    32 0.975   9.91
 3.03 Intr + 134327 134491  165  1  0  102   86   178 0.958  19.16
 3.04 Intr + 138581 138669   89  1  2  105   86    85 0.841   8.77
 3.05 Intr + 153660 153817  158  0  2   71  115   203 0.748  20.95
 3.06 Intr + 174169 174365  197  2  2   80   93   206 0.996  19.43
 3.07 Intr + 177626 177832  207  1  0   85   81   119 0.750  10.17
 3.08 Intr + 199114 199242  129  0  0   43   70   106 0.704   5.19
 3.09 Intr + 199286 199413  128  1  2  100   94   125 0.784  13.78
 3.10 Intr + 213110 213283  174  2  0   78   90   269 0.753  25.15
 3.11 Intr + 216357 216467  111  0  0   88   88   182 0.985  17.69
 3.12 Term + 216991 217135  145  1  1  133   50   168 0.999  14.88
 3.13 PlyA + 217988 217993    6                               1.05

 4.05 PlyA - 218583 218578    6                              -0.45
 4.04 Term - 219166 219096   71  2  2   58   49    67 0.127  -2.10
 4.03 Intr - 222102 221907  196  0  1   55   37    89 0.204  -0.51
 4.02 Intr - 222345 222158  188  1  2  113   91   103 0.648  12.51
 4.01 Init - 227084 226991   94  2  1  100   69    63 0.569   6.32
 4.00 Prom - 227496 227457   40                              -6.26

 5.18 PlyA - 227916 227911    6                               1.05
 5.17 Term - 228173 228096   78  2  0  109   46    92 0.983   4.86
 5.16 Intr - 230957 230910   48  2  0  112   98   135 0.944  15.78
 5.15 Intr - 235629 235522  108  0  0   72   89   241 0.996  23.08
 5.14 Intr - 235801 235704   98  2  2   86   11   188 0.931  10.63
 5.13 Intr - 238121 237982  140  1  2  110   38   366 0.945  33.91
 5.12 Intr - 245849 245769   81  1  0   25   80   203 0.005  11.95
 5.11 Intr - 248044 247923  122  1  2   93  105   -56 0.005  -4.11
 5.10 Intr - 249825 249666  160  2  1   60   80   136 0.863   9.99
 5.09 Intr - 250368 250157  212  0  2  100   45    69 0.529   1.51
 5.08 Intr - 254253 254145  109  2  1  115   33    28 0.188   0.29
 5.07 Intr - 264303 264234   70  1  1   74  103    70 0.749   5.34
 5.06 Intr - 270002 269752  251  1  2   60   44   168 0.092   6.68
 5.05 Intr - 273913 273867   47  1  2   83   75    16 0.231  -2.99
 5.04 Intr - 284446 284346  101  2  2   83   86    30 0.914   2.13
 5.03 Intr - 288062 287889  174  0  0   60   51   118 0.967   5.21
 5.02 Intr - 288433 288188  246  2  0    4   68   225 0.779   9.53
 5.01 Init - 288933 288855   79  0  1   74   57   115 0.540   6.22

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  17104  17038   67  1  1   77  110    87 0.936   9.17
S.002 Term + 117318 117383   66  2  0   81   49    65 0.859  -0.16

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:116811038_117128169|GENSCAN_predicted_peptide_1|480_aa
XIYPRVSEFTGSNSDASSVYMTDDVTWLQEEPAWPQGLCQLAPGTIAPEAIAASSSPSSP
EEQREDLYRKLGWSVMWNRTQHCGGTPQVGPLPLFTPEQGGTGVKVQRFGVQGKFYLVIE
ELSQLFRSLVPIQLWYKYIMGDDSSNSYFLGGVLIVLYSLCKSFDICGRVGGVRKALKLL
CTSQNYGVRATGQQCTEAGDICAICQAEFREPLILLCQHVFCEECLCLWLDRERTCPLCR
SVAVDTLRCWKDGATSAHFQEKPKGSNRNFCNWFLSERSSCLQMLLKGHKKLELEKIDES
AGGQHAPTTILKGYLGKHKERILKGRQKEIQLPACSSTERLYILKDLVIYSKAYGGIIPI
SAASTLTNVDPTAGFLELYSKACGGIIPISAASTLNMDPTAGVLELERCLLTSLLDPLSI
GKRKFSKVDCEDGCKILCILLIIDLYTLKSAPRYSRIQSSKTMLTLCASSKRLAQCQAHS

>gi568815586f:116811038_117128169|GENSCAN_predicted_CDS_1|1443_bp
nctatataccctcgggtgagtgagttcaccgggtcaaactctgatgcctcttctgtgtac
atgactgatgatgttacctggctccaggaggagcccgcctggccccaaggactctgccag
ctggcccctgggaccatagcaccagaggcgatcgcggcctcctcctccccctcctcgcca
gaagagcagagagaggatctttataggaaattggggtggtctgtcatgtggaacaggacc
cagcattgcgggggcacgccgcaagtgggaccgttaccactgtttacaccagagcaagga
ggaactggggtaaaggtgcagaggtttggggtacagggaaagttctatctggtcatcgag
gagctgagccagctgttccgatcccttgtccccatccagctgtggtacaaatacatcatg
ggtgacgactcctccaacagctacttcctgggcggggtcctgatcgttctctacagcctc
tgcaagtccttcgacatctgtggacgtgtgggcggagttaggaaagccctgaagcttctc
tgtacctctcagaactatggagtccgagccaccgggcagcagtgcacagaagctggtgac
atctgcgccatctgtcaggccgagttccgagagcctctgattctcctgtgccagcacgtg
ttctgtgaggagtgcctctgcctgtggctggaccgtgagcgcacctgcccgctctgccgc
tcggtcgccgtggacaccctgcgctgctggaaggacggcgccacgtccgcacacttccag
gaaaaaccaaagggaagcaacaggaacttctgcaactggtttttatcggaaagatcatcc
tgcctgcagatgctgttgaaggggcacaagaaattggagctggagaagattgatgaaagt
gcaggtggtcagcatgcccccactacgatcttgaagggctatttaggaaaacacaaggaa
cggattttaaaaggaagacagaaagaaatccagctcccagcttgcagcagcactgagagg
ttgtacattctaaaggacctggtcatttacagcaaagcctatggagggataattcccatt
tcagccgcatccaccttaacaaatgtggatcccactgctggcttcttagaactgtacagc
aaagcctgtggggggataattcccatttcagccgcatccaccttaaacatggatcccact
gctggcgtcttagaactggagcggtgtctgctaaccagcctgctggaccctctgtctata
gggaaacgaaaattttccaaagttgactgtgaagatggctgcaaaattctatgtatactc
ctcatcattgacttgtacaccttaaaaagcgctcctcgctactccaggatccagagtagt
aagacgatgcttacgctgtgtgcttcctcaaagcgcctagcacagtgccaggcgcacagc
tag

>gi568815586f:116811038_117128169|GENSCAN_predicted_peptide_2|91_aa
MCPCPLHRGRGPPAVCACSAGRLGLRSSAAQLTAARLKALGDELHQRTMWRRRARSRRAP
APGALPTYWPWLCAAAQVAALAAWLLGRRNL

>gi568815586f:116811038_117128169|GENSCAN_predicted_CDS_2|276_bp
atgtgcccgtgccccctgcaccgcggccgcggccccccggccgtgtgcgcctgcagcgcg
ggtcgcctggggctgcgctcgtccgccgcgcagctcaccgccgcccggctcaaggcgcta
ggcgacgagctgcaccagcgcaccatgtggcggcgccgcgcgcggagccggagggcgccg
gcgcccggcgcgctccccacctactggccttggctgtgcgcggccgcgcaggtggcggcg
ctggcggcctggctgctcggcaggcggaacttgtag

>gi568815586f:116811038_117128169|GENSCAN_predicted_peptide_3|641_aa
MDDYSLDEFRRRWQEELAQAQAPKKRRRPEAAERRARRPEVGSGRGEQASGDPALAQRLL
EGAGRPPAARATRAEGQDVASRSRSPLAREGAGGGEQLVDQLIRDLNEMNDVPFFDIQLP
YELAINIFQYLDRKELGRCAQVSKTWKVIAEDEVLWYRLCQQEGHLPDSSISDYSCWKLI
FQECRAKEHMLRTNWKNRKGAVSELEHVPDTVLCDVHSHDGVVIAGYTSGDVRVWDTRTW
DYVAPFLESEDEEDEPGMQPNVSFVRINSSLAVAAYEDGFLNIWDLRTGKYPVHRFEHDA
RIQALALSQDDATVATASAFDVVMLSPNEEGYWQIAAEFEVPKLVQYLEIVPETRRYPVA
VAAAGDLMYLLKAEDSARTLLYAHGPPVTCLDVSANQVAFGVQGLGWVYEGSKPCPYTIS
NLKKIRKLSGRTSSRELSDKDHALHDGEMKVFDVGLILVYSLEAGRRLLKLGNVLRDFTC
VNLSDSPPNLMVSGNMDGRVRIHDLRSGNIALSLSAHQLRVSAVQMDDWKIVSGGEEGLV
SVWDYRMNQKLWEVYSGHPVQHISFSSHSLITANVPYQTVMRNADLDSFTTHRRHRGLIR
AYEFAVDQLAFQSPLPVCRSSCDAMATHYYDLALAFPYNHV

>gi568815586f:116811038_117128169|GENSCAN_predicted_CDS_3|1926_bp
atggacgactacagcctggatgagttccgtcggcgctggcaggaggagctggcgcaggcc
caggcgccgaagaagcggcgacggcccgaggctgccgagaggcgggctcggcggccggag
gtgggctccgggcgcggcgaacaggcctcgggggacccggcgctggcccagcgtctcctg
gagggcgcggggaggcccccggcggcgcgggcgactcgggccgaggggcaggacgtagcg
agccgctcacgttctcctctggcccgcgagggcgccgggggcggggagcagctggtggac
cagctcatccgcgacctgaatgaaatgaatgatgtgcctttctttgatatccaactgcct
tacgaattggcaatcaatatatttcagtatctggacaggaaagaactaggaagatgtgca
caggtgagcaagacgtggaaggtgattgcagaggatgaggtgctgtggtacaggctgtgc
cagcaggaagggcaccttccggatagcagcatctctgactattcttgctggaagctcatc
ttccaagagtgccgagccaaggaacacatgttacgaaccaactggaagaatcgcaaaggt
gccgtgagcgagctggagcatgttcctgacacagttttgtgtgatgtgcattctcacgat
ggtgtggtcattgcgggatatacatcaggggatgtgagagtgtgggacacccgcacctgg
gactacgtagcccccttcctggaatcagaggacgaggaggatgagcctggaatgcagcca
aatgtctcctttgtgaggataaacagctcgttggcagtagcagcttatgaggatgggttt
cttaatatttgggatttaaggaccggaaagtaccctgttcatcgttttgagcacgatgca
agaatacaggcactagccctcagccaggacgatgcaaccgtggccacagcttctgctttt
gatgtcgtgatgttatcccccaatgaggaggggtactggcagatagctgcggaatttgaa
gttccgaaactggttcagtaccttgaaatagttccagaaaccagaaggtaccctgtggca
gtagccgctgctggagatctgatgtacctgctcaaagccgaagactccgccagaaccctc
ctttacgcccacggcccgcctgtcacatgtctagacgtctcggccaaccaagttgctttt
ggtgtacagggtctgggatgggtgtacgaaggaagcaagccatgtccctatactatcagt
aacctgaaaaagatcaggaaattaagtggcagaacatcttcacgggagctcagtgataag
gatcatgccctccacgatggtgaaatgaaagtatttgatgtcggcctgatcctggtgtat
agcctggaagcaggacgccgcctcttgaagctgggtaacgttctccgtgacttcacgtgt
gtcaacctcagcgacagccctcccaacctcatggtcagtggcaacatggacgggagggtg
aggatccacgacctccgcagtggtaacatcgccctgtcgctctccgcccatcagctcagg
gtctctgctgtgcagatggatgactggaagatcgtcagtggaggcgaggaaggcctggtg
tccgtgtgggattatcggatgaaccagaagctgtgggaggtgtattccgggcacccggtg
cagcacatctcattcagcagccacagcctcatcacggccaacgtgccttaccagacggta
atgcgaaacgccgacctggacagcttcactactcacaggagacaccgggggctgatccgc
gcctatgagtttgcggtggaccagctggccttccagagccctctccctgtctgccgttca
tcctgtgacgccatggccactcactactacgacctcgcactggcctttccctataaccat
gtttag

>gi568815586f:116811038_117128169|GENSCAN_predicted_peptide_4|182_aa
MAVPLALSGSPGPVAGPRALQPSSKQRQWKLGKDTAPSAEQSNGCFWKPTLGIRGRKWAG
DNMAPARAQQEGICHRTLEAALVQVATGPCSGSQHLGPAGQRRELTAAAAAAQSEGSVRQ
WETALLEPLWWPAWNSRGTVVTRRLDPHQGKPFLSNCKQIGDEYGMAAYLKQSEKPSRTF
MT

>gi568815586f:116811038_117128169|GENSCAN_predicted_CDS_4|549_bp
atggctgtgcccctggctctttctggcagtcctgggccagttgcagggcccagggcccta
cagccttccagcaagcagcggcagtggaagctgggaaaagacacggcccccagtgcagag
cagtcaaatggctgcttctggaagcccacgctgggcatcagaggccgaaagtgggctggg
gacaacatggcacccgccagagcccagcaagagggaatctgccatcgcacattagaggca
gcccttgtccaagtcgccacaggcccctgcagcggtagccagcacctcggcccggccggc
cagcgccgtgagctcactgctgctgcagctgctgctcagagcgaaggctccgtcaggcag
tgggagacggccctgttggagcccctgtggtggcccgcctggaatagccgcgggacagtt
gttacgagaaggctggaccctcaccaggggaagccgtttctcagcaactgtaaacaaatt
ggagatgaatatggaatggctgcgtatcttaagcaatctgaaaaaccttccagaaccttc
atgacgtga

>gi568815586f:116811038_117128169|GENSCAN_predicted_peptide_5|707_aa
MRPARSCSLRCLSATGLCRAHTGRHVGPRRAALRRYMRPRSGPTRNPRLRAFAGVPTRGR
TRGQSRRCAAEASAGPERDARPGAPAAGTMGAAHSASEEVRELEGKTGFSSSFKWVPAGH
LAGCGEELRGDAQCRSRSASDWPGNSLAGGLRTGEGAGPRLPKRRAGLCTKAVPLPGALF
SFDVYNQVFTYGSLSLSPPRQAPSKPEVLRGPCKHRLKSKTRNNEVQLPGTKAHSAHHPH
GNCDLRALEKGSEVRSKETGKATNPGALEEKAGPSVGFGALMQADMHQVLMRGALLGAQV
SSDQIEQLHRRFKQLSGDQPTIRPCPHPAAMVSGLGYPNSLLPGSPVSVFTVHPPSRLTG
LDTWALGKAPGEGQCLRMGQTKREALRAAIFIIRFIIRSANHLLTTLPSPAVQGTEDGVL
WTSQGPHPARFPLELHVVPFSGIWALENGFYSDGQGESGSRVSGKSMAVRYLASKLTLPC
PHWWADTLCLPYLGLTPFSLGSPACVLGPSQDTARGGPWSRRKSKENFNNVPDLELNPIR
SKIVRAFFDNRNLRKGPSGLADEINFEDFLTIMSYFRPIDTTMDEEQVELSRKEKLRFLF
HMYDSDSDGRITLEEYRNVKYAPLARPGLLVVEELLSGNPHIEKESARSIADGAMMEAAS
VCMGQMEPDQVYEGITFEDFLKIWQGIDIETKMHVRFLNMETMALCH

>gi568815586f:116811038_117128169|GENSCAN_predicted_CDS_5|2124_bp
atgcgcccagcccgctcctgcagtttgaggtgcctctcagccaccgggctgtgcagggcc
cacacgggccgccacgtgggaccgcggcgcgccgccctccgccgttatatgaggccccgc
tccggccccacgcggaacccgcggctccgagccttcgccggcgtcccgacccgaggccgg
acccgaggccagtcccgccgctgcgcagccgaagccagtgcggggcctgagagggacgcg
cgccccggggcccccgccgcgggcaccatgggcgctgcccactccgcgtctgaggaggtg
cgggagctcgagggcaagaccggcttctcctcctctttcaaatgggtccccgctggccac
ctcgcgggctgcggcgaggagctccgcggcgatgctcagtgccgcagccgctcagcttcg
gactggcccgggaactccctggccgggggactgcggaccggggagggagcggggcccagg
cttcccaagcgcagggcaggcctttgcacaaaagctgttcctctgcctggagcactgttt
tcatttgatgtgtacaatcaggtgtttacgtacgggagcctttctctgtctccacccagg
caagcacctagcaaaccagaggtgctcagagggccatgcaagcacaggttgaaatctaaa
accagaaataatgaggtccagctgccaggaaccaaagcccattctgcccatcatccgcat
ggcaactgtgacctcagggccctggagaagggctcagaggtgaggagcaaggaaactgga
aaagctaccaacccaggagcccttgaggaaaaggcaggtccttccgtgggctttggggcc
ctgatgcaggcagacatgcatcaggtgcttatgcgaggggccctgttgggggcccaggtc
tcatcggatcagatcgagcagctccatcggagatttaagcagctgagtggagatcagcct
accattcgcccctgccctcatccagctgccatggtctctggcctgggctaccccaacagc
ctcctccctggttcccctgtgtctgtcttcactgtccatccaccatccaggctcacgggt
ctggacacctgggccctggggaaggcacctggggaagggcagtgcctcaggatgggccag
acaaagagggaagccctgcgtgcagccatcttcattattcgcttcattattcgctcagca
aaccatttattgaccaccttgccaagcccagcagtgcaaggcactgaggacggggtgctg
tggacatcacagggtccccatcctgcaagattccctctggagctccacgtggtccctttc
tcagggatctgggccttggaaaatggattctattcggatggccagggagagtcgggcagc
agggtatctggtaagagcatggctgtacgctacctggcttccaagctgactcttccatgc
ccccactggtgggctgataccctctgccttccctacctgggcctgacacctttttcccta
ggatcaccagcctgtgtgctgggtccttcccaagacactgcccgtggaggcccatggagc
agaaggaaaagcaaggagaacttcaacaatgtcccggacctggagctcaaccccatccga
tccaaaattgttcgtgccttcttcgacaacaggaacctgcgcaagggacccagtggcctg
gctgatgagatcaatttcgaggacttcctgaccatcatgtcctacttccggcccatcgac
accaccatggacgaggaacaggtggagctgtcccggaaggagaagctgagatttctgttc
cacatgtacgactcggacagcgacggccgcatcactctggaagaatatcgaaatgtaaag
tatgctcctctggcccgcccggggctcctggtggtcgaggagctgctgtcgggaaaccct
cacatcgagaaggagtccgctcgctccatcgccgacggggccatgatggaggcggccagc
gtgtgcatggggcagatggagcctgatcaggtgtacgaggggatcaccttcgaggacttc
ctgaagatctggcaggggatcgacattgagaccaagatgcacgtccgcttccttaacatg
gaaaccatggccctctgccactga