GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:36:06 Sequence gi568815591f:129089152_129312448 : 223297 bp : 43.83% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2508 2643 136 2 1 57 64 90 0.528 3.64 1.02 Intr + 14063 14219 157 0 1 53 44 57 0.070 -3.13 1.03 Intr + 18812 18944 133 2 1 125 46 51 0.146 5.25 1.04 Intr + 29080 29208 129 0 0 65 99 44 0.363 4.09 1.05 Term + 37166 38509 1344 1 0 56 33 2285 0.809 211.30 1.06 PlyA + 39282 39287 6 1.05 2.00 Prom + 41979 42018 40 -5.06 2.01 Init + 55830 55931 102 2 0 112 92 183 0.999 19.52 2.02 Intr + 58436 58498 63 1 0 100 49 40 0.504 0.21 2.03 Intr + 59233 59335 103 0 1 111 101 24 0.984 5.75 2.04 Intr + 60550 60749 200 2 2 28 47 138 0.578 2.77 2.05 Intr + 72528 72585 58 2 1 131 99 24 0.731 6.26 2.06 Intr + 73243 73370 128 2 2 67 113 159 0.990 16.70 2.07 Intr + 73682 73756 75 1 0 93 87 135 0.953 13.61 2.08 Intr + 75323 75520 198 1 0 103 95 207 0.954 22.45 2.09 Intr + 77250 77402 153 2 0 62 103 12 0.554 0.37 2.10 Intr + 77627 77755 129 1 0 75 99 52 0.979 5.89 2.11 Intr + 78248 78409 162 1 0 66 93 170 0.976 15.47 2.12 Term + 78622 78723 102 0 0 91 54 126 0.807 7.78 2.13 PlyA + 79659 79664 6 1.05 3.00 Prom + 80922 80961 40 -6.16 3.01 Init + 100001 100331 331 1 1 87 114 530 0.999 50.97 3.02 Intr + 114233 114438 206 0 2 84 113 226 0.277 23.62 3.03 Intr + 116052 116261 210 2 0 67 53 473 0.999 40.81 3.04 Intr + 116459 116631 173 1 2 100 63 282 0.999 25.64 3.05 Intr + 116999 117218 220 2 1 74 80 288 0.999 24.90 3.06 Intr + 117313 117436 124 0 1 116 105 234 0.997 28.06 3.07 Intr + 119608 119700 93 2 0 87 92 157 0.997 15.94 3.08 Intr + 120138 120246 109 1 1 133 68 229 0.999 24.74 3.09 Intr + 121212 121397 186 0 0 72 80 333 0.974 29.70 3.10 Intr + 121814 121962 149 2 2 116 80 247 0.999 26.58 3.11 Intr + 122485 122619 135 2 0 97 80 91 0.999 9.74 3.12 Term + 122873 123300 428 0 2 90 52 358 0.986 27.87 3.13 PlyA + 124374 124379 6 1.05 4.00 Prom + 132900 132939 40 -7.76 4.01 Init + 135836 136288 453 1 0 100 79 573 0.693 52.76 4.02 Intr + 161675 161699 25 1 1 92 95 -3 0.012 -1.60 4.03 Intr + 171299 171400 102 0 0 97 52 93 0.454 6.75 4.04 Intr + 194039 194130 92 0 2 84 66 58 0.007 2.91 4.05 Term + 199067 199114 48 1 0 109 47 36 0.523 -1.10 4.06 PlyA + 200941 200946 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 193981 194130 150 0 0 88 66 90 0.942 7.04 S.002 Sngl - 212335 212057 279 1 0 38 38 163 0.811 1.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:129089152_129312448|GENSCAN_predicted_peptide_1|632_aa LSVAISPSCGRKRDSAWLGEQTPGADCNKDRKLFLDPRQVQGGKKAKEGESEPLFHGIRF PGSGFDLCANFQGGLSIMMLSTSSDALGGGLTEMVDLWTPVILESGPILISGPHPNLTNY IFKDTVSKPGYILRFRMDTNFELFEAIYCYIQSSYSGIEQGSPAPEPWSHNNGLWPVRNR ATQQERRRLARLSPSAPRLPSDQARSAARPRPPRRRPGPRSQPRARVGAERSRPAHYGPR LEAPPPPARGARMLARRKPMLPALTINPTIAEGPSPTSEGASEANLVDLQKKLEELELDE QQKRLEAFLTQKAKVGELKDDDFERTSELDAGNGGVVTKVQHRPSGLIMARKLIHLEIKP AIRNQIIREHQVLHECNSPYIVGFYGAFYCDREISICMEHMDGGSLDQGLKEAKRIPEDI LGKVSIAVLRGLAYLREKHQIMHRNVKPSNILVNSRGEIKLCDFGVSGQLIDSMANSFVG TRSYMAPERLQGTHYSVQSVIWSMDLSLVELAIERYPIPPPDAKELEAIFGQPVVDREEG EPHSISSWPGSPGRPNSGYGMDSLPAMAIFELLDYIVKEPPPKLPNGVFTPDFQEFVNKC LIKNPTERADLKMLTNHAFIKRSEVKEADFAC >gi568815591f:129089152_129312448|GENSCAN_predicted_CDS_1|1899_bp ctcagtgtggccatatcgccctcttgcggtagaaagcgggactccgcctggctaggggaa cagacgccaggggctgactgtaacaaagaccggaagctcttcttggatccaaggcaggtg caaggagggaaaaaagctaaggaaggagagagtgagcctctctttcatggcattcggttt cctggctctggatttgacctatgcgcaaacttccagggtggcttgagcataatgatgcta tcaaccagctcagatgccctaggaggtggccttacagaaatggtagacttgtggacacct gtcatattggagtcaggacccatcctgatctcaggacctcatcctaacttaactaattat atcttcaaagacactgtttccaaaccgggctacattctgaggttccgcatggacacgaat tttgagctatttgaagctatatattgttacatccagtcatcctacagtggtatagaacag gggtccccagcccctgagccatggtcgcataacaacggtctatggcctgttaggaaccgg gccacacagcaggagcggcggcggctcgctcgcctcagccccagcgcccctcggctaccc tcggaccaggcccgcagcgccgcccgccctcggccgccccgacgccggcctgggccgcgg tcgcagccccgggctcgcgtaggcgccgaacgctcccggcccgcccactacgggcccagg ctagaggcgccgccgccaccggcccgcggagcccggatgctggcccggaggaagccgatg ctgccggcgctcaccatcaaccctaccatcgccgagggcccgtccccaaccagcgagggc gcctccgaggcaaacctggtggacctgcagaagaagctggaggagctggaacttgacgag cagcagaagcggctggaagcctttctcacccagaaagccaaggtcggcgaactcaaagac gatgacttcgaaaggacctcagagctggacgcgggcaacggcggggtggtcaccaaagtc cagcacagaccctcgggcctcatcatggccaggaagctgatccaccttgagatcaagccg gccatccggaaccagatcatccgcgagcaccaggtcctgcacgagtgcaactcaccgtac atcgtgggcttctacggggccttctactgtgacagggagatcagcatctgcatggagcac atggatggcggctccctggaccaggggctgaaagaggccaagaggattcccgaggacatc ctggggaaagtcagcattgcggttctccggggcttggcgtacctccgagagaagcaccag atcatgcaccgaaatgtgaagccctccaacatcctcgtgaactctagaggggagatcaag ctgtgtgacttcggggtgagcggccagctcatcgactccatggccaactccttcgtgggc acgcgctcctacatggctccggagcggttgcagggcacacattactcggtgcagtcggtc atctggagcatggacctgtccctggtggagctggccatcgaaaggtaccccatccccccg cccgacgccaaggagctggaggccatctttggccagcccgtggtcgacagggaagaagga gagcctcacagcatctcctcttggccagggtcccccgggcgccccaacagcggttacggg atggacagcctgcccgccatggccatcttcgaactgctggactatattgtgaaagagccg cctcctaagctgcccaacggtgtgttcacccccgacttccaggagtttgtcaataaatgc ctcatcaaaaacccaacggagcgggcggacctaaagatgctcacaaaccacgccttcatc aagcggtccgaggtgaaagaagcggattttgcctgctag >gi568815591f:129089152_129312448|GENSCAN_predicted_peptide_2|490_aa MARRPRAPAASGEEFSFVSPLVKYLLFFFNMLFWTSTILIKFCSSAQEAPMDEMMTSGSR GWARPYSLLDPQCPAVFGINWVLTECLLDEQSQPWPEEDAKGLAPDIVEEGALWTPPLQG DSVPQSRGRALTASLPWTAATSDTHNSSAMESTERPVISMVMVAVGVYARLMKHAEAALA CLAVDPAILLIVVGVLMFLLTFCGCIGSLRENICLLQTFSLCLTAVFLLQLAAGILGFVF SDKARGKVSEIINNAIVHYRDDLDLQNLIDFGQKKVWVSQWSGGLWVKVNVIPRDASPSM PVGLFITCQEKFFSTDTVIISANTFDHFHLPAKPETSLKKIIAEYVKQNICTTVIPTSTV FSCCGGISYKDWSQNMYFNCSEDNPSRERCSVPYSCCLPTPDQAVINTMCGQGMQAFDYL EASKVIYTNGCIDKLVNWIHSNLFLLGGVALGLAIPQLVGILLSQILVNQIKDQIKLQLY NQQHRADPWY >gi568815591f:129089152_129312448|GENSCAN_predicted_CDS_2|1473_bp atggcgcggagaccccgggcgccggccgcctccggggaggagttctccttcgtcagcccg ctggtgaaatacctgctcttcttcttcaacatgctcttctggacttccaccatcctaatc aaattctgctcttctgcccaggaagcacctatggatgagatgatgacttcgggctctcga gggtgggcacggccctattcacttctggatccccagtgcccagctgtgtttggcataaat tgggtgcttactgaatgtttattggatgaacaaagccagccgtggcctgaggaagatgcc aaaggcttggccccggacattgttgaggaaggagccctgtggactcctccactccagggg gacagtgtcccccaatccagagggagagcgctgacagccagcctcccatggacagccgcc acctcagacacccataacagcagcgccatggagagcacagaaagaccagtgatttccatg gtgatggtggctgtgggtgtctacgctcggctaatgaagcatgcagaagcagccctagcc tgcctggcagtggaccctgccatcctgctgatcgtggtgggtgtcctcatgttcctgctc accttctgtggctgcattgggtccctccgcgagaacatctgcctcctgcagacgttctcc ctctgcctcaccgctgtgttcctgctgcagctggccgctgggatcctgggcttcgtcttc tcagacaaggctcgagggaaagtgagtgagatcatcaacaatgccattgtgcactaccga gatgacttggatctgcagaacctcattgattttggccagaaaaaggtatgggtcagccag tggtctgggggactgtgggtaaaagtgaatgtcatcccaagagatgcctcaccctctatg cctgtggggctcttcattacctgccaggagaagtttttcagcactgatactgtaattatt tcagccaatacttttgatcacttccatttgcctgcaaagccagaaacttctctcaagaag attatagctgagtatgtaaaacaaaatatatgtacaactgtaataccaacaagcacagtg tttagctgctgtggagggatttcctacaaggactggtctcagaacatgtatttcaactgc tcagaagacaaccccagtcgagagcgctgctctgtgccttactcctgttgcttgcctact cctgaccaggcagtgatcaacactatgtgtggccaaggtatgcaggcctttgactacttg gaagctagcaaagtcatctacaccaatggctgtattgacaagttggtcaactggatacac agcaacctattcttacttggtggtgtggctctaggcctggccatcccccagctggtggga attctgctgtcccagatcctagtgaatcagatcaaagatcagatcaagctacagctctac aaccagcagcaccgggctgacccatggtactga >gi568815591f:129089152_129312448|GENSCAN_predicted_peptide_3|787_aa MAAARPARGPELPLLGLLLLLLLGDPGRGAASSGNATGPGPRSAGGSARRSAAVTGPPPP LSHCGRAAPCEPLRYNVCLGSVLPYGATSTLLAGDSDSQEEAHGKLVLWSGLRNAPRCWA VIQPLLCAVYMPKCENDRVELPSRTLCQATRGPCAIVERERGWPDFLRCTPDRFPEGCTN EVQNIKFNSSGQCEVPLVRTDNPKSWYEDVEGCGIQCQNPLFTEAEHQDMHSYIAAFGAV TGLCTLFTLATFVADWRNSNRYPAVILFYVNACFFVGSIGWLAQFMDGARREIVCRADGT MRLGEPTSNETLSCVIIFVIVYYALMAGVVWFVVLTYAWHTSFKALGTTYQPLSGKTSYF HLLTWSLPFVLTVAILAVAQVDGDSVSGICFVGYKNYRYRAGFVLAPIGLVLIVGGYFLI RGVMTLFSIKSNHPGLLSEKAASKINETMLRLGIFGFLAFGFVLITFSCHFYDFFNQAEW ERSFRDYVLCQANVTIGLPTKQPIPDCEIKNRPSLLVEKINLFAMFGTGIAMSTWVWTKA TLLIWRRTWCRLTGQSDDEPKRIKKSKMIAKAFSKRHELLQNPGQELSFSMHTVSHDGPV AGLAFDLNEPSADVSSAWAQHVTKMVARRGAILPQDISVTPVATPVPPEEQANLWLVEAE ISPELQKRLGRKKKRRKRKKEVCPLAPPPELHPPAPAPSTIPRLPQLPRQKCLVAAGAWG AGDSCRQGAWTLVSNPFCPEPSPPQDPFLPSAPAPVAWAHGRRQGLGPIHSRTNLMDTEL MDADSDF >gi568815591f:129089152_129312448|GENSCAN_predicted_CDS_3|2364_bp atggccgctgcccgcccagcgcgggggccggagctcccgctcctggggctgctgctgctg ctgctgctgggggacccgggccggggggcggcctcgagcgggaacgcgaccgggcctggg cctcggagcgcgggcgggagcgcgaggaggagcgcggcggtgactggccctccgccgccg ctgagccactgcggccgggctgccccctgcgagccgctgcgctacaacgtgtgcctgggc tcggtgctgccctacggggccacctccacactgctggccggagactcggactcccaggag gaagcgcacggcaagctcgtgctctggtcgggcctccggaatgccccccgctgctgggca gtgatccagcccctgctgtgtgccgtatacatgcccaagtgtgagaatgaccgggtggag ctgcccagccgtaccctctgccaggccacccgaggcccctgtgccatcgtggagagggag cggggctggcctgacttcctgcgctgcactcctgaccgcttccctgaaggctgcacgaat gaggtgcagaacatcaagttcaacagttcaggccagtgcgaagtgcccttggttcggaca gacaaccccaagagctggtacgaggacgtggagggctgcggcatccagtgccagaacccg ctcttcacagaggctgagcaccaggacatgcacagctacatcgcggccttcggggccgtc acgggcctctgcacgctcttcaccctggccacattcgtggctgactggcggaactcgaat cgctaccctgctgttattctcttctacgtcaatgcgtgcttctttgtgggcagcattggc tggctggcccagttcatggatggtgcccgccgagagatcgtctgccgtgcagatggcacc atgaggcttggggagcccacctccaatgagactctgtcctgcgtcatcatctttgtcatc gtgtactacgccctgatggctggtgtggtttggtttgtggtcctcacctatgcctggcac acttccttcaaagccctgggcaccacctaccagcctctctcgggcaagacctcctacttc cacctgctcacctggtcactcccctttgtcctcactgtggcaatccttgctgtggcgcag gtggatggggactctgtgagtgggatttgttttgtgggctacaagaactaccgataccgt gcgggcttcgtgctggccccaatcggcctggtgctcatcgtgggaggctacttcctcatc cgaggagtcatgactctgttctccatcaagagcaaccaccccgggctgctgagtgagaag gctgccagcaagatcaacgagaccatgctgcgcctgggcatttttggcttcctggccttt ggctttgtgctcattaccttcagctgccacttctacgacttcttcaaccaggctgagtgg gagcgcagcttccgggactatgtgctatgtcaggccaatgtgaccatcgggctgcccacc aagcagcccatccctgactgtgagatcaagaatcgcccgagccttctggtggagaagatc aacctgtttgccatgtttggaactggcatcgccatgagcacctgggtctggaccaaggcc acgctgctcatctggaggcgtacctggtgcaggttgactgggcagagtgacgatgagcca aagcggatcaagaagagcaagatgattgccaaggccttctctaagcggcacgagctcctg cagaacccaggccaggagctgtccttcagcatgcacactgtgtcccacgacgggcccgtg gcgggcttggcctttgacctcaatgagccctcagctgatgtctcctctgcctgggcccag catgtcaccaagatggtggctcggagaggagccatactgccccaggatatttctgtcacc cctgtggcaactccagtgcccccagaggaacaagccaacctgtggctggttgaggcagag atctccccagagctgcagaagcgcctgggccggaagaagaagaggaggaagaggaagaag gaggtgtgcccgctggcgccgccccctgagcttcacccccctgcccctgcccccagtacc attcctcgactgcctcagctgccccggcagaaatgcctggtggctgcaggtgcctgggga gctggggactcttgccgacagggagcgtggaccctggtctccaacccattctgcccagag cccagtccccctcaggatccatttctgcccagtgcaccggcccccgtggcatgggctcat ggccgccgacagggcctggggcctattcactcccgcaccaacctgatggacacagaactc atggatgcagactcggacttctga >gi568815591f:129089152_129312448|GENSCAN_predicted_peptide_4|239_aa MGEGRGREVGGGADQEQELESEPVVAAEAVMSVQVVSAAAAAKVPEVELKDLSPSEAESQ LGLSTAAVGAMAPPAGGGDPEAPAPAAERPPVPGPGSGPAAALSPAAGKVPQASAMKRSD PHHQHQRHRDGGEALVSPDGTVTEAPRTVKKNGATLMIPYSLFSMEIIPASLNHSSYDVV GQAITTILAISITYSQLNLFNEDSPGLYLGSSFIKLHLGNILKAFFVVEIKSFVLAEFP >gi568815591f:129089152_129312448|GENSCAN_predicted_CDS_4|720_bp atgggggagggaagaggaagggaggtgggaggcggggccgaccaagagcaggagctggag tctgagccggtggttgcagcggaggcggtgatgtcggtgcaggttgtgtcagccgcggct gccgccaaggtgcctgaggtggagctgaaggacctgagcccctccgaggcggagtcgcaa ctaggactgagcacggccgccgtgggcgccatggcccccccggcgggcggtggagaccct gaggctccagctcccgccgcggagcggcccccggtccccggcccgggctcggggcccgcc gccgctctcagccccgccgccgggaaggtgcctcaggcgtcggccatgaagcggagcgac ccacatcaccagcaccagcggcaccgcgacggcggcgaggccctggtcagccccgacggc accgtcaccgaggcgccgcgcacagtcaagaagaatggagcaacccttatgataccctat agtctcttctccatggaaatcattcctgcttctcttaatcattcttcatatgatgtagtc ggccaggccattactaccatcctggctatcagcatcacttactctcagctcaatctcttc aacgaagatagccctgggctctacttgggttcctccttcattaaactgcaccttggaaat attctcaaggcattttttgttgtagaaatcaagtcatttgtgttggcagaatttccctga